Asha Khan
@ashakhan
Lead Data Engineer modernizing cloud lakehouses into trusted, governed data products for analytics and AI.
What I'm looking for
I’m a Lead Data Engineer and Data Architecture specialist with 10 years of experience designing, building, and scaling enterprise-grade data ecosystems. I focus on end-to-end cloud-native data engineering and lakehouse architecture across AWS, Azure, GCP, and Snowflake.
I bridge high-level architecture with hands-on delivery of high-performance batch and streaming platforms using Python, SQL, Spark, Databricks, Kafka, and Airflow. I’ve enabled analytics, machine learning, and Generative AI platforms by transforming raw data into trusted, business-ready assets—at petabyte scale.
My work consistently modernizes legacy platforms into Data Mesh and Medallion architectures while implementing enterprise governance, security, metadata management, lineage, and FinOps frameworks. I build data reliability engineering practices, including observability, data quality frameworks, SLA monitoring, and pipeline resiliency.
Across projects, I lead architectural decisions, code reviews, and performance tuning initiatives, partnering with executives, security, and product teams to align data architecture with business capabilities. I’m especially energized by building governed, cost-optimized, cloud platforms that keep analytics and AI teams productive.
Experience
Work history, roles, and key accomplishments
Senior Data Architect
ProCogia
Aug 2019 - Dec 2022 (3 years 4 months)
Led enterprise multi-cloud data architecture for analytics ecosystems across AWS, Azure, Snowflake, and GCP, defining ingestion, processing, storage, governance, security, and consumption layers. Built reference architectures for lakehouse, Data Mesh/Medallion, and event-driven real-time streaming using Kafka/Flink/Spark Streaming, including metadata, lineage, and FinOps-driven cost optimization.
Data Engineer
Enigma Technologies
Jun 2016 - Jul 2019 (3 years 1 month)
Developed scalable distributed data pipelines using Python, SQL, Spark, Airflow, and Hadoop ecosystem technologies, building enterprise data lakes and analytical warehouses. Implemented batch and streaming ingestion from databases, APIs, SaaS systems, and log sources, and delivered trusted models with validation, reconciliation, monitoring, and BI support.
Data Engineer
Datafold
Jan 2014 - May 2016 (2 years 4 months)
Built foundational ETL pipelines and enterprise data warehouse solutions using SQL, Python, and scripting frameworks, ingesting data from ERP/CRM platforms, flat files, and third-party sources. Created reporting datasets and analytical views while supporting dimensional modeling, metadata/documentation, scheduling, data quality checks, reconciliation, and production issue resolution.
Education
Degrees, certifications, and relevant coursework
Asha hasn't added their education
Don't worry, there are 90k+ talented remote workers on Himalayas
Tech stack
Software and tools used professionally
Azure Synapse
AWS Glue
Apache Flink
Superset
GitHub
GitLab
Kubernetes
Jenkins
GitHub Actions
GitLab CI
NumPy
Pandas
PySpark
dbt
MySQL
PostgreSQL
MongoDB
Cassandra
Hadoop
HBase
Gmail
Databricks
Neo4j
Redis
Terraform
Java
Perl
Julia
TensorFlow
PyTorch
MLflow
scikit-learn
Kubeflow
Kafka
Elasticsearch
Serverless
Airflow
Time Analytics
SQL
XGBoost
LightGBM
Dagster
Datafold
Delta Lake
Trino
Bash
Transform
Middleware
Bridge
Factory
Jan
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Asha?
You can contact Asha and 90k+ other talented remote workers on Himalayas.
Message AshaFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
