Nauman Khan
@naumankhan2
Senior Data Engineer and ETL/ELT Architect building scalable, AI-ready lakehouse platforms and low-latency streaming systems.
What I'm looking for
I’m a Senior Data Engineer with 8 years of experience building AI-ready data platforms, real-time streaming systems, and cloud-native ETL/ELT architectures across AI infrastructure, healthcare analytics, and financial data ecosystems.
I design scalable lakehouse platforms and distributed data pipelines using Apache Spark, Databricks, Kafka, Snowflake, and AWS—processing millions of daily events with high reliability and low latency.
In my current role, I’ve architected an enterprise lakehouse on AWS S3 and Delta Lake, delivered sub-second data with Kafka and Spark Structured Streaming (99.9% uptime), and migrated legacy ETL to cloud-native ELT, reducing processing time by 40% and improving data freshness SLAs. I also implemented Infrastructure-as-Code with Terraform and CI/CD via GitHub Actions, plus enterprise observability and quality frameworks that reduced downstream production incidents by 35%.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
Mage
Jan 2022 - Present (4 years 5 months)
Architected an AI-ready lakehouse on AWS using Databricks, Apache Spark, and Delta Lake to process 30M+ daily events with sub-second delivery and 99.9% uptime. Migrated legacy ETL to cloud-native ELT, cutting processing time by 40% and improving data freshness SLAs; implemented IaC/CI-CD and observability to reduce production incidents by 35%.
Built HIPAA-compliant healthcare data pipelines and integrated EHR/FHIR datasets into centralized healthcare lakehouse architectures. Designed Snowflake warehouse models and real-time ingestion workflows, improving Spark/SQL performance by 45% and implementing automated validation/monitoring to strengthen reliability and compliance reporting.
Designed and optimized large-scale financial ETL pipelines processing transaction and payment data for enterprise analytics workloads. Built real-time transaction ingestion for fraud detection/payment monitoring, improved query execution performance by 35% via indexing/partitioning/distributed optimizations, and supported event-driven analytics with Kafka and Spark Structured Streaming.
Education
Degrees, certifications, and relevant coursework
Nauman hasn't added their education
Don't worry, there are 90k+ talented remote workers on Himalayas
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Nauman?
You can contact Nauman and 90k+ other talented remote workers on Himalayas.
Message NaumanFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
