Skip to main content
Nauman KhanNK
Open to opportunities

Nauman Khan

@naumankhan2

Senior Data Engineer and ETL/ELT Architect building scalable, AI-ready lakehouse platforms and low-latency streaming systems.

United States
Message

What I'm looking for

I’m looking to build AI-ready lakehouse and real-time streaming platforms—focused on reliability, observability, and cloud cost optimization, with strong data governance and compliance for production-grade analytics.

I’m a Senior Data Engineer with 8 years of experience building AI-ready data platforms, real-time streaming systems, and cloud-native ETL/ELT architectures across AI infrastructure, healthcare analytics, and financial data ecosystems.

I design scalable lakehouse platforms and distributed data pipelines using Apache Spark, Databricks, Kafka, Snowflake, and AWS—processing millions of daily events with high reliability and low latency.

In my current role, I’ve architected an enterprise lakehouse on AWS S3 and Delta Lake, delivered sub-second data with Kafka and Spark Structured Streaming (99.9% uptime), and migrated legacy ETL to cloud-native ELT, reducing processing time by 40% and improving data freshness SLAs. I also implemented Infrastructure-as-Code with Terraform and CI/CD via GitHub Actions, plus enterprise observability and quality frameworks that reduced downstream production incidents by 35%.

Experience

Work history, roles, and key accomplishments

Mage logoMA
Current

Senior Data Engineer

Mage

Jan 2022 - Present (4 years 5 months)

Architected an AI-ready lakehouse on AWS using Databricks, Apache Spark, and Delta Lake to process 30M+ daily events with sub-second delivery and 99.9% uptime. Migrated legacy ETL to cloud-native ELT, cutting processing time by 40% and improving data freshness SLAs; implemented IaC/CI-CD and observability to reduce production incidents by 35%.

Innovaccer logoIN

Data Engineer

Jun 2019 - Dec 2021 (2 years 6 months)

Built HIPAA-compliant healthcare data pipelines and integrated EHR/FHIR datasets into centralized healthcare lakehouse architectures. Designed Snowflake warehouse models and real-time ingestion workflows, improving Spark/SQL performance by 45% and implementing automated validation/monitoring to strengthen reliability and compliance reporting.

Prophecy.io logoPR

ETL Developer

Jul 2017 - May 2019 (1 year 10 months)

Designed and optimized large-scale financial ETL pipelines processing transaction and payment data for enterprise analytics workloads. Built real-time transaction ingestion for fraud detection/payment monitoring, improved query execution performance by 35% via indexing/partitioning/distributed optimizations, and supported event-driven analytics with Kafka and Spark Structured Streaming.

Education

Degrees, certifications, and relevant coursework

Nauman hasn't added their education

Don't worry, there are 90k+ talented remote workers on Himalayas

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan