Open to opportunities

Ehmad Aslam

@ehmadaslam

Senior data engineer with 7+ years building production ELT, streaming, and cloud-native analytics platforms at scale.

United States

What I'm looking for

I want to build clean, maintainable data infrastructure (high-throughput ELT, real-time streaming, and cloud-native warehouses) on a team that values data quality, observability, and performance, where I can mentor others while delivering measurable business impact.

I’m a results-driven Data Engineer with 7+ years of experience building and scaling production-grade data platforms across ecommerce and logistics SaaS. I bring deep expertise in Python, SQL, dbt, Databricks, and PySpark, and I’m comfortable shaping end-to-end data systems that reliably power analytics outcomes.

At Reveel Group, I architected and maintained end-to-end ELT pipelines ingesting 100M+ parcel invoice records per month from UPS, FedEx, and DHL carrier APIs and EDI feeds. I migrated legacy monolithic SQL transformations to dbt, delivering 200+ modular, versioned, and tested data models on top of Delta Lake. The migration reduced pipeline failures by 40% and cut feature development cycle time by 30%. I also built high-throughput PySpark jobs on Databricks for multi-carrier contract data and engineered the Finance Automation pipeline that eliminated ~15 hours/week of manual reconciliation.

I’ve also built the real-time and warehousing backbone that makes operational and analytics teams faster. As a Junior Data Engineer at Spreetail, I developed Kafka-based real-time streaming pipelines across 20+ marketplace platforms, and I designed Snowflake data warehouse models to consolidate order, inventory, and fulfillment data. I orchestrated pipelines with Dagster for lineage and observability, and used Airbyte connectors to standardize ingestion into AWS S3, which reduced time-to-integration for new sources.

I’m especially proud of the engineering rigor behind the work: I implement layered data quality frameworks (dbt tests and Great Expectations), enforce data contracts and pipeline SLAs, and optimize Spark performance through partition tuning, caching, and cluster autoscaling. I mentor junior engineers and interns on Databricks workflows, dbt best practices, and PySpark development, and I’m certified in Databricks and dbt Analytics Engineering.

Experience

Work history, roles, and key accomplishments

Current

Data Engineer

Reveel Group

Jul 2019 - Present (6 years 10 months)

Architected end-to-end ELT pipelines ingesting 100M+ parcel invoice records per month from UPS, FedEx, and DHL APIs/EDI into Databricks Delta Lake, enabling automated audit and recovery workflows. Migrated legacy transformations to dbt (200+ modular models), reducing pipeline failures by 40% and cutting feature cycle time by 30%, while improving data freshness to sub-2 hours and reducing productio

Junior Data Engineer

Spreetail

Jan 2017 - Jun 2019 (2 years 5 months)

Built Kafka-based real-time streaming pipelines ingesting marketplace order and inventory events from 20+ platforms to support same-day fulfillment SLA tracking and alerting. Implemented Snowflake warehouse models and dbt transformations, orchestrated workflows with Dagster, and used Airbyte to extract from 10+ APIs into AWS S3, accelerating integration for new data sources.

Education

Degrees, certifications, and relevant coursework

University of Science and Technology (NUST)

Bachelor of Science, Computer Science

2012 - 2016

Earned a Bachelor of Science in Computer Science at the University of Science and Technology (NUST), 2012 to 2016.
