Ehmad Aslam

EA

Open to opportunities

Ehmad Aslam

@ehmadaslam

Senior data engineer with 7+ years building production ELT, streaming, and cloud-native analytics platforms at scale.

What I'm looking for

I want to build clean, maintainable data infrastructure—high-throughput ELT, real-time streaming, and cloud-native warehouses—where data quality, observability, and performance are valued, and I can mentor others while delivering measurable business impact.

I’m a results-driven Data Engineer with 7+ years of experience building and scaling production-grade data platforms across ecommerce and logistics SaaS. I bring deep expertise in Python, SQL, dbt, Databricks, and PySpark, and I’m comfortable shaping end-to-end data systems that reliably power analytics outcomes.

At Reveel Group, I architected and maintained end-to-end ELT pipelines ingesting 100M+ parcel invoice records per month from UPS, FedEx, and DHL carrier APIs and EDI feeds. I migrated legacy monolithic SQL transformations to dbt, delivering 200+ modular, versioned, and tested data models on top of Delta Lake—reducing pipeline failures by 40% and cutting feature development cycle time by 30%. I also built high-throughput PySpark jobs on Databricks for multi-carrier contract data and engineered the Finance Automation pipeline that eliminated ~15 hours/week of manual reconciliation.

I’ve also built the real-time and warehousing backbone that makes operational and analytics teams faster. As a Junior Data Engineer at Spreetail, I developed Kafka-based real-time streaming pipelines across 20+ marketplace platforms, and I designed Snowflake data warehouse models to consolidate order, inventory, and fulfillment data. I orchestrated pipelines with Dagster for lineage and observability, and used Airbyte connectors to standardize ingestion into AWS S3—significantly reducing time-to-integration for new sources.

I’m especially proud of the engineering rigor behind the work: I implement layered data quality frameworks (dbt tests and Great Expectations), enforce data contracts and pipeline SLAs, and optimize Spark performance through partition tuning, caching, and cluster autoscaling. I mentor junior engineers and interns on Databricks workflows, dbt best practices, and PySpark development—and I’m certified in Databricks and dbt Analytics Engineering.

Experience

Work history, roles, and key accomplishments

RG

Current

Data Engineer

Current

Reveel Group

Jul 2019 - Present (7 years)

Architected end-to-end ELT pipelines ingesting 100M+ parcel invoice records per month from UPS, FedEx, and DHL APIs/EDI into Databricks Delta Lake, enabling automated audit and recovery workflows. Migrated legacy transformations to dbt (200+ modular models), reducing pipeline failures by 40% and cutting feature cycle time by 30%, while improving data freshness to sub-2 hours and reducing productio

Great Expectations Python SQL DBT Databricks Pyspark Delta Lake AWS Glue AWS S3 Apache Spark

SP

Junior Data Engineer

Spreetail

Jan 2017 - Jun 2019 (2 years 5 months)

Built Kafka-based real-time streaming pipelines ingesting marketplace order and inventory events from 20+ platforms to support same-day fulfillment SLA tracking and alerting. Implemented Snowflake warehouse models and dbt transformations, orchestrated workflows with Dagster, and used Airbyte to extract from 10+ APIs into AWS S3, accelerating integration for new data sources.

Kafka Python Snowflake DBT Dagster Airbyte AWS S3 Data Modeling Event Driven Architecture REST APIs

Education

Degrees, certifications, and relevant coursework

UN

University of Science and Technology (NUST)

Bachelor of Science, Computer Science

2012 - 2016

Earned a Bachelor of Science in Computer Science from the University of Science and Technology (NUST) from 2012 to 2016.

Tech stack

Software and tools used professionally

Amazon Redshift

Airbyte

Apache Spark

AWS Glue

AWS IAM

GitHub

GitHub Actions

Pandas

PySpark

dbt

PostgreSQL

Gmail

Databricks

Kafka

AWS Lambda

Airflow

SQL

Dagster

Delta Lake

Great Expectations

Bash

Jan

Availability

Open to opportunities

Location

United States

Authorized to work in

Job categories

Data Engineer Data Engineer Data Platform Engineer Data Warehouse Engineer Streaming Data Engineer Data Engineering Data Engineering Positions Cloud Data Engineer DataOps Engineer

Interested in hiring Ehmad?

You can contact Ehmad and 90k+ other talented remote workers on Himalayas.

People also viewed

View all talent

Get matched with your dream remote job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!