Ali Bangash
@alibangash1
Lead Data Engineer building cloud-native Lakehouse platforms that deliver reliable, cost-efficient analytics.
What I'm looking for
I am a hands-on Lead Data Engineer and Data Architect with 8+ years building cloud-native data platforms across AWS, Azure, and GCP. I specialize in Lakehouse/Delta architectures, streaming and batch pipelines, and analytics layers that are observable, governed, and cost-efficient.
I have consolidated 100+ TB into a Databricks Lakehouse, reduced end-to-end latency from 24 hours to under 2 hours, and drove approximately 30% cost savings through right-sizing and automation.
I design and productionize pipelines using Airflow, Databricks, Spark, Kafka, ADF, and dbt, and build warehouses in Snowflake, Redshift, and BigQuery. I implement automated data quality with Great Expectations, enforce data contracts, and establish CI/CD, runbooks, and observability to improve reliability. I partner closely with product, data science, and BI to operationalize models and accelerate insights.
I enjoy mentoring engineers, improving onboarding velocity, and delivering pragmatic, scalable solutions that balance governance, performance, and cost.
Experience
Work history, roles, and key accomplishments
Lead Data Engineer
Avanade
Feb 2022 - Present (3 years 6 months)
Architected an AWS S3 + Databricks Lakehouse centralizing 100+ TB and productionized 150+ Airflow DAGs/PySpark jobs, cutting batch runtimes ~40% and reducing end-to-end latency from 24h to under 2 hours.
Senior Data Engineer
Themesoft Inc.
Jul 2019 - Feb 2022 (2 years 7 months)
Built a Snowflake warehouse with dbt that improved KPI query performance ~50% and developed near real-time Kafka + Spark Streaming pipelines; modeled customer relationships in Neo4j to enable ~15% churn improvement.
Consolidated sales and marketing data into Amazon Redshift and introduced Airflow to orchestrate 60+ daily jobs, achieving 99.5% on-time execution; automated infra with Terraform and Docker, reducing environment setup time ~70% and passing audits with zero findings.
Data Engineer
ApTask
Mar 2016 - Aug 2017 (1 year 5 months)
Built and maintained Informatica PowerCenter mappings integrating Oracle/MySQL into an enterprise DWH, speeding key reports ~20% and authoring Python utilities that reduced data errors ~30%; developed CDC jobs to keep marts current while minimizing load windows.
Education
Degrees, certifications, and relevant coursework
Ali hasn't added their education
Don't worry, there are 90k+ talented remote workers on Himalayas
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Ali?
You can contact Ali and 90k+ other talented remote workers on Himalayas.
Message AliFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
