Hamza Ali
@hamzaali18
Senior Data Engineer specializing in scalable lakehouse pipelines, cloud data platforms, and cost-efficient analytics.
What I'm looking for
I’m a Senior Data Engineer with 7+ years of hands-on experience designing, building, and optimizing large-scale data pipelines, cloud data platforms, and analytics infrastructure across diverse industries. I’m known for translating complex business requirements into robust, scalable, and cost-efficient data systems.
In my current role, I architected and implemented a multi-layer Databricks lakehouse on Delta Lake using the Medallion Architecture, reducing query latency by 40% and cutting storage costs by 30%. I’ve delivered end-to-end ELT pipelines using dbt and Snowflake for 200+ business users, and built automated ingestion with Snowpipe and Snowflake Streams/Tasks to enable near-real-time CDC reporting.
I also lead cloud infrastructure provisioning with Terraform across AWS and Azure, orchestrate multi-stage workflows processing 5TB+ daily, and partner closely with data science teams on feature engineering pipelines for ML training and inference at scale. I’m equally focused on reliability: I established data quality frameworks with Great Expectations to achieve a 99.5% data reliability SLA, and I’ve mentored a team of 3 junior data engineers through code reviews and best practices.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
People Inc.
Jan 2022 - Present (4 years 4 months)
Architected a Databricks lakehouse with Delta Lake and Medallion architecture, reducing query latency by 40% and cutting storage costs by 30%. Built production ELT and near-real-time ingestion using dbt with Snowflake plus Snowpipe/Streams/Tasks, and implemented data quality in CI/CD to achieve a 99.5% reliability SLA.
Data Engineer
Datasap Inc.
Mar 2019 - Dec 2021 (2 years 9 months)
Built Azure-native data pipelines with Azure Data Factory and Azure Databricks, migrating workloads for 3 enterprise clients and reducing operational costs by 35%. Implemented Snowflake with automated Snowpipe loading to cut reporting time from 4 hours to under 15 minutes, and delivered near-real-time streaming analytics using Kafka and PySpark.
Junior Data Engineer
Infosys
Jun 2017 - Feb 2019 (1 year 8 months)
Developed and maintained Python/SQL ETL pipelines ingesting data from REST APIs, flat files, and relational databases into centralized data warehouses. Improved report generation performance by 25% through PostgreSQL/MySQL query and stored-procedure optimization, and supported migration to AWS RDS by handling validation and post-migration tuning.
Education
Degrees, certifications, and relevant coursework
Hamza hasn't added their education
Tech stack
Software and tools used professionally
Fivetran
Azure Synapse
Apache Spark
AWS Glue
Talend
Microsoft Azure
Amazon S3
GitHub
GitHub Actions
NumPy
Pandas
PySpark
dbt
MySQL
PostgreSQL
MongoDB
Gmail
Databricks
Terraform
AWS CloudFormation
Azure DevOps
MLflow
Kafka
Azure Monitor
Airflow
Time Analytics
Amazon Web Services (AWS)
SQL
Apache Iceberg
Delta Lake
Great Expectations
Collibra
Bash
Transform
Unity Catalog
Factory
