hazzy M
@hazzym1
Lead Data Engineer specializing in Databricks Lakehouse, Spark, and streaming platforms.
What I'm looking for
I am a Lead Data Engineer with 9+ years designing and delivering scalable data platforms for analytics, real-time processing, and machine learning. I specialize in Databricks Lakehouse architecture, Apache Spark, Delta Lake, and Kafka-based streaming.
I've led end-to-end data engineering efforts, implemented Bronze/Silver/Gold medallion architectures, and enabled ML-ready datasets and feature engineering for data science teams. My hands-on cloud experience spans AWS, Azure, and GCP, as well as modern warehouses such as Snowflake and BigQuery.
I have deep experience in data governance, security, and compliance, particularly HIPAA and PHI/PII handling, and have implemented lineage, RBAC/ABAC, and data quality frameworks. I also focus on performance tuning and cloud cost optimization for production workloads.
I mentor engineers, run architecture reviews, and champion best practices to improve reliability and scalability. I deliver pragmatic, secure, and cost-effective data solutions that accelerate analytics and ML outcomes.
Experience
Work history, roles, and key accomplishments
Lead Data Engineer
Domino Data Lab
Sep 2024 - Present (1 year 5 months)
Led design and implementation of a Databricks Lakehouse platform supporting analytics and ML workloads. Built Spark and Delta Lake pipelines and Kafka ingestion to enable near-real-time insights and reduce infrastructure costs. Mentored engineers and implemented enterprise data governance, lineage, and access controls to ensure security and compliance.
Senior Data Engineer
Datica
May 2021 - Aug 2024 (3 years 3 months)
Engineered and maintained HIPAA-compliant healthcare data platforms, migrated legacy ETL to Azure Databricks, and implemented Bronze/Silver/Gold medallion architecture with real-time Spark streaming and Snowflake integration for analytics. Implemented data quality frameworks and compliance controls for PHI processing.
Data Engineer
Analytics8
Feb 2017 - Apr 2021 (4 years 2 months)
Delivered end-to-end data engineering solutions for enterprise clients, built scalable PySpark ETL/ELT pipelines, designed dimensional data models and data marts, and automated orchestration with Airflow to improve development efficiency and pipeline reliability.
Education
Degrees, certifications, and relevant coursework
Unknown Institution
Bachelor of Science, Computer Science
Bachelor of Science in Computer Science; coursework and completion details not provided.
