Open to opportunities

hazzy M

@hazzym1

Lead Data Engineer specializing in Databricks Lakehouse, Spark, and streaming platforms.

United States

What I'm looking for

I am looking for a data engineering leadership role where I can build secure, scalable Lakehouse platforms, mentor teams, and enable ML and analytics while keeping cloud costs under control.

I am a Lead Data Engineer with 9+ years designing and delivering scalable data platforms for analytics, real-time processing, and machine learning. I specialize in Databricks Lakehouse architecture, Apache Spark, Delta Lake, and Kafka-based streaming.

I've led end-to-end data engineering efforts, implemented Bronze/Silver/Gold medallion architectures, and delivered ML-ready datasets and feature engineering for data science teams. My hands-on cloud experience spans AWS, Azure, and GCP, as well as modern warehouses such as Snowflake and BigQuery.

I have deep experience in data governance, security, and compliance (particularly HIPAA and PHI/PII handling), implementing lineage, RBAC/ABAC, and data quality frameworks. I also focus on performance tuning and cloud cost optimization for production workloads.

I mentor engineers, run architecture reviews, and champion best practices to improve reliability and scalability. I deliver pragmatic, secure, and cost-effective data solutions that accelerate analytics and ML outcomes.

Experience

Work history, roles, and key accomplishments

Current

Lead Data Engineer

Domino Data Lab

Sep 2024 - Present (1 year 10 months)

Led the design and implementation of a Databricks Lakehouse platform supporting analytics and ML workloads. Built Spark and Delta Lake pipelines and Kafka ingestion to enable near-real-time insights and reduce infrastructure costs. Mentored engineers and implemented enterprise data governance, lineage, and access controls to ensure security and compliance.

Databricks, Apache Spark, Delta Lake, Kafka, Python, AWS, Data Governance

Senior Data Engineer

Datica

May 2021 - Aug 2024 (3 years 3 months)

Engineered and maintained HIPAA-compliant healthcare data platforms, migrated legacy ETL to Azure Databricks, and implemented a Bronze/Silver/Gold medallion architecture with real-time Spark streaming and Snowflake integration for analytics. Implemented data quality frameworks and compliance controls for PHI processing.

Databricks, Spark, Delta Lake, Snowflake, Azure Data Factory, Python, SQL, Data Quality, HIPAA

Data Engineer

Analytics8

Feb 2017 - Apr 2021 (4 years 2 months)

Delivered end-to-end data engineering solutions for enterprise clients. Built scalable PySpark ETL/ELT pipelines, designed dimensional data models and data marts, and automated orchestration with Airflow to improve development efficiency and pipeline reliability.