Azran Mehrosh
@azranmehrosh
Senior Data Engineer building scalable lakehouse platforms, streaming pipelines, and cost-efficient analytics.
What I'm looking for
I’m a Senior Data Engineer with 10 years of experience designing scalable data platforms, streaming pipelines, and cloud-based analytics solutions across healthcare, SaaS, and enterprise environments. I build lakehouse architectures with Databricks, Delta Lake, and Spark Structured Streaming, focusing on performance optimization, reliability, and cost efficiency.
Across my roles, I’ve led ingestion and transformation at massive scale, implemented CI/CD automation with Terraform and GitHub Actions, and delivered real-time processing using Kafka and debuggable orchestration. I also support AI-driven data workflows—vector search, RAG pipelines, semantic retrieval systems, and LLM-ready architectures—while strengthening governance, data quality, and observability through frameworks like Great Expectations and Unity Catalog.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
Oznolo
Apr 2022 - Present (4 years 2 months)
Led enterprise data platform initiatives supporting analytics and AI workloads, building a Databricks lakehouse with Delta Lake and Structured Streaming. Scaled ingestion to handle 850M+ daily records with near real-time processing and improved compute cost efficiency through Spark/Delta performance optimizations.
Senior Data Engineer
HealthBridge Analytics
Jan 2019 - Mar 2022 (3 years 2 months)
Built HIPAA-compliant healthcare data pipelines using Spark, Hive, Airflow, and AWS to support patient analytics and operational reporting. Migrated reporting from on-prem to AWS Redshift and S3 and improved reliability with reconciliation, observability, and SLA-based alerting.
Data Engineer
Nexora Data Systems
Aug 2016 - Dec 2018 (2 years 4 months)
Developed scalable ETL and streaming pipelines using Spark, Kafka, Python, and AWS, including metadata-driven ingestion for analytics and machine learning workflows. Implemented Kafka + Debezium CDC, replaced legacy scheduling with Airflow, and optimized Spark jobs and storage to improve pipeline and query performance.
Education
Degrees, certifications, and relevant coursework
MacMurray College
Bachelor of Science, Computer Science
Earned a Bachelor of Science in Computer Science from MacMurray College in 2016.
Tech stack
Software and tools used professionally
Apache Spark
Apache Flink
Apache Hive
GitHub
Kubernetes
Jenkins
GitHub Actions
PySpark
Debezium
dbt
PostgreSQL
Gmail
Databricks
Terraform
MLflow
Kafka
OpenSearch
Protobuf
Airflow
Toolkit
Apache Ranger
SQL
Clickhouse
Dagster
Apache Iceberg
LangChain
Datafold
Pinecone
Monte Carlo
Delta Lake
OpenAI API
Great Expectations
OpenMetadata
Bash
pgvector
Unity Catalog
Availability
Location
Authorized to work in
Social media
Job categories
Skills
Interested in hiring Azran?
You can contact Azran and 90k+ other talented remote workers on Himalayas.
Message AzranFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
