Shoaib Mohammed
@shoaibmohammed
Lead Data Engineer and Cloud Data Architect specializing in lakehouse, streaming, and healthcare data solutions.
What I'm looking for
I am a Lead Data Engineer and Cloud Data Architect with 11+ years building cloud-native lakehouse platforms, real-time streaming ecosystems, and compliance-ready healthcare data solutions across Azure, AWS, and GCP. I design and deliver ETL/ELT pipelines, Medallion architectures, and AI/BI-ready datasets using Databricks, Snowflake, Spark, Kafka, and orchestration tools to accelerate analytics and ML adoption.
I have led data modernization efforts, embedded CI/CD and observability, and implemented governance frameworks (Collibra, Great Expectations, Unity Catalog) to achieve HIPAA and HITRUST alignment. I mentor teams, optimize cost and performance, and partner with clinical and data science stakeholders to productionize predictive models and drive measurable business and clinical impact.
Experience
Work history, roles, and key accomplishments
Lead Data Engineer
Hummingbird
Jul 2022 - Present (3 years 4 months)
Architected and delivered a cloud-native healthcare lakehouse on Azure/Databricks, engineered real-time Kafka/Flink pipelines for HL7/FHIR and device telemetry, and reduced storage/compute costs by 30% while ensuring HIPAA/HITRUST-aligned governance.
Senior Data Engineer
RTS Labs
Apr 2019 - Jun 2022 (3 years 2 months)
Designed and maintained Spark-based hybrid-cloud pipelines and lakehouse architectures (Snowflake/Delta), automated ETL with Airflow/dbt, and optimized Spark/SQL workloads to improve processing performance by 40%.
Big Data Engineer
Zencore
Mar 2016 - Mar 2019 (3 years)
Built real-time batch and streaming pipelines using Spark, Kafka, and AWS Glue, implemented a Delta Lake architecture and validation frameworks that boosted query performance by 40% and improved data quality for analytics and ML.
Data Engineer
ProCogia
May 2014 - Feb 2016 (1 year 9 months)
Developed ETL pipelines integrating EHR, lab, and billing data into centralized warehouses, designed dimensional models and connectors for HL7/FHIR, and automated validation/scheduling to reduce manual work by 60%.
Education
Degrees, certifications, and relevant coursework
The California State University
Bachelor of Science, Computer Sciences
Completed a Bachelor of Science in Computer Sciences focusing on core computing principles and practical data engineering skills.
Tech stack
Software and tools used professionally
Amazon Redshift
Matillion
Azure Synapse
Apache Spark
AWS Glue
Apache Flink
Talend
GitHub
Kubernetes
Jenkins
GitHub Actions
PySpark
dbt
DB
MySQL
PostgreSQL
MongoDB
Cassandra
Gmail
Databricks
Neo4j
Terraform
Pulumi
Azure DevOps
Java
TensorFlow
MLflow
scikit-learn
Kafka
Apache NiFi
Grafana
Prometheus
Datadog
GraphQL
Elasticsearch
Avro
Kafka Streams
Airflow
Apache Beam
Time Analytics
Google BigQuery
SQL
Apache Iceberg
Monte Carlo
Delta Lake
Great Expectations
ArgoCD
Trino
Apache Hudi
Collibra
Cosmos
Bash
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Shoaib?
You can contact Shoaib and 90k+ other talented remote workers on Himalayas.
Message ShoaibFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
