Himalayas
@sadafrash
I am a Lead Data Engineer specializing in cloud data platforms, MLOps, and scalable data infrastructure.
I am a Lead Data Engineer with 12+ years of experience building scalable cloud lakehouse platforms and data infrastructure across AWS, Azure, and GCP. I led migrations to Snowflake and BigQuery, drove a platform overhaul that cut pipeline costs by 25%, and built real-time ingestion pipelines that reduced latency by 60%.
I design end-to-end MLOps/LLMOps systems (feature stores, CI/CD, observability, and governance) and mentor teams to deliver compliant, production-grade AI solutions. I champion platform modernization, reverse ETL, and secure data practices to enable actionable analytics and faster product delivery.
Work history, roles, and key accomplishments
Milestone Systems
Mar 2021 - Present (4 years 8 months)
Directed full-stack lakehouse environments with Databricks and Delta Live Tables and led cloud migrations to Snowflake and BigQuery, reducing costs by 25% and improving scalability. Pioneered real-time analytics with Kinesis, Spark Structured Streaming, and Flink; established CI/CD via Argo Workflows; and implemented RBAC and encryption at rest for secure data access.
Apporto
Feb 2016 - Feb 2021 (5 years)
Built serverless ingestion and feature pipelines using PySpark, Delta Lake, and Databricks, delivering real-time ingestion that reduced data latency by 60% and improved reliability to 99.99%. Automated deployments with GitLab CI, Kubernetes, and Terraform; created embedding repositories (Pinecone/Weaviate) with LangChain; and implemented monitoring for drift and latency.
Clover Health
Jan 2013 - Jan 2016 (3 years)
Devised distributed storage fabrics with Hadoop/HDFS and S3 to support petabyte workloads, and provisioned clusters with Kubernetes and autoscaling to guarantee uptime. Implemented high-throughput messaging with Kafka and Flink, enabled schema evolution with Iceberg/Hudi, and improved query performance using Trino and Redshift Spectrum while enforcing SOC 2 and HIPAA controls.
Degrees, certifications, and relevant coursework
Bachelor of Science, Computer Science
2008 - 2012
Completed a Bachelor of Science in Computer Science at COMSATS University Islamabad from September 2008 to October 2012.
Software and tools used professionally
Fivetran
Azure Synapse
Apache Spark
Superset
GitHub
GitLab
Kubernetes
GitHub Actions
GitLab CI
Jupyter
NumPy
Pandas
PySpark
dbt
Hadoop
Gmail
Databricks
Terraform
Jira
Loki
TensorFlow
PyTorch
Keras
Streamlit
Kafka
FastAPI
Grafana
Prometheus
OpenTelemetry
Datadog
OpenSearch
Serverless
Airflow
Time Analytics
SQL
LangChain
Weaviate
ChromaDB
Polars
Pinecone
DataHub
Delta Lake
Arize AI
Great Expectations