Sadaf Rash
@sadafrash
I am a Lead Data Engineer specializing in cloud data platforms, MLOps, and scalable data infrastructure.
What I'm looking for
I am a Lead Data Engineer with 12+ years building scalable cloud lakehouse platforms and data infrastructure across AWS, Azure, and GCP. I led migrations to Snowflake and BigQuery, drove a platform overhaul that cut pipeline costs 25%, and built real-time ingestion pipelines reducing latency by 60%.
I design end-to-end MLOps/LLMOps systems—feature stores, CI/CD, observability, and governance—mentoring teams to deliver compliant, production-grade AI solutions. I champion platform modernization, reverse ETL, and secure data practices to enable actionable analytics and faster product delivery.
Experience
Work history, roles, and key accomplishments
Lead Data Engineer
Milestone Systems
Mar 2021 - Present (4 years 5 months)
Directed full-stack lakehouse environments with Databricks and Delta Live Tables and led cloud migrations to Snowflake and BigQuery, reducing costs 25% and improving scalability. Pioneered real-time analytics with Kinesis, Spark Structured Streaming and Flink, established CI/CD via Argo Workflows, and implemented RBAC and encryption-at-rest for secure data access.
Cloud Data Engineer
Apporto
Feb 2016 - Feb 2021 (5 years)
Built serverless ingestion and feature pipelines using PySpark, Delta Lake and Databricks, delivering real-time ingestion that reduced data latency 60% and improved reliability to 99.99%. Automated deployments with GitLab CI, Kubernetes and Terraform, created embedding repositories (Pinecone/Weaviate) with LangChain, and implemented monitoring for drift and latency.
Data Infrastructure Engineer
Clover Health
Jan 2013 - Jan 2016 (3 years)
Devised distributed storage fabrics with Hadoop/HDFS and S3 to support petabyte workloads and provisioned clusters with Kubernetes and autoscaling to guarantee uptime. Implemented high-throughput messaging with Kafka and Flink, enabled schema evolution with Iceberg/Hudi, and improved query performance using Trino and Redshift Spectrum while enforcing SOC2 and HIPAA controls.
Education
Degrees, certifications, and relevant coursework
COMSATS University Islamabad
Bachelor of Science, Computer Science
2008 - 2012
Completed a Bachelor of Science in Computer Science at COMSATS University Islamabad from September 2008 to October 2012.
Tech stack
Software and tools used professionally
Fivetran
Azure Synapse
Apache Spark
Superset
GitHub
GitLab
Kubernetes
GitHub Actions
GitLab CI
Jupyter
NumPy
Pandas
PySpark
dbt
Hadoop
Gmail
Databricks
Terraform
Jira
Loki
TensorFlow
PyTorch
Keras
Streamlit
Kafka
FastAPI
Grafana
Prometheus
OpenTelemetry
Datadog
OpenSearch
Serverless
Airflow
Time Analytics
SQL
LangChain
Weaviate
ChromaDB
Polars
Pinecone
DataHub
Delta Lake
Arize AI
Great Expectations
Availability
Location
Authorized to work in
Portfolio
github.com/sadaf-rashJob categories
Skills
Interested in hiring Sadaf?
You can contact Sadaf and 90k+ other talented remote workers on Himalayas.
Message SadafFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
