Janice Jafri
@janicejafri
Lead Data Engineer specializing in cloud-native platforms, streaming, and AI/ML pipelines.
What I'm looking for
I am a Lead Data Engineer with 10 years of experience building cloud-native platforms across AWS, Azure, and GCP, focused on reliable, compliant data solutions that enable AI/ML at scale.
I design and implement lakehouse and Medallion architectures and build batch and real-time pipelines with Databricks, Snowflake, Spark, Kafka, dbt, and Airflow, delivering multi-million-dollar cost savings and performance improvements.
I prioritize governance, observability, and secure data practices (HIPAA/GDPR/CCPA), mentor engineering teams, and drive CI/CD and MLOps to accelerate delivery and operational reliability.
Experience
Work history, roles, and key accomplishments
Lead Data Engineer
Analytics8
Jan 2023 - Present (3 years 1 month)
Architected a multi-cloud lakehouse and Medallion pipelines (Databricks, Delta Lake, Snowflake), reducing compute costs by 30% and enabling AI/ML workflows; mentored 8 engineers and improved release velocity by 50%.
Sr. Cloud Data Engineer
Vectorsoft
Jan 2020 - Dec 2022 (2 years 11 months)
Designed Snowflake and Azure Synapse pipelines processing 6TB+ of healthcare data daily; cut query runtimes by 40% and reduced SLA breaches by 30% through scalable ELT workflows and governance.
Developed Spark and Kafka streaming pipelines handling 5TB+ daily; improved performance by 45% and reduced infrastructure cost by 25% by scaling AWS EMR Hadoop/Hive clusters.
Built ETL workflows in Informatica and SQL Server loading 5TB+ of supply chain data; automated 50+ ingestion pipelines, reducing manual effort by 40%, and delivered 20+ Power BI dashboards.
Education
Degrees, certifications, and relevant coursework
COMSATS University
Bachelor of Science, Computer Science
Tech stack
Software and tools used professionally
Fivetran
Matillion
Azure Synapse
Apache Spark
AWS Glue
Talend
GitHub
Kubernetes
Jenkins
GitHub Actions
NumPy
Pandas
PySpark
dbt
Hadoop
Gmail
Databricks
Terraform
Azure DevOps
Java
MLflow
Kubeflow
Kafka
Grafana
Prometheus
Milvus
Avro
Ansible
Vercel
Redpanda
Airflow
SQL
Dagster
LangChain
LlamaIndex
Weaviate
Pinecone
Atlan
WhyLabs
Monte Carlo
Tecton
Feast
Cube.js
DataHub
Delta Lake
OpenMetadata
Apache Hudi
Collibra
Bicep
Bash
Faiss
Microsoft Fabric
Factory
Website
project02-lemon.vercel.app
