Deepak Dulal
@deepakdulal
Senior Data Engineer/Scientist driving scalable cloud-native data and ML solutions.
What I'm looking for
I am a Senior Data Engineer/Scientist with six years of experience building scalable, cloud-native data and ML solutions across AWS, Azure, and GCP. I design end-to-end ETL pipelines, real-time streaming architectures, and CI/CD frameworks to deliver production-grade analytics.
I have engineered real-time streaming systems that reduced fraud detection and monitoring latency to sub-second levels, modernized mainframe-to-cloud migrations to Snowflake, improved forecasting accuracy by up to 30%, and cut ETL runtimes by as much as 50% while optimizing query performance by up to 45%.
I partner with clinicians, operations, and business stakeholders to translate data into actionable insights, mentor teams on big-data best practices, and maintain reproducible, auditable ML lifecycles to ensure compliant, high-impact deployments.
Experience
Work history, roles, and key accomplishments
Senior Data Scientist/Engineer
Athenahealth Inc
Dec 2023 - Present (1 year 9 months)
Engineered real-time streaming architectures using Spark Streaming and Kafka, reducing fraud detection and anomaly monitoring latency to sub-second levels. Optimized Redshift and Snowflake models to increase query speeds up to 45% and automated ML retraining pipelines to cut manual intervention by 70%.
Data Engineer
American Airlines
Nov 2020 - Nov 2023 (3 years)
Built streaming pipelines with Spark Streaming and Kafka on GCP Pub/Sub and AWS MSK to enable sub-second flight delay prediction and baggage tracking. Optimized Redshift and BigQuery data models to boost query performance up to 45% and migrated MapReduce workflows to Spark, reducing processing time by 55%.
ETL Developer
Procter & Gamble
Jan 2019 - Oct 2020 (1 year 9 months)
Engineered reusable Spark ingestion wrappers and tuned jobs to cut onboarding time for new datasets by 30% and reduce ETL runtimes. Migrated enterprise datasets to Azure Data Lake and implemented CI pipelines to improve deployment reliability and SLA compliance.
Education
Degrees, certifications, and relevant coursework
Florida State University
Master of Science, Data Science
Completed a Master of Science in Data Science at Florida State University.
Stanford University
Certificate, Machine Learning
Completed the Machine Learning specialization from Stanford University.
Certificate, Data-Driven Decision Making
Completed the "Ask Questions to Make Data-Driven Decisions" course by Google.
Macquarie University
Certificate, Excel Skills for Business
Completed "Excel Skills for Business: Essentials" from Macquarie University.
Tech stack
Software and tools used professionally
Azure HDInsight
Apache Spark
AWS Glue
Apache Flink
Talend
AtScale
Google Cloud Platform
Amazon S3
AWS Step Functions
GitLab
Kubernetes
Jenkins
GitLab CI
NumPy
Pandas
PySpark
AWS Data Pipeline
dbt
PostgreSQL
Hadoop
Gmail
Google Analytics
Databricks
Terraform
AWS CloudFormation
Azure DevOps
Jira
Java
JSON
XML
Azure Machine Learning
TensorFlow
PyTorch
scikit-learn
Kafka
Grafana
Prometheus
Azure Monitor
Datadog
Azure SQL Database
Airflow
Azure Analysis Services
SQL
ServiceNow
Availability
Location
Authorized to work in
Salary expectations
Job categories
Skills
Interested in hiring Deepak?
You can contact Deepak and 90k+ other talented remote workers on Himalayas.
Message DeepakFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
