Sajin Shrestha
@sajinshrestha
Senior Data Engineer building scalable cloud data platforms and reliable pipelines for analytics-driven decisions.
What I'm looking for
I’m a Senior Data Engineer with 8+ years of experience designing scalable data platforms and building reliable data pipelines for large-scale analytics and business decision making.
At Johnson & Johnson, I led the architecture of a healthcare analytics lakehouse platform using Azure Databricks and Snowflake. I built ingestion and distributed transformation pipelines with Apache Spark and automated workflows using Azure Data Factory and Apache Airflow, partnering with data scientists and business teams to deliver trusted, analytics-ready datasets.
Previously at Biogen and LifePoint Health, I delivered batch and streaming ingestion on GCP (Pub/Sub, Dataflow, BigQuery) and AWS (S3, Redshift), enforced governance with IAM and compliance controls, improved data quality with validation frameworks, and supported performance tuning and monitoring. I’m passionate about modernizing legacy systems to cloud-native architectures and building robust data ecosystems that reduce operational cost while improving scalability.
Experience
Work history, roles, and key accomplishments
Led architecture of a healthcare analytics lakehouse platform using Azure Databricks and Snowflake, building ingestion and distributed transformation pipelines with Spark. Implemented orchestration, governance, monitoring, and real-time ingestion using Azure Data Factory, Airflow, Lake Formation/Snowflake security controls, and Kafka/Event Hubs.
Built batch and streaming ingestion pipelines on GCP using Pub/Sub and Dataflow, structuring data layers in Cloud Storage and BigQuery for research and operational analytics. Developed ETL/workflow orchestration with Python/SQL and Beam plus Composer (Airflow), and implemented data quality validation, access controls, and monitoring using GCP services.
Data Engineer
Lifepoint Health
Oct 2017 - Feb 2020 (2 years 4 months)
Assisted in building batch data ingestion and ETL workflows for healthcare operational, patient, and financial datasets using Python, SQL, and distributed processing with Spark/Hadoop. Supported orchestration and optimization with Airflow, Redshift, and AWS S3 while implementing governance, monitoring, and reusable transformation scripts.
Education
Degrees, certifications, and relevant coursework
University
Master’s of Science in Robotics Engineering, Robotics Engineering
Completed a Master’s of Science in Robotics Engineering at a university in Dearborn, Michigan.
Tech stack
Software and tools used professionally
Amazon Redshift
Azure Synapse
Apache Spark
AWS Glue
Apache Hive
AWS IAM
Amazon S3
Google Cloud Storage
Kubernetes
Azure Kubernetes Service
dbt
DB
Sqoop
MySQL
MongoDB
Hadoop
HBase
Gmail
Databricks
Terraform
Java
Apache Flume
Kafka
Azure Monitor
Google Cloud Pub/Sub
Airflow
Apache Beam
Apache Oozie
Time Analytics
SQL
Azure Cosmos DB
Great Expectations
Cosmos
Bash
Factory
Beam
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Sajin?
You can contact Sajin and 90k+ other talented remote workers on Himalayas.
Message SajinFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
