Himalayas logo
SB
Open to opportunities

Saujan Baniya

@saujanbaniya

I am a Senior Data Engineer building scalable cloud-native data platforms.

United States
Message

What I'm looking for

I’m seeking a senior role building scalable, secure cloud data platforms and real-time pipelines in collaborative, compliance-focused teams where I can lead architecture, mentor engineers, and deliver production-grade analytics.

I am a results-driven Senior Data Engineer with over 7 years designing and modernizing cloud-native data platforms across finance, healthcare, and telecom.

I have built multi-terabyte data warehouses and orchestrated PySpark ETL in Databricks, implemented Medallion Architecture with Delta Lake, and integrated dbt to standardize transformations and testing. I designed real-time processing with Kafka, Spark, and Flink to reduce data latency and enable operational insights.

I am proficient across AWS, Azure, and GCP and automate infrastructure using Terraform, CloudFormation, and CI/CD tools like GitHub Actions and Azure DevOps. I enforce data quality and governance with Great Expectations and Azure Purview while ensuring compliance with HIPAA, GDPR, and SOX.

I consistently deliver production-ready, secure solutions—authoring documentation, mentoring junior engineers, and building dashboards and APIs that support predictive analytics, regulatory reporting, and enterprise decision-making.

Experience

Work history, roles, and key accomplishments

Pfizer logoPF
Current

Senior Data Engineer

Aug 2022 - Present (3 years 2 months)

Designed and deployed a multi-terabyte data warehouse on AWS Redshift and built PySpark ETL workflows in Databricks across AWS S3 and GCP Storage to enable scalable transformations. Implemented Medallion Architecture with Delta Lake and dbt, built real-time Kafka/Spark/Flink pipelines to reduce data latency, automated IaC with Terraform, and enforced HIPAA/GDPR controls.

LH

Data Engineer

LifePoint Health

Aug 2020 - Jul 2022 (1 year 11 months)

Built ETL pipelines with Azure Data Factory and Delta Lake in Azure Databricks to support CDC and modeled ML-ready datasets in Azure Synapse for analytics and reporting. Deployed dbt models and Great Expectations validations, automated infrastructure with Terraform and CI/CD, and delivered Power BI dashboards while enforcing HIPAA/GDPR controls.

Verizon logoVE

Data Engineer

Verizon

Jan 2018 - Jun 2020 (2 years 5 months)

Developed Hadoop and Spark pipelines processing multi-terabyte clickstream and log data, migrating batch workflows to Spark to achieve 5x performance gains and lower compute costs. Built Kafka and NiFi streaming ingestion, implemented Delta Lake and Parquet data lakes in S3, automated infrastructure with Terraform and CI/CD, and implemented data quality checks across pipelines.

Education

Degrees, certifications, and relevant coursework

The University of Findlay logoTF

The University of Findlay

Master of Business Administration, Business Analytics

Master's in Business Analytics (MBA) from The University of Findlay, focusing on business analytics and data-driven decision-making.

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan
Saujan Baniya - Senior Data Engineer - Pfizer | Himalayas