HimalayasHimalayas logo
BB
Open to opportunities

Bishal Bhattarai

@bishalbhattarai1

Senior Data Engineer building secure, real-time lakehouse pipelines for healthcare and finance analytics and AI.

United States
Message

What I'm looking for

I’m looking for a team where I can build secure, real-time lakehouse and streaming pipelines, strengthen data governance for regulated data, and ship production-grade analytics with strong CI/CD and observability.

I’m a Senior Data Engineer with 7+ years of professional experience designing and building scalable, secure, real-time data pipelines across healthcare and finance. I focus on delivering trusted analytics solutions in regulated environments, including early-stage Generative AI use cases.

I built a metadata-driven, self-service ingestion framework using AWS Glue, Spark, and S3 that enabled rapid onboarding of 10+ structured and semi-structured source systems into the lakehouse—reducing integration time by 60%. I engineered AWS-based pipelines with S3, Lambda, Step Functions, CloudWatch, and Glue Data Catalog for metadata management and auditability, and I optimize Spark jobs with Spark DataFrame and Spark SQL to cut execution time by 40% through partitioning, caching, and parallelization.

On healthcare platforms, I designed HIPAA-compliant patient monitoring analytics using AWS services including S3, SNS, SQS, Lambda, and Kinesis. I used Spark Structured Streaming with PySpark, Kafka/Kinesis, checkpointing, watermarking, and exactly-once processing patterns, and I enforced governance and security with Lake Formation RBAC, IAM/KMS, VPC Endpoints, and CloudTrail/CloudWatch with PHI masking and least-privilege access.

Across roles, I’ve delivered production-grade warehouse and modeling work with Snowflake, Delta Lake, and Medallion architecture (including star schema/snowflake-style modeling and SCD Type 2). I also automate deployments with CI/CD (Jenkins, GitHub Actions, Azure DevOps), provision infrastructure with Terraform modules, and lead POCs to enable LLM-integrated data platforms for retrieval-augmented generation (RAG) and conversational analytics.

Experience

Work history, roles, and key accomplishments

Elevance Health logoEH
Current

Senior Data Engineer

Aug 2023 - Present (2 years 9 months)

Designed and built a HIPAA-compliant patient monitoring analytics platform on AWS, creating a metadata-driven ingestion framework that onboarded 10+ source systems and reduced integration time by 60%. Engineered optimized PySpark/EMR/Glue ETL and Structured Streaming pipelines, improving processing performance by 35% and retiring Informatica jobs to cut runtime by 35%.

BB

Data Engineer

Bremer Bank

Jun 2021 - Jul 2023 (2 years 1 month)

Built a fraud analytics platform and data pipeline frameworks for automated batch and real-time streaming ingestion and delivery. Developed serverless ETL with AWS Glue and Spark, configured Redshift for analytics workloads, and implemented event-driven pipelines using Kafka/Kinesis.

J.B. Hunt logoJH

Data Engineer / ETL Developer

J.B. Hunt

Mar 2018 - May 2021 (3 years 2 months)

Created data lake ingestion and preparation pipelines using Azure Databricks and Spark-SQL for extraction, transformation, and aggregation from multiple file formats. Implemented Medallion architecture, delta tables with scheduled jobs, and Kafka-to-HDFS near real-time processing while migrating SQL databases to Azure data services.

Education

Degrees, certifications, and relevant coursework

Webster University logoWU

Webster University

Master's degree in Information Technology Management, Information Technology Management

Completed a Master's program in Information Technology Management at Webster University.

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan