Open to opportunities

Samip Subedi

@samipsubedi1

Senior Data Engineer with expertise in cloud-based data solutions.

United States

What I'm looking for

I am looking for a role that fosters innovation and collaboration, with opportunities for growth in data engineering and analytics.

I am a Senior Data Engineer with over 6 years of experience in designing, implementing, and managing cloud-based data architectures and ETL pipelines. My expertise lies in refactoring legacy workflows using Python and building scalable data pipelines with technologies like Apache Spark and Databricks. I have a strong background in cloud and hybrid architectures, particularly with AWS and Azure, where I have successfully migrated and integrated complex data engineering workflows.

Throughout my career, I have developed ETL/ELT solutions using AWS Glue, Azure Data Factory, and other tools, optimizing data ingestion and transformation processes. I am adept at implementing data governance and security measures, ensuring compliance with regulations like HIPAA and GDPR. My collaborative work with data scientists has led to improved clinical outcome predictions and enhanced data analytics capabilities across various domains.

Experience

Work history, roles, and key accomplishments

Current

Senior Data Engineer

Johnson & Johnson

Jan 2023 - Present (2 years 7 months)

Designed scalable ETL pipelines using PySpark, Databricks, and Google Cloud Dataflow, processing over 5TB of healthcare data daily in a hybrid cloud environment. Automated infrastructure deployment using Terraform, CloudFormation, and GitLab CI/CD, reducing manual provisioning effort by 70%.

Data Engineer

BofA

May 2020 - Present (5 years 3 months)

Designed scalable ETL pipelines using Azure Data Factory, Matillion, and Apache Airflow, automating ingestion from RDBMS and APIs into Azure Data Lake. Built modular data processing workflows using Databricks Notebooks, applying PySpark to transform customer and policyholder data across 10+ business domains.

Data Engineer

Charles Schwab

Jul 2018 - Present (7 years 1 month)

Built and optimized ETL pipelines using AWS Glue, Informatica PowerCenter, and Cleo Integration Cloud to integrate third-party logistics and supplier feeds into enterprise data lakes. Engineered Hadoop-based pipelines using Apache Pig, MapReduce, and Hive, improving product catalog ingestion speed by 30%.

Education

Degrees, certifications, and relevant coursework

Houston Christian University

Master of Business Administration, Data Analytics

Focused on Data Analytics, gaining expertise in advanced analytical techniques and their application in business contexts. Developed skills in data-driven decision-making and strategic business intelligence.
