Himalayas logo
SU
Open to opportunities

Sohith A User

@sohithauser

Data Engineer skilled in scalable ETL pipeline design and optimization.

United States
Message

What I'm looking for

I am looking for a role that offers opportunities for growth, collaboration, and innovation in data engineering.

I am a Data Engineer with extensive experience in designing scalable ETL pipelines using Python, SQL, and Apache Spark. My journey in data engineering has allowed me to develop and optimize data ingestion and transformation processes for large datasets, ensuring data accuracy and performance. I excel at integrating disparate data sources and implementing automation in cloud environments, collaborating with cross-functional teams to deliver robust, analytics-ready datasets.

At Health Catalyst, I designed and implemented scalable ETL pipelines that enhanced data availability by 25%. My expertise in automating cloud workflows with Python scripts for AWS Glue and Lambda has significantly reduced manual processing time. I have also developed robust data models and schemas in Snowflake and MongoDB, improving analytical performance and supporting business requirements. My ability to integrate machine learning processes using Amazon SageMaker has further boosted data quality by 20%, showcasing my commitment to leveraging technology for better data solutions.

Previously, at Accenture, I architected and maintained end-to-end ETL pipelines, increasing pipeline efficiency by 15%. My work involved automating cloud-based data workflows and orchestrating containerized deployments with Kubernetes, ensuring scalable and reliable application performance. I am passionate about data engineering and continuously seek to refine my skills and contribute to innovative data solutions.

Experience

Work history, roles, and key accomplishments

HC
Current

Data Engineer

Health Catalyst

May 2024 - Present (1 year 3 months)

Designed and implemented scalable ETL pipelines using Python, SQL, and Apache Spark to ingest, transform, and load large datasets into Snowflake, enhancing data availability by 25%. Automated cloud workflows with Python scripts for AWS Glue and Lambda, reducing manual processing time by 30% and ensuring consistent data ingestion.

A(

Data Engineer

Accenture (Gatorade)

May 2021 - Jul 2023 (2 years 2 months)

Architected and maintained end-to-end ETL pipelines using Python, PySpark, and SQL to ingest and transform data into Snowflake, increasing pipeline efficiency by 15%. Engineered reusable Python modules for data cleansing and validation, enhancing data integrity for downstream analytics and reporting.

Education

Degrees, certifications, and relevant coursework

UH

University of Houston

Masters, Computer Science

2023 - 2025

Pursued a Master's degree in Computer Science, focusing on advanced topics and research within the field. Developed expertise in various areas of computer science.

SU

SRM University

Bachelor of Technology, Computer Science

2019 - 2023

Completed a Bachelor of Technology in Computer Science, gaining foundational knowledge and practical skills. Engaged in coursework covering core computer science principles.

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan
Sohith A User - Data Engineer - Health Catalyst | Himalayas