Sohith A User
@sohithauser
Data Engineer skilled in scalable ETL pipeline design and optimization.
What I'm looking for
I am a Data Engineer with extensive experience in designing scalable ETL pipelines using Python, SQL, and Apache Spark. My journey in data engineering has allowed me to develop and optimize data ingestion and transformation processes for large datasets, ensuring data accuracy and performance. I excel at integrating disparate data sources and implementing automation in cloud environments, collaborating with cross-functional teams to deliver robust, analytics-ready datasets.
At Health Catalyst, I designed and implemented scalable ETL pipelines that enhanced data availability by 25%. My expertise in automating cloud workflows with Python scripts for AWS Glue and Lambda has significantly reduced manual processing time. I have also developed robust data models and schemas in Snowflake and MongoDB, improving analytical performance and supporting business requirements. My ability to integrate machine learning processes using Amazon SageMaker has further boosted data quality by 20%, showcasing my commitment to leveraging technology for better data solutions.
Previously, at Accenture, I architected and maintained end-to-end ETL pipelines, increasing pipeline efficiency by 15%. My work involved automating cloud-based data workflows and orchestrating containerized deployments with Kubernetes, ensuring scalable and reliable application performance. I am passionate about data engineering and continuously seek to refine my skills and contribute to innovative data solutions.
Experience
Work history, roles, and key accomplishments
Data Engineer
Health Catalyst
May 2024 - Present (1 year 3 months)
Designed and implemented scalable ETL pipelines using Python, SQL, and Apache Spark to ingest, transform, and load large datasets into Snowflake, enhancing data availability by 25%. Automated cloud workflows with Python scripts for AWS Glue and Lambda, reducing manual processing time by 30% and ensuring consistent data ingestion.
Data Engineer
Accenture (Gatorade)
May 2021 - Jul 2023 (2 years 2 months)
Architected and maintained end-to-end ETL pipelines using Python, PySpark, and SQL to ingest and transform data into Snowflake, increasing pipeline efficiency by 15%. Engineered reusable Python modules for data cleansing and validation, enhancing data integrity for downstream analytics and reporting.
Education
Degrees, certifications, and relevant coursework
University of Houston
Masters, Computer Science
2023 - 2025
Pursued a Master's degree in Computer Science, focusing on advanced topics and research within the field. Developed expertise in various areas of computer science.
SRM University
Bachelor of Technology, Computer Science
2019 - 2023
Completed a Bachelor of Technology in Computer Science, gaining foundational knowledge and practical skills. Engaged in coursework covering core computer science principles.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Sohith A?
You can contact Sohith A and 90k+ other talented remote workers on Himalayas.
Message Sohith AFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
