Skip to main content
HimalayasHimalayas logo
AS
Open to opportunities

Aravind Swamy

@aravindswamy

Data Engineer focused on reliable ETL/ELT, dimensional modeling, and secure, analytics-ready pipelines.

United States
Message

What I'm looking for

I’m looking for a role where I can own end-to-end ETL/ELT pipelines and dimensional models, partner with stakeholders on analytics outcomes, and build secure, test-driven data systems with strong data quality and automation.

I’m a Data Engineer with 2 years of experience building end-to-end ETL/ELT pipelines and dimensional data models using PySpark, AWS, Azure, Snowflake, and dbt. I bring a strong analytics engineering foundation from an MS in Data Analytics Engineering and a Cloud Big Data Masters Certification.

In my Data Engineer Intern role at DeepThink Health, I architected daily ingestion pipelines from S3 to Snowflake for claims, eligibility, and clinical data, orchestrating COPY INTO commands with Airflow to achieve a 99% load success rate. I built dbt Cloud staging-to-gold models, implemented 30+ dbt tests, and added column-level masking and row-level security in Snowflake—reducing data quality incidents by 90% while maintaining HIPAA-compliant PHI protection.

I also strengthened end-to-end delivery by orchestrating transformation runs through dbt Cloud’s native job scheduler and ingestion through Airflow, maintaining 99% on-time delivery within agreed service windows. By integrating Snowflake with a patient care management UI and Power BI, I helped drive a 20% reduction in hospital readmission rates, and I built DAX/Power Query dashboards that reduced Snowflake compute costs by 30%.

Earlier at Cognizant Technology Solutions, I built Azure Data Factory pipelines to extract data from on-prem Oracle systems into ADLS Gen2 for a large cloud migration, and I developed PySpark notebooks in Databricks to cleanse and transform 20+ TB of raw data, cutting processing time by 60%. I delivered CI/CD practices with Azure DevOps across development/test/production and supported fact/dimension migrations into Azure Synapse with 99% data integrity, while Power BI reporting contributed to a 15% revenue increase from high-margin travel packages.

Experience

Work history, roles, and key accomplishments

DH

Data Engineer Intern

DeepThink Health

Jan 2025 - Jun 2025 (5 months)

Architected S3-to-Snowflake daily ingestion pipelines for claims and clinical data, achieving a 99% load success rate, and built dbt Cloud staging-to-gold models into fact/dimension structures. Implemented 30+ dbt tests with Snowflake masking and row-level security, reducing data quality incidents by 90% while integrating Snowflake with a care UI and Power BI to support a 20% readmission reduction

CS

Data Engineer Trainee

Cognizant Technology Solutions

Aug 2021 - Jul 2022 (11 months)

Built Azure Data Factory pipelines to extract data from on-prem Oracle into ADLS Gen2 for a large cloud migration, and developed PySpark (Databricks) notebooks to cleanse and transform 20+ TB of data, cutting processing time by 60%. Orchestrated ADF ETL workflows for 6 business-critical tables into Azure Synapse fact/dimension layers with 99% integrity, implemented Azure DevOps CI/CD across enviro

Education

Degrees, certifications, and relevant coursework

Northeastern University logoNU

Northeastern University

Master of Science (MS) in Data Analytics Engineering, Data Analytics Engineering

Grade: GPA 3.9/4.0

Master of Science in Data Analytics Engineering at Northeastern University, completed in December 2025.

Amrita University logoAU

Amrita University

Bachelor of Technology in Electronics & Communication Engineering, Electronics & Communication Engineering

Grade: GPA 3.0/4.0

Bachelor of Technology in Electronics & Communication Engineering at Amrita University, completed in July 2021.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan