Open to opportunities

Alex Johnson

@alexjohnson2

Message

I build reliable, scalable AWS ETL data pipelines for healthcare and analytics teams.

United States

Message

What I'm looking for

I’m looking for a senior ETL/data engineering role where I can design reliable pipelines, strengthen data quality, optimize performance on AWS, and mentor teams while translating complex business requirements into scalable analytics.

I’m a Senior ETL Developer with roughly 10 years of experience delivering scalable, high-quality data pipelines, grounded in data quality and test automation. I build data platforms that are efficient and “reliable by design,” translating complex business needs into impactful outcomes.

In my current role at Kaiser Permanente, I led enterprise data pipelines supporting healthcare analytics and reporting. I built multi-source ingestion frameworks with Apache Airflow, resolved data latency by redesigning scheduling and dependency-based orchestration, and optimized Amazon Redshift models for clinical and operational BI reporting.

Previously, I developed Python-based ETL pipelines that processed large-scale healthcare data into AWS S3 and Redshift for reporting and analytics. I handled schema drift and upstream inconsistencies using dynamic schema validation and transformation logic, and I implemented in-pipeline validation to maintain accuracy for clinical and operational datasets.

Earlier at Carvana, I built scalable Apache Spark pipelines for streaming and batch analytics and managed Snowflake warehouse support for financial and operational reporting. I also developed automated data quality frameworks, automated S3-based ingestion/reporting workflows, and improved performance by optimizing Spark partitioning and join strategies.

Experience

Work history, roles, and key accomplishments

Current

Senior ETL Developer

Current

Kaiser Permanente

Jan 2024 - Present (2 years 7 months)

Led development of enterprise data pipelines supporting healthcare analytics and reporting systems, building multi-source ingestion frameworks with Apache Airflow. Resolved data latency issues by redesigning scheduling and dependency-based orchestration and optimized Amazon Redshift models for BI reporting.

Airflow Performance Optimization Amazon Redshift Python Data Orchestration Scheduling Data Governance SQL

ETL Developer

Kaiser Permanente

May 2022 - Dec 2023 (1 year 7 months)

Developed Python-based ETL pipelines processing large-scale healthcare data into AWS S3 and Redshift to support reporting and analytics. Implemented dynamic schema validation and in-pipeline data validation to handle schema drift and upstream inconsistencies.

S3 Python Amazon Redshift ETL Data Validation data transformation SQL Data Quality

ETL Engineer

Carvana

Nov 2019 - Apr 2022 (2 years 5 months)

Built scalable Apache Spark pipelines for streaming and batch data to power real-time inventory and sales analytics. Managed Snowflake for reporting workloads and implemented automated data quality monitoring while optimizing Spark partitioning and join strategies.

Apache Spark Streaming Pipelines Batch Processing Snowflake Data Quality Query Optimization Partitioning

Programming Analyst

Cognizant

Jul 2018 - Oct 2019 (1 year 3 months)

Delivered data migration solutions for a telecom BSS transformation project, including mapping and transformation logic. Created SQL-based validation processes to ensure data integrity during migration.

SQL Data Migration Data Mapping Data Integrity Data Quality Telecommunications

Test Automation Engineer

Cognizant

Jul 2016 - Jun 2018 (1 year 11 months)

Built a Selenium + Python automation framework for enterprise web applications to improve regression coverage. Integrated automated tests into Jenkins pipelines to support continuous testing in CI/CD environments.

Selenium Python Testing Automation Regression Testing Jenkins CI CD Integration Testing enterprise applications