Alex Johnson
@alexjohnson2
I build reliable, scalable AWS ETL data pipelines for healthcare and analytics teams.
What I'm looking for
I’m a Senior ETL Developer with roughly 10 years of experience delivering scalable, high-quality data pipelines, grounded in data quality and test automation. I build data platforms that are efficient and “reliable by design,” translating complex business needs into impactful outcomes.
In my current role at Kaiser Permanente, I led enterprise data pipelines supporting healthcare analytics and reporting. I built multi-source ingestion frameworks with Apache Airflow, resolved data latency by redesigning scheduling and dependency-based orchestration, and optimized Amazon Redshift models for clinical and operational BI reporting.
Previously, I developed Python-based ETL pipelines that processed large-scale healthcare data into AWS S3 and Redshift for reporting and analytics. I handled schema drift and upstream inconsistencies using dynamic schema validation and transformation logic, and I implemented in-pipeline validation to maintain accuracy for clinical and operational datasets.
Earlier at Carvana, I built scalable Apache Spark pipelines for streaming and batch analytics and managed Snowflake warehouse support for financial and operational reporting. I also developed automated data quality frameworks, automated S3-based ingestion/reporting workflows, and improved performance by optimizing Spark partitioning and join strategies.
Experience
Work history, roles, and key accomplishments
Led development of enterprise data pipelines supporting healthcare analytics and reporting systems, building multi-source ingestion frameworks with Apache Airflow. Resolved data latency issues by redesigning scheduling and dependency-based orchestration and optimized Amazon Redshift models for BI reporting.
Developed Python-based ETL pipelines processing large-scale healthcare data into AWS S3 and Redshift to support reporting and analytics. Implemented dynamic schema validation and in-pipeline data validation to handle schema drift and upstream inconsistencies.
Built scalable Apache Spark pipelines for streaming and batch data to power real-time inventory and sales analytics. Managed Snowflake for reporting workloads and implemented automated data quality monitoring while optimizing Spark partitioning and join strategies.
Delivered data migration solutions for a telecom BSS transformation project, including mapping and transformation logic. Created SQL-based validation processes to ensure data integrity during migration.
Built a Selenium + Python automation framework for enterprise web applications to improve regression coverage. Integrated automated tests into Jenkins pipelines to support continuous testing in CI/CD environments.
Education
Degrees, certifications, and relevant coursework
Texas Tech University
Bachelor of Science, Computer Science
2012 - 2016
Earned a Bachelor of Science in Computer Science at Texas Tech University from 2012 to 2016.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Interested in hiring Alex?
You can contact Alex and 90k+ other talented remote workers on Himalayas.
Message AlexFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
