Skip to main content
RF
Open to opportunities

Ryan Farmacka

@ryanfarmacka

Senior machine learning engineer building reliable Spark pipelines and CI/CD.

United States
Message

What I'm looking for

I’m looking for a team where I can build production-grade ML/Spark pipelines with strong CI/CD, testing, and observability—reducing manual ops while improving data quality for downstream decisions.

I build production-focused ML workflows end to end—training/inference and eligibility/enrollment style data flows—centered on reliable Spark-based pipelines, Delta-style storage patterns, and cloud deployments. I’ve hardened pipelines with CI/CD, automated testing, schema evolution safeguards, idempotent backfills, and deterministic partitioning to reduce production breakages and improve downstream data quality.

I’m especially proud of making systems observable and dependable: I add structured logging, metrics, drift signals, actionable alerting, and runbooks so teams can diagnose issues fast. From regulated, PII-aware handling and least-privilege access patterns to promotion gates, rollback-friendly job versioning, and quality checks for join keys and records, I focus on secure, measurable outcomes that cut manual operations.

Experience

Work history, roles, and key accomplishments

SC
Current

Senior Machine Learning Engineer

ScienceSoft

Oct 2022 - Present (3 years 8 months)

Designed an end-to-end ML workflow template for consulting engagements, adding dataset validation gates, CI checks, and repeatable training jobs. Improved production reliability and data quality by implementing idempotent backfills and deterministic partitioning, and strengthened observability with structured logs, metrics, and actionable alerts.

CC

Machine Learning Engineer

Connect for Health CO

Sep 2017 - Oct 2022 (5 years 1 month)

Built and maintained PySpark data pipelines for a healthcare marketplace enrollment platform, transforming eligibility, plan, and subsidy data into consistent Delta-style tables. Reduced enrollment verification failures by handling late-arriving updates and retries, adding reconciliation/validation checks, and implementing PII-aware, least-privilege patterns with improved observability.

MS

Junior Data Engineer / Developer

MasTec Network Solutions

Aug 2015 - Aug 2017 (2 years)

Supported telecom network operations by building Python/SQL ETL jobs to normalize event logs for reporting and operational dashboards. Improved reliability and runtime performance by adding retry/backoff and idempotent loads, optimizing Spark partitioning and join strategies, and improving operational support with runbooks and better logging.

Education

Degrees, certifications, and relevant coursework

Texas Tech University logoTU

Texas Tech University

Bachelor's in Computer Science, Computer Science

Earned a Bachelor's degree in Computer Science from Texas Tech University in May 2015.

Availability

Open to opportunities

Location

United States

Authorized to work in

Interested in hiring Ryan?

You can contact Ryan and 90k+ other talented remote workers on Himalayas.

Message Ryan

People also viewed

View all talent

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan