HimalayasHimalayas logo
RJ
Open to opportunities

Ryan Jepsen

@ryanjepsen

Senior data engineer specializing in low-latency real-time market data pipelines and scalable lakehouse platforms.

United States
Message

What I'm looking for

I’m looking for a Senior Data Engineering role to build low-latency streaming platforms, modernize ETL into lakehouse architectures, and own data quality, governance, and observability from pipeline design through production.

I’m a Senior Data Engineer focused on building low-latency, real-time market data systems. At IMC Trading, I led the development of a Real-Time Market Data Processing Platform, architecting Spark and Scala pipelines integrated with Kafka and Kinesis, deployed on AWS, and reducing processing latency by 35%.

I also delivered an event-driven High-Frequency Data Ingestion Framework using Kafka with AWS SNS and SQS, implementing schema enforcement and fault-tolerant design to support millions of market events per second. I e-architected legacy batch ETL into Spark Structured Streaming jobs orchestrated via Airflow, persisting curated datasets into AWS S3 and Amazon Redshift for downstream analytics.

Beyond performance and reliability, I strengthen resiliency through idempotent processing, retries, and dead-letter queues, and I embed observability, data quality validation, and governance aligned with CI/CD and Git-based version control. I’ve optimized distributed workloads with Spark tuning and lakehouse-style modeling (Star Schema and Snowflake Schema) to support petabyte-scale time-series datasets.

Experience

Work history, roles, and key accomplishments

IT
Current

Senior Data Engineer

IMC Trading

May 2021 - Present (4 years 11 months)

Led development of a real-time market data processing platform, architecting low-latency Spark/Scala pipelines integrated with Kafka and Kinesis, reducing processing latency by 35%. Built high-frequency ingestion and streaming ETL workflows (Airflow + Spark Structured Streaming) and improved throughput by 28% while reducing production incidents by 50%.

CL

Data Engineer

Conversant LLC

Jun 2016 - Oct 2019 (3 years 4 months)

Engineered distributed data processing platforms handling 250B+ log records daily using Spark/Scala with Kafka, Hadoop, and YARN clusters. Migrated legacy MapReduce to Spark/Spark Streaming pipelines and improved execution performance by 60%, enabling near real-time analytics via Kafka + Flume + Elasticsearch.

Education

Degrees, certifications, and relevant coursework

University of Illinois Urbana-Champaign logoUU

University of Illinois Urbana-Champaign

Bachelor of Science, Statistics

2012 - 2016

Earned a Bachelor of Science in Statistics at the University of Illinois Urbana-Champaign from 2012 to 2016.

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan