Ryan Jepsen
@ryanjepsen
Senior data engineer specializing in low-latency real-time market data pipelines and scalable lakehouse platforms.
What I'm looking for
I’m a Senior Data Engineer focused on building low-latency, real-time market data systems. At IMC Trading, I led the development of a Real-Time Market Data Processing Platform, architecting Spark and Scala pipelines integrated with Kafka and Kinesis, deployed on AWS, and reducing processing latency by 35%.
I also delivered an event-driven High-Frequency Data Ingestion Framework using Kafka with AWS SNS and SQS, implementing schema enforcement and fault-tolerant design to support millions of market events per second. I e-architected legacy batch ETL into Spark Structured Streaming jobs orchestrated via Airflow, persisting curated datasets into AWS S3 and Amazon Redshift for downstream analytics.
Beyond performance and reliability, I strengthen resiliency through idempotent processing, retries, and dead-letter queues, and I embed observability, data quality validation, and governance aligned with CI/CD and Git-based version control. I’ve optimized distributed workloads with Spark tuning and lakehouse-style modeling (Star Schema and Snowflake Schema) to support petabyte-scale time-series datasets.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
IMC Trading
May 2021 - Present (5 years 1 month)
Led development of a real-time market data processing platform, architecting low-latency Spark/Scala pipelines integrated with Kafka and Kinesis, reducing processing latency by 35%. Built high-frequency ingestion and streaming ETL workflows (Airflow + Spark Structured Streaming) and improved throughput by 28% while reducing production incidents by 50%.
Big Data Engineer
KPMG US
Oct 2019 - May 2021 (1 year 7 months)
Spearheaded enterprise data modernization by migrating legacy SQL Server workflows into distributed Spark/PySpark pipelines on Azure Databricks using Delta Lake storage. Built regulatory reporting ELT pipelines with dbt and improved processing time by 45% through Spark performance tuning and orchestration.
Data Engineer
Conversant LLC
Jun 2016 - Oct 2019 (3 years 4 months)
Engineered distributed data processing platforms handling 250B+ log records daily using Spark/Scala with Kafka, Hadoop, and YARN clusters. Migrated legacy MapReduce to Spark/Spark Streaming pipelines and improved execution performance by 60%, enabling near real-time analytics via Kafka + Flume + Elasticsearch.
Education
Degrees, certifications, and relevant coursework
University of Illinois Urbana-Champaign
Bachelor of Science, Statistics
2012 - 2016
Earned a Bachelor of Science in Statistics at the University of Illinois Urbana-Champaign from 2012 to 2016.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Portfolio
github.com/ryancjepsenJob categories
Skills
Interested in hiring Ryan?
You can contact Ryan and 90k+ other talented remote workers on Himalayas.
Message RyanFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
