Ryan Jepsen
@ryanjepsen
Senior data engineer specializing in low-latency real-time market data pipelines and scalable lakehouse platforms.
What I'm looking for
I’m a Senior Data Engineer focused on building low-latency, real-time market data systems. At IMC Trading, I led the development of a Real-Time Market Data Processing Platform, architecting Spark and Scala pipelines integrated with Kafka and Kinesis, deployed on AWS, and reducing processing latency by 35%.
I also delivered an event-driven High-Frequency Data Ingestion Framework using Kafka with AWS SNS and SQS, implementing schema enforcement and fault-tolerant design to support millions of market events per second. I e-architected legacy batch ETL into Spark Structured Streaming jobs orchestrated via Airflow, persisting curated datasets into AWS S3 and Amazon Redshift for downstream analytics.
Beyond performance and reliability, I strengthen resiliency through idempotent processing, retries, and dead-letter queues, and I embed observability, data quality validation, and governance aligned with CI/CD and Git-based version control. I’ve optimized distributed workloads with Spark tuning and lakehouse-style modeling (Star Schema and Snowflake Schema) to support petabyte-scale time-series datasets.
Experience
Work history, roles, and key accomplishments
Senior Data Engineer
IMC Trading
May 2021 - Present (4 years 11 months)
Led development of a real-time market data processing platform, architecting low-latency Spark/Scala pipelines integrated with Kafka and Kinesis, reducing processing latency by 35%. Built high-frequency ingestion and streaming ETL workflows (Airflow + Spark Structured Streaming) and improved throughput by 28% while reducing production incidents by 50%.
Big Data Engineer
KPMG US
Oct 2019 - May 2021 (1 year 7 months)
Spearheaded enterprise data modernization by migrating legacy SQL Server workflows into distributed Spark/PySpark pipelines on Azure Databricks using Delta Lake storage. Built regulatory reporting ELT pipelines with dbt and improved processing time by 45% through Spark performance tuning and orchestration.
Data Engineer
Conversant LLC
Jun 2016 - Oct 2019 (3 years 4 months)
Engineered distributed data processing platforms handling 250B+ log records daily using Spark/Scala with Kafka, Hadoop, and YARN clusters. Migrated legacy MapReduce to Spark/Spark Streaming pipelines and improved execution performance by 60%, enabling near real-time analytics via Kafka + Flume + Elasticsearch.
Education
Degrees, certifications, and relevant coursework
University of Illinois Urbana-Champaign
Bachelor of Science, Statistics
2012 - 2016
Earned a Bachelor of Science in Statistics at the University of Illinois Urbana-Champaign from 2012 to 2016.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Portfolio
github.com/ryancjepsenJob categories
Skills
Interested in hiring Ryan?
You can contact Ryan and 90k+ other talented remote workers on Himalayas.
Message RyanFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
