Jordan Tran
@jordantran
Senior Data Engineer building petabyte-scale data platforms and ML pipelines.
What I'm looking for
I’m a Senior Data Engineer with 11+ years of experience building large-scale data platforms and ML data pipelines at Amazon, Databricks, and Spotify. I specialize in event ingestion, lakehouse architecture, distributed data processing, and production-grade data systems running at petabyte scale.
At Amazon, I led development of batch and streaming data pipelines supporting OneMedical, implementing Kafka + Databricks real-time ingestion to improve event freshness and downstream decision-making. I’ve also strengthened reliability, observability, and recovery to maintain 99.9%+ uptime while improving data quality and auditability, and I optimized Spark performance through partitioning strategies and storage layout changes. Earlier, at Databricks, I helped scale Delta Lake with transactional reliability and schema enforcement, operationalized MLflow-based experiment and handoff workflows, and replaced fragile Parquet ETL patterns with auditable, production-ready pipelines.
Experience
Work history, roles, and key accomplishments
Built large-scale batch and streaming data pipelines using Kafka and Databricks to improve event freshness for healthcare analytics and decision-making. Improved reliability and observability to maintain 99.9%+ uptime while strengthening data quality, auditability, and Spark performance.
Contributed to foundational lakehouse initiatives, scaling Delta Lake with transactional reliability, schema enforcement, and auditability for concurrent batch and streaming workloads. Operationalized MLflow workflows for experiment tracking and reproducible handoffs, improving performance and operability of large-scale Spark pipelines.
Worked on Spotify’s Event Delivery Infrastructure migration and Discover Weekly personalization pipelines during rapid user growth, supporting cloud migration and event reliability. Helped scale event processing to 100B+ daily events across 500+ event types and improved analytics job efficiency with Parquet migration and query optimization.
Education
Degrees, certifications, and relevant coursework
The University of Texas at Austin
Bachelor of Science, Computer Science
2012 - 2015
Grade: GPA: 3.93
Earned a Bachelor of Science in Computer Science at The University of Texas at Austin (GPA: 3.93).
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Jordan?
You can contact Jordan and 90k+ other talented remote workers on Himalayas.
Message JordanFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
