Eric Redondo
@ericredondo
I’m a senior AI/ML engineer building production GenAI, RAG, and MLOps systems at scale.
What I'm looking for
I’m a Senior AI/ML Engineer with 10+ years building production-grade machine learning and GenAI systems at scale. I specialize in LLM-powered applications, retrieval-augmented generation (RAG), and MLOps that keep models reliable in high-availability environments.
In my recent work, I designed and deployed production RAG for enterprise knowledge search serving 50K+ internal users with low-latency (<1.2s p95) responses. I built end-to-end evaluation and monitoring, including prompt regression testing in CI, LLM-as-judge scoring, retrieval confidence for hallucination mitigation, and embedding drift detection using cosine similarity distribution shift and PSI metrics.
I focus on measurable impact and operational efficiency. I reduced inference cost by 27% using dynamic context window trimming, response caching, and multi-model routing, while also cutting deployment cycles from weeks to days through CI/CD automation with MLflow model registry and Git-based workflows.
Earlier, I built scalable ML systems and data infrastructure—designing cloud-native pipelines with SageMaker, EKS, and Lambda, and developing distributed ETL and real-time streaming using Spark, Kafka, and Kinesis. I’ve worked across NLP/CV and analytics use cases, partnering with scientists to productionize models and improving performance and reliability through monitoring, drift detection, and governance standards.
Experience
Work history, roles, and key accomplishments
Designed and deployed production RAG knowledge search for enterprise customers serving 50K+ internal users with low-latency (p95 <1.2s) responses. Built weak-supervision data labeling and LLM evaluation/monitoring frameworks, reducing manual labeling costs by 60% and inference costs by 27% while improving governance and reliability.
Architected and deployed production ML systems on AWS (SageMaker, EKS, Lambda), including automated training/deployment pipelines with SageMaker Pipelines and Step Functions. Reduced training infrastructure costs by 28% using spot and distributed strategies and improved monitoring with CloudWatch/Prometheus drift detection.
Data Engineer / Machine Learning Engineer
Linedata
Sep 2019 - Oct 2020 (1 year 1 month)
Built distributed ETL pipelines with Apache Spark for petabyte-scale datasets and developed real-time streaming pipelines using Kafka and Kinesis. Improved analytics performance by reducing query latency by 35% through storage optimization and indexing, supporting ML experimentation and A/B testing.
Education
Degrees, certifications, and relevant coursework
Stanford University
Master of Science, Computer Science (Artificial Intelligence specialization)
2015 - 2019
Master of Science in Computer Science with a specialization in Artificial Intelligence at Stanford University (2015–2019).
Texas A&M University
Bachelor of Science, Computer Science
2012 - 2015
Bachelor of Science in Computer Science at Texas A&M University, with minors in Mathematics and Mechanical Engineering (2012–2015).
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Eric?
You can contact Eric and 90k+ other talented remote workers on Himalayas.
Message EricFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
