HimalayasHimalayas logo
AS
Open to opportunities

Ahmed Syed

@ahmedsyed

Senior ML/MLOps engineer building production LLM systems—RAG, multi-agent workflows, and low-latency inference on AWS.

United States
Message

What I'm looking for

I’m looking to build end-to-end production LLM/RAG and multi-agent systems in fast-paced teams—optimizing latency, cost, and quality while prioritizing trust & safety, evaluation rigor, and measurable impact.

I’m a Senior ML & MLOps Engineer with 8+ years of experience taking research ideas into production—building production LLM systems, RAG pipelines, multi-agent workflows, and scalable inference platforms. I’m known for iterating fast while optimizing latency, accuracy, and cost, and for owning systems end-to-end from experimentation to deployment and continuous improvement.

At Vizient, I designed and deployed Advanced RAG and Graph RAG systems for an internal search platform, improving answer relevance and reducing retrieval errors. I built hybrid retrieval (BM25 + vector search + RRF reranking), benchmarked multiple RAG architectures, and created evaluation frameworks that measure retrieval and generation quality across configurations and datasets.

I also deploy and optimize production LLM systems on AWS SageMaker, where I improved latency by ~30%. Previously at Rivian and eBay, I built a multi-agent AI assistant for CRM workflows, implemented guardrails and safety controls, engineered distributed ML pipelines, and delivered end-to-end ML/data products—strengthening trust & safety, observability, and reliable execution in real-world environments.

Experience

Work history, roles, and key accomplishments

Rivian logoRI

Senior AI/ML Engineer

Rivian

Apr 2021 - Mar 2024 (2 years 11 months)

Built a multi-agent AI assistant for CRM workflows to automate customer insights, execution, and decision support across sales and support. Implemented multi-agent orchestration with guardrails and an LLM-as-judge evaluation pipeline to improve alignment and reliability while reducing latency and operational cost.

Ebay logoEB

Senior Machine Learning Engineer

Ebay

Apr 2018 - Apr 2019 (1 year)

Delivered end-to-end ML solutions from data generation and training through deployment and iteration. Developed NLP pipelines for entity extraction and classification using spaCy and Transformers, cutting manual review time by 40%, and integrated models into backend APIs and streaming pipelines.

GE Digital logoGD

Data Scientist

GE Digital

Jun 2016 - Mar 2018 (1 year 9 months)

Implemented production data pipelines for continuous ingestion, training, and feedback loops across fleet data, integrating edge data to cloud storage and analytics. Improved transportation from edge to source by moving to Spark Streaming and optimizing Cassandra modeling, reducing SLA from ~2 hours to ~5 minutes and fetch latency from 13 seconds to 3 seconds.

Education

Degrees, certifications, and relevant coursework

Southern Illinois University Edwardsville logoSE

Southern Illinois University Edwardsville

Master of Science, Computer Science

2019 - 2021

Earned a Master of Science in Computer Science at Southern Illinois University Edwardsville from 2019 to 2021.

San Jose State University logoSU

San Jose State University

Bachelor of Science, Computer Engineering

2011 - 2016

Earned a Bachelor of Science in Computer Engineering at San Jose State University from 2011 to 2016.

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan