Open to opportunities

Ahmed Syed

@ahmedsyed

Message

Senior ML/MLOps engineer building production LLM systems—RAG, multi-agent workflows, and low-latency inference on AWS.

United States

Message

What I'm looking for

I’m looking to build end-to-end production LLM/RAG and multi-agent systems in fast-paced teams—optimizing latency, cost, and quality while prioritizing trust & safety, evaluation rigor, and measurable impact.

I’m a Senior ML & MLOps Engineer with 8+ years of experience taking research ideas into production—building production LLM systems, RAG pipelines, multi-agent workflows, and scalable inference platforms. I’m known for iterating fast while optimizing latency, accuracy, and cost, and for owning systems end-to-end from experimentation to deployment and continuous improvement.

At Vizient, I designed and deployed Advanced RAG and Graph RAG systems for an internal search platform, improving answer relevance and reducing retrieval errors. I built hybrid retrieval (BM25 + vector search + RRF reranking), benchmarked multiple RAG architectures, and created evaluation frameworks that measure retrieval and generation quality across configurations and datasets.

I also deploy and optimize production LLM systems on AWS SageMaker, where I improved latency by ~30%. Previously at Rivian and eBay, I built a multi-agent AI assistant for CRM workflows, implemented guardrails and safety controls, engineered distributed ML pipelines, and delivered end-to-end ML/data products—strengthening trust & safety, observability, and reliable execution in real-world environments.

Experience

Work history, roles, and key accomplishments

Current

Senior ML/MLOps Engineer

Current

Vizient, Inc.

Apr 2024 - Present (2 years 3 months)

Designed and deployed advanced RAG and Graph RAG systems for an internal search platform, improving answer relevance and reducing retrieval errors in production. Deployed and optimized production LLM systems on AWS SageMaker, improving latency by ~30% and enabling scalable inference.

Advanced RAG Graph RAG AWS Sagemaker CUDA NCCL Triton ONNX Runtime TensorRT Kubernetes Airflow MLFlow

Senior AI/ML Engineer

Rivian

Apr 2021 - Mar 2024 (2 years 11 months)

Built a multi-agent AI assistant for CRM workflows to automate customer insights, execution, and decision support across sales and support. Implemented multi-agent orchestration with guardrails and an LLM-as-judge evaluation pipeline to improve alignment and reliability while reducing latency and operational cost.

Workflow Agents LLM As Judge Prompt Injection Mitigation Prompt Engineering

Senior Machine Learning Engineer

Ebay

Apr 2018 - Apr 2019 (1 year)

Delivered end-to-end ML solutions from data generation and training through deployment and iteration. Developed NLP pipelines for entity extraction and classification using spaCy and Transformers, cutting manual review time by 40%, and integrated models into backend APIs and streaming pipelines.

Data Pipelines Airflow Spark SpaCy Transformers Feature Engineering A B Testing

Data Scientist

GE Digital

Jun 2016 - Mar 2018 (1 year 9 months)

Implemented production data pipelines for continuous ingestion, training, and feedback loops across fleet data, integrating edge data to cloud storage and analytics. Improved transportation from edge to source by moving to Spark Streaming and optimizing Cassandra modeling, reducing SLA from ~2 hours to ~5 minutes and fetch latency from 13 seconds to 3 seconds.

Kafka Airflow Cassandra S3 CDC (HVR)REST APIs Data Modeling REDIS Streams