Open to opportunities

Samir Talkal

@samirtalkal

AI Engineer specializing in LLMs and agentic systems, building production-ready RAG and APIs.

India
What I'm looking for

I’m looking for a role where I build LLM/RAG and agentic systems end to end: shipping low-latency APIs, improving reliability, and collaborating closely with teams to turn prototypes into production.

I’m an AI Engineer with 3+ years of experience in backend development and data engineering, specializing in LLMs, RAG pipelines, and conversational AI systems. I enjoy turning complex AI workflows into scalable, production-ready services that teams can rely on.

At Quest Global, I built an on-prem LLM-based RAG system with reranking, cutting manual information retrieval time by ~40% and improving response accuracy. I also scaled API performance by transitioning to async FastAPI microservices, lowering latency by ~30% using LangChain orchestration.

Previously, I reduced infrastructure costs by 30% with fault-tolerant APIs on AWS Lambda and API Gateway, and improved request success rate by 10% using OpenAI APIs for translation and response optimization. Earlier, I streamlined multi-GB ETL into Redshift with Informatica and Airflow, and developed Power BI dashboards for real-time operational insights.

Experience

Work history, roles, and key accomplishments

Current

Software Engineer AI/ML

Quest Global

Sep 2025 - Present (9 months)

Built an on-prem LLM RAG system with reranking, reducing manual information retrieval time by ~40% and improving response accuracy. Scaled FastAPI microservices to lower latency by ~30% and automated AI-powered ticket creation, reducing manual effort by ~60%.

Full Stack Software Engineer

SaayamForAll

Jul 2024 - Jul 2025 (1 year)

Reduced infrastructure costs by 30% by building fault-tolerant APIs and deploying on AWS Lambda and API Gateway. Improved multilingual QA latency and request success rate by 10% using LLM-powered Flask chatbots and OpenAI APIs for translation and response optimization.

Machine Learning Research Intern

IIT-BHU

May 2019 - Jul 2019 (2 months)

Improved classification accuracy by up to 15% across 40 UCR datasets using novel time-series techniques. Boosted model accuracy by 5–15% with SVM and XGBoost using a hybrid feature set (DTW, WEASEL, L1 distance).

Education

Degrees, certifications, and relevant coursework

New York University

Master of Science in Computer Engineering

2022 - 2024

GPA: 3.83 / 4.00

Master of Science in Computer Engineering at New York University (Sep 2022–May 2024), focused on graduate-level computer engineering coursework.

Vellore Institute of Technology

Bachelor of Technology in Information Technology

2016 - 2020

GPA: 8.62 / 10.00

Bachelor of Technology in Information Technology at Vellore Institute of Technology (Jul 2016–Jun 2020) with an undergraduate focus on IT fundamentals.
