Skip to main content
SN
Open to opportunities

Sandeep Nayak

@sandeepnayak

AI & LLM Engineer delivering production-ready RAG/agents with cost optimization and automation.

Japan
Message

What I'm looking for

I’m looking to build production LLM/RAG/agent systems end-to-end—focusing on reliability, observability, and measurable cost/value improvements. I want to work with teams that iterate fast, evaluate rigorously, and scale inference efficiently.

I’m an AI/LLM Engineer with 7+ years building production systems that turn business needs into measurable value—especially through RAG, LLM agents, and multi-agent workflows. At Gaudiy Inc, I delivered a BI conversational platform using Hybrid/GraphRAG over millions of user records, built multi-agent fan insight pipelines with sentiment and topic modeling, and reduced response time and compute cost with a multi-agent routing system.

I focus on end-to-end deployment, reliability, and observability: I co-architected an auto-scaling GPU inference platform on GKE with IaC (Terraform), CI/CD, and traceability (LangSmith + Datadog), and I introduced structured evaluation and prompt optimization using DSPy with guardrails. Previously at Rakuten Mobile, I improved analytics and extraction with LLM/RAG, led customer propensity and time-series models, and operationalized AI automation that cut ticket MTTR (patent applied); I also bring an operations-minded background from Senior Manager planning at Tata Motors.

Experience

Work history, roles, and key accomplishments

GI
Current

AI Engineer

Gaudiy Inc

Apr 2024 - Present (2 years 2 months)

Delivered a production BI conversational tool supporting a 10B yen investment story using hybrid RAG over millions of user records, and built multi-agent pipelines for fan sentiment, topic modeling, and merchandise extraction. Reduced debugging time ~30% with end-to-end observability, and co-architected a cost-efficient auto-scaling GPU inference platform on GKE.

RM

Software Engineer (AI)

Rakuten Mobile

Sep 2022 - Mar 2024 (1 year 6 months)

Built an LLM analytics and knowledge-workflows stack using RAG-based extraction and summarization, cutting query handling time 40% and improving text processing efficiency 80%. Led customer propensity modeling, deployed a time-series DNN energy optimization model saving ~15M yen/year, and operationalized three production AI ticket systems (patent applied) that reduced manual effort by ~400 hours/p

Education

Degrees, certifications, and relevant coursework

Tohoku University logoTU

Tohoku University

Post-Graduate Researcher, Applied Machine Learning

2021 - 2022

Conducted applied machine learning research at Tohoku University, including 97% emotion estimation of working dogs from ECG and ML-based prediction of human motion for autonomous navigation.

Tohoku University logoTU

Tohoku University

Master’s in Applied Information Science, Data Science

2019 - 2021

Grade: 3.77/4.0

Completed a Master’s in Applied Information Science (Data Science) at Tohoku University with a GPA of 3.77/4.0.

NIT Karnataka logoNK

NIT Karnataka

Bachelor of Engineering, Mechanical Engineering, Robotics & Controls

2012 - 2016

Grade: 8.15/10

Earned a B.Eng. in Mechanical Engineering, Robotics & Controls at NIT Karnataka with a GPA of 8.15/10.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan