Open to opportunities

Nagaraj Moger

@nagarajmoger

Message

AI-focused software engineer building production LLM backends, RAG pipelines, and full-stack apps that scale.

India

Message

What I'm looking for

I’m looking to build and scale production LLM/AI systems—ETL, RAG pipelines, and ML services—with strong engineering discipline on AWS. I want teams that value uptime, measurable latency improvements, and continuous monitoring with fast CI/CD.

I’m a results-driven Software Engineer with 3+ years of experience designing and deploying AI-powered backend systems, production LLM pipelines, and full-stack web applications. I focus on shipping reliable ML services with measurable performance and real business impact.

At Itwine Technology Pvt Ltd, I architected end-to-end ETL pipelines ingesting and preprocessing 5M+ data points daily, and I cut manual data preparation effort by 60% through OpenAI GPT fine-tuning workflows. I also designed and deployed 4 production ML models via RESTful microservices backed by Amazon Bedrock and JWT-secured APIs, achieving 99.9% uptime and sub-200ms inference latency across 10K+ daily requests.

I build intelligent RAG pipelines with LangChain, OpenAI embeddings, and vector databases (Pinecone/Chroma) to reduce manual query resolution time by 35%. From Dockerized zero-downtime AWS deployments to MLflow-driven monitoring and scheduled retraining, I deliver systems that stay healthy in production—supported by frontend integrations using Angular/React/Vue and impactful client projects.

Experience

Work history, roles, and key accomplishments

Software Engineer (AI)

Itwine Technology Pvt Ltd

Oct 2022 - Nov 2025 (3 years 1 month)

Architected end-to-end Python ETL pipelines ingesting 5M+ data points daily and cut manual data preparation effort by 60% by feeding curated datasets into OpenAI GPT fine-tuning workflows. Built and deployed 4 production ML models via JWT-secured REST microservices on AWS Bedrock, achieving sub-200ms inference latency and 99.9% uptime while reducing API response time from 800ms to 210ms.

Python ETL OpenAI GPT Fine Tuning Vector Databases Microservices Docker

Education

Degrees, certifications, and relevant coursework

JAIN (Deemed-to-be University)

Master of Computer Applications (MCA), Computer Applications

2023 - 2025

Master of Computer Applications (MCA) with coursework in machine learning, deep learning, natural language processing, cloud computing, distributed systems, and database management.