Open to opportunities

Nagaraj Moger

@nagarajmoger

AI-focused software engineer building production LLM backends, RAG pipelines, and full-stack apps that scale.

India

What I'm looking for

I’m looking to build and scale production LLM/AI systems (ETL, RAG pipelines, and ML services) with strong engineering discipline on AWS. I want to join teams that value uptime, measurable latency improvements, continuous monitoring, and fast CI/CD.

I’m a results-driven Software Engineer with 3+ years of experience designing and deploying AI-powered backend systems, production LLM pipelines, and full-stack web applications. I focus on shipping reliable ML services with measurable performance and real business impact.

At Itwine Technology Pvt Ltd, I architected end-to-end ETL pipelines ingesting and preprocessing 5M+ data points daily, and I cut manual data preparation effort by 60% through OpenAI GPT fine-tuning workflows. I also designed and deployed 4 production ML models via RESTful microservices backed by Amazon Bedrock and JWT-secured APIs, achieving 99.9% uptime and sub-200ms inference latency across 10K+ daily requests.

I build RAG pipelines with LangChain, OpenAI embeddings, and vector databases (Pinecone/Chroma), which have reduced manual query resolution time by 35%. From Dockerized zero-downtime AWS deployments to MLflow-driven monitoring and scheduled retraining, I deliver systems that stay healthy in production. I also build frontend integrations in Angular, React, and Vue for client projects.

Experience

Work history, roles, and key accomplishments

Software Engineer (AI)

Itwine Technology Pvt Ltd

Oct 2022 - Nov 2025 (3 years 1 month)

Architected end-to-end Python ETL pipelines ingesting 5M+ data points daily and cut manual data preparation effort by 60% by feeding curated datasets into OpenAI GPT fine-tuning workflows. Built and deployed 4 production ML models via JWT-secured REST microservices on Amazon Bedrock, achieving sub-200ms inference latency and 99.9% uptime while reducing API response time from 800ms to 210ms.

Education

Degrees, certifications, and relevant coursework

JAIN (Deemed-to-be University)

Master of Computer Applications (MCA), Computer Applications

2023 - 2025

Master of Computer Applications (MCA) with coursework in machine learning, deep learning, natural language processing, cloud computing, distributed systems, and database management.
