Skip to main content
Tanuj SaxenaTS
Open to opportunities

Tanuj Saxena

@tanujsaxena

Software Development Engineer building AI/ML systems and distributed, RAG-based retrieval.

India
Message

What I'm looking for

I’m looking for a role where I can build production-ready RAG/NLP pipelines and distributed systems, optimize retrieval quality and latency, and ship scalable AI features that measurably improve document workflows.

I’m a Software Development Engineer focused on building AI/ML systems that are scalable, efficient, and retrieval-first—especially for NLP, RAG, and distributed architectures. I enjoy turning complex document workflows into dependable automation and measurable performance gains.

In my internship at Cvent, I engineered NLP automation workflows using Python and transformer-based pipelines, improving enterprise content turnaround by 30%. I also developed semantic indexing and retrieval across 10K+ enterprise documents using vector embeddings and optimized query pipelines.

At The AIZoned, I architected scalable retrieval pipelines using Pinecone and transformer embeddings across 50K+ vectors. I built multi-agent asynchronous AI workflows using LangGraph, reducing execution latency by 28%, and fine-tuned LLaMA models using LoRA/QLoRA while streamlining training workflows for efficient GPU utilization.

My projects reflect this same engineering mindset: scalable indexing and structured knowledge mapping for a Legal AI Knowledge Graph, a cloud-native Sanskrit RAG platform using SanskritBERT embeddings and FAISS (with hybrid BM25 + dense search improving retrieval quality by 18%), and a multimodal Virtual Try-On system using DensePose, UNet synthesis, and Stable Diffusion that reduced preprocessing and alignment latency by 22%.

Experience

Work history, roles, and key accomplishments

Cvent logoCV

Business Analytics Intern

Cvent

Jun 2025 - Mar 2026 (9 months)

Engineered NLP automation workflows using Python and transformer-based pipelines, improving enterprise content turnaround by 30%. Built semantic indexing and retrieval systems for 10K+ documents using vector embeddings and optimized query pipelines.

TA

GenAI & LLMOps Intern

The AIZoned

Jan 2025 - Jun 2025 (5 months)

Architected scalable retrieval pipelines using Pinecone and transformer embeddings across 50K+ vectors. Built multi-agent asynchronous AI workflows with LangGraph and fine-tuned LLaMA models with LoRA/QLoRA, reducing execution latency by 28%.

Education

Degrees, certifications, and relevant coursework

Sharda University logoSU

Sharda University

Bachelor of Technology (B.Tech), Computer Science & Engineering (Data Science)

2022 - 2026

Pursuing a B.Tech in Computer Science & Engineering (Data Science) at Sharda University (2022–2026).

SS

St. Peter’s Sr. Sec. School

Senior Secondary Education, Science Stream

2020 - 2022

Completed Senior Secondary Education (Science Stream) at St. Peter’s Sr. Sec. School (2020–2022).

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan