Ishan Aggarwal
@ishanaggarwal
Software Development Engineer building fault-tolerant cloud systems and LLM-powered APIs.
What I'm looking for
I’m a Software Development Engineer with 2+ years of experience building cloud-native distributed systems and high-throughput APIs across fintech and AI platforms. I focus on system reliability and performance, delivering fault-tolerant microservices that process $2B+ in daily transactions at 99.95% availability.
In my current role at Data Alchemy AI, I architect an AI Interview Coach and Data Catalog using GPT, Claude, LangChain, WebRTC, and Azure OpenAI. I’ve delivered sub-200ms real-time voice coaching with multi-turn conversational memory, and I deployed multi-tenant LLM agents on Azure Kubernetes Service with session-aware context management—reducing context retrieval overhead by 3x across 500+ concurrent users.
Previously at Otomashen Inc., I built GPT-4 and BERT recommendation engines with RAG across 100,000+ educational resources, improving course completion from 12% to 34%. I also engineered LLM inference infrastructure on AWS EKS using model quantization and GPU-accelerated serving, processing 8,000+ requests/day at 40% lower compute cost while maintaining 99.9% availability.
Earlier, at Bank of America, I redesigned a legacy monolith into 15+ fault-tolerant Python microservices with Docker, ECS, and Terraform on AWS—reaching 99.95% uptime and eliminating $1.2M in annual infrastructure costs. I bring a strong foundation in data structures, algorithms, and system design, and I enjoy turning experimentation into standards through A/B testing, observability, and cross-team iteration.
Experience
Work history, roles, and key accomplishments
Software Engineer
Data Alchemy AI
Oct 2025 - Present (8 months)
Architected an AI Interview Coach and Data Catalog using GPT/Claude and LangChain, delivering sub-200ms real-time voice coaching with multi-turn conversational memory and fault-tolerant sessions. Deployed multi-tenant LLM agents on Azure AKS, reducing context retrieval overhead 3x across 500+ concurrent users, and cut LLM iteration cycles from 3 days to under 6 hours (12x).
Software Engineer (Co-op)
Otomashen Inc.
Jan 2025 - Jun 2025 (5 months)
Built a GPT-4/BERT recommendation engine with a RAG pipeline over 100,000+ educational resources, increasing course completion rates from 12% to 34% (183% lift). Architected LLM inference infrastructure on AWS EKS with model quantization for 8,000+ requests/day at 40% lower compute cost while maintaining 99.9% availability.
Research Assistant
Northeastern Civic AI Lab
Jan 2024 - Apr 2025 (1 year 3 months)
Fine-tuned BERT on political speech data using RLHF, achieving 89% accuracy and 25% bias reduction. Mentored 40+ graduate students and led hands-on AI/ML workshops to support continuous learning.
Software Engineer
Bank of America
Jun 2022 - Jul 2023 (1 year 1 month)
Redesigned a legacy investment platform into 15+ fault-tolerant Python microservices deployed via Docker, ECS, and Terraform on AWS, achieving 99.95% uptime and eliminating $1.2M in annual infrastructure costs. Built a FastAPI gateway with load balancing, Redis caching, and JWT auth that sustained 10,000+ req/s at sub-50ms p99 latency under peak load, and orchestrated Kubernetes migrations using HPA.
Software Engineering Intern
AWC Software Pvt. Ltd.
Jan 2021 - Jun 2021 (5 months)
Revamped 10+ web apps using React and TypeScript with lazy loading and code splitting, improving page load performance by 45%. Automated CI/CD with GitHub Actions, Docker, and AWS S3/CloudFront, reducing deployment time from 4 hours to under 30 minutes.
Education
Degrees, certifications, and relevant coursework
Northeastern University
Master of Science, Computer Science
2023 - 2025
Pursued a Master of Science in Computer Science, covering coursework in program design, web development, data structures and algorithms, cloud, databases, NLP, and AI foundations.
Vellore Institute of Technology
Bachelor of Technology, Information Technology
2018 - 2022
Activities and societies: Winner/finalist at 4+ major technical hackathons; delivered full-stack AI prototypes within 48-hour constraints.
Earned a Bachelor of Technology in Information Technology, with experience building full-stack AI prototypes during multiple technical hackathons.
