Alok Kushwaha
@alokkushwaha
NLP and ML engineer building end-to-end systems for document intelligence and financial risk modelling—from data to browser-ready products.
What I'm looking for
I’m an NLP and ML engineer focused on taking a modelling problem from raw data to a working system—training code, APIs, and a frontend people can open in a browser. My core strength is owning features end-to-end and shipping things that perform in real workflows.
I’ve published a peer-reviewed paper on financial contract risk analysis using Transformer-based clause modelling and graph neural networks. Alongside that, I’ve built and deployed four projects spanning financial risk modelling, legal NLP, and multi-agent AI.
In internships, I improved downstream model quality by building LLM fine-tuning preprocessing pipelines, and I benchmarked RAG workflows across FAISS, Chroma, and Pinecone to optimize retrieval precision/recall. I also annotated multi-million-record datasets with inter-rater reliability scoring to surface systematic labelling errors and reduce ingestion risk.
From an engineering perspective, I’ve owned ingestion pipelines, scheduling, and error logging, cutting manual preparation time by ~8 hrs/week. I’ve also tuned data and APIs for performance—diagnosing slow SQL, redesigning indexing, and reducing latency by 30%—while delivering full pipelines like OCR → clause segmentation → graph dependency modelling → FastAPI + Next.js.
Experience
Work history, roles, and key accomplishments
GenAI Intern
Innomatics Research Labs
Nov 2025 - Feb 2026 (3 months)
Built LLM fine-tuning preprocessing pipelines (tokenization, embedding, normalization) and improved downstream model reliability by reducing downstream model errors by 18%. Benchmarked RAG workflows across FAISS, Chroma, and Pinecone to select an optimal retrieval configuration using precision/recall evaluation.
Software Developer Intern
MGrid Technologies
Nov 2024 - Jun 2025 (7 months)
Owned end-to-end Python/Node.js ingestion pipelines (scripts, scheduling, and error logging), eliminating ~8 hours/week of manual data preparation. Optimized SQL performance by using EXPLAIN plans to redesign indexing and rewrite critical queries, reducing latency by 30% verified on production logs.
Education
Degrees, certifications, and relevant coursework
SIES College of Arts, Science & Commerce
Master of Science (MSc), Data Science
2024 - 2026
Grade: CGPA 8.57
Pursuing an MSc in Data Science at SIES College of Arts, Science & Commerce in Mumbai (CGPA: 8.57). Coursework focused on data science fundamentals and applied machine learning concepts.
SIES College of Arts, Science & Commerce
Bachelor of Science (BSc), Information Technology
2021 - 2024
Grade: CGPA 9.03
Completed a BSc in Information Technology at SIES College of Arts, Science & Commerce in Mumbai (CGPA: 9.03). Built foundational knowledge in IT and software/data-focused coursework.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Interested in hiring Alok?
You can contact Alok and 90k+ other talented remote workers on Himalayas.
Message AlokFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
