Data Scientist/ML Engineer with 5 years of experience building end-to-end data solutions. I focus on business metrics and product behavior to solve complex business problems. I hold a Bachelor of Technology in Electrical Engineering from PEC University of Technology and a PG Certification in Analytics from IIM Lucknow. Throughout my career, I have developed expertise in Python, SQL, and R, and in data engines such as Spark and BigQuery. I am proficient in frameworks and libraries such as scikit-learn, Keras, TensorFlow, PyTorch, Pandas, NumPy, Django, Flask, and OpenCV, as well as in NLP. I have hands-on experience with tools such as Docker, MongoDB, Git, PostgreSQL, MySQL, Kafka, Hadoop, and Kubernetes. Additionally, I have worked on platforms such as AWS (EC2, ECR, ECS, SageMaker), Azure, GCP, and Databricks. My soft skills include storytelling, public speaking, leadership, and product management. I am also well-versed in Gen AI technologies such as Transformers, BERT, GPT, LangChain, Llama, Ray Serve, RAG, LoRA, Semantic Kernel, and large-scale LLMs.
In my current role as a Senior Manager at JIO-AI COE India, I have created and implemented a no-code, end-to-end cascaded framework for non-linear constrained optimization, using artificial neural networks for real-time optimization and enhanced process control of manufacturing equipment across the oil and gas industry. I have also developed document extraction techniques using a RAG agent with OpenAI and Mistral, reducing operational manpower by 85%. Additionally, I have built a CNN model for remaining useful life (RUL) prediction of exchangers and developed a YOLOv8 model pipeline for detecting sensor values on field devices.
Prior to this, I worked as a Lead Machine Learning Engineer at Klaim.ai, where I developed a statistical financing portal and designed a complex rule engine and a deep learning model for medical claims detection. I also established a new data pipeline on Spark and curated an embedded analytics product for clients on Tableau.
Before that, I served as a Senior Data Scientist at HDFC Ergo Pvt. Ltd, where I spearheaded a classification project using spatial clustering, built a fraud detection model, implemented natural language processing techniques, and created a time series predictive model for COVID-19 impact analysis.
As a Data Science Intern at Absolutdata Analytics Ltd, I worked on real-time anomaly detection and signal processing projects for a major energy client and delivered a synthetic control API for hypothesis testing to major clients.