Mauzam Ali
@mauzamali
Machine Learning & AI engineer specializing in LLM/RAG systems and real-time ML APIs that cut costs.
What I'm looking for
Machine Learning & AI Engineer with 7+ years of experience building and deploying scalable AI systems across NLP, Computer Vision, and Generative AI. I specialize in developing LLM-powered applications, Retrieval-Augmented Generation (RAG) systems, and real-time ML APIs for production environments, while owning the full ML lifecycle from data engineering to deployment and optimization.
At BinarySol, I architected and deployed enterprise-grade LLM systems using OpenAI and open-source models, and built advanced RAG-based enterprise document intelligence that reduced manual query handling by 70%. I designed production-ready microservices with FastAPI and Docker, delivered high-concurrency real-time inference with optimized latency/load balancing, and led computer vision pipelines using YOLOv5/YOLOv8. I also optimize model cost with pruning and quantization, integrate CI/CD for continuous deployment and automated retraining, and collaborate closely with product, data, and DevOps teams to align AI solutions with business KPIs.
Experience
Work history, roles, and key accomplishments
Senior AI/ML Specialist
BinarySol
Jun 2023 - Present (3 years)
Architected and deployed enterprise LLM systems using OpenAI and open-source models, building RAG document intelligence that reduced manual query handling by 70%. Developed real-time, high-concurrency ML APIs and AI microservices, and optimized models to reduce cloud costs.
AI/ML Engineer & Data Scientist
H2o Labs
Jan 2021 - Dec 2022 (1 year 11 months)
Delivered end-to-end AI solutions for predictive analytics, classification, clustering, and recommendation systems. Built ETL pipelines and deployed real-time NLP and computer vision models via production ML APIs.
Junior ML Engineer & Data Scientist
ESS Junior
Jan 2018 - Dec 2020 (2 years 11 months)
Developed machine learning models for classification, regression, and forecasting, performing end-to-end data preprocessing and feature engineering. Built deep learning models in TensorFlow/Keras and supported deployment and validation of ML models from staging to production.
Education
Degrees, certifications, and relevant coursework
University of the Punjab
Bachelor of Computer Science, Computer Science
2014 - 2018
Activities and societies: Projects: Context-Aware AI Chatbot (LLM + RAG), Real-Time Fraud Detection, AI-Based Traffic Monitoring (YOLO + OpenCV), Intelligent Invoice Processing (OCR + NLP).
Bachelor's in Computer Science with hands-on project work including an LLM + RAG context-aware chatbot, real-time fraud detection, AI-based traffic monitoring, and OCR + NLP invoice processing.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Mauzam?
You can contact Mauzam and 90k+ other talented remote workers on Himalayas.
Message MauzamFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
