Skip to main content
HimalayasHimalayas logo
Mauzam AliMA
Open to opportunities

Mauzam Ali

@mauzamali

Machine Learning & AI engineer specializing in LLM/RAG systems and real-time ML APIs that cut costs.

Pakistan
Message

What I'm looking for

I’m looking to build and deploy production-grade AI—especially LLM/RAG systems and real-time ML APIs—while owning the full ML lifecycle, optimizing latency and cloud cost, and working cross-functionally to deliver measurable business impact.

Machine Learning & AI Engineer with 7+ years of experience building and deploying scalable AI systems across NLP, Computer Vision, and Generative AI. I specialize in developing LLM-powered applications, Retrieval-Augmented Generation (RAG) systems, and real-time ML APIs for production environments, while owning the full ML lifecycle from data engineering to deployment and optimization.

At BinarySol, I architected and deployed enterprise-grade LLM systems using OpenAI and open-source models, and built advanced RAG-based enterprise document intelligence that reduced manual query handling by 70%. I designed production-ready microservices with FastAPI and Docker, delivered high-concurrency real-time inference with optimized latency/load balancing, and led computer vision pipelines using YOLOv5/YOLOv8. I also optimize model cost with pruning and quantization, integrate CI/CD for continuous deployment and automated retraining, and collaborate closely with product, data, and DevOps teams to align AI solutions with business KPIs.

Experience

Work history, roles, and key accomplishments

BI
Current

Senior AI/ML Specialist

BinarySol

Jun 2023 - Present (3 years)

Architected and deployed enterprise LLM systems using OpenAI and open-source models, building RAG document intelligence that reduced manual query handling by 70%. Developed real-time, high-concurrency ML APIs and AI microservices, and optimized models to reduce cloud costs.

Education

Degrees, certifications, and relevant coursework

University of the Punjab logoUP

University of the Punjab

Bachelor of Computer Science, Computer Science

2014 - 2018

Activities and societies: Projects: Context-Aware AI Chatbot (LLM + RAG), Real-Time Fraud Detection, AI-Based Traffic Monitoring (YOLO + OpenCV), Intelligent Invoice Processing (OCR + NLP).

Bachelor's in Computer Science with hands-on project work including an LLM + RAG context-aware chatbot, real-time fraud detection, AI-based traffic monitoring, and OCR + NLP invoice processing.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan