Skip to main content
HimalayasHimalayas logo
Muhammad MurtazaMM
Open to opportunities

Muhammad Murtaza

@muhammadmurtaza1

Senior AI Engineer specializing in multi-agent LLM systems, RAG, and production-grade automation.

Pakistan
Message

What I'm looking for

I want to build production AI agent and LLM/RAG systems that scale reliably—focused on low latency, cost optimization, and solid MLOps. I enjoy multi-agent orchestration, human-in-the-loop flows, and shipping full-stack AI products end to end.

Senior Python AI Engineer with 4+ years of production experience building backend systems and APIs for AI-powered applications. I specialize in LLM orchestration (LangChain, LangGraph, LlamaIndex), RAG pipeline design, and multi-agent architectures that automate real business workflows at scale.

I've shipped systems serving 50K+ monthly users, processing 2M+ document pipelines, and handling 100K+ daily API requests at sub-200ms latency — reducing inference time by 25% and cloud costs by 30% through model quantization and infrastructure optimization.

My RAG work spans Pinecone, FAISS, ChromaDB, Weaviate, and Milvus with hybrid semantic search at 92% retrieval accuracy, evaluated using Ragas and DeepEval. I've fine-tuned LLMs including Llama 3 using LoRA and QLoRA via Hugging Face, achieving 35% accuracy improvement over baseline. I integrate and orchestrate foundation models across AWS Bedrock, OpenAI GPT-4, Anthropic Claude, Google Gemini, and Mistral — selecting the right model per cost, latency, and task.

On the backend, I build scalable FastAPI microservices with async patterns, structured JSON/function-calling outputs, and security best practices including HIPAA-compliant data handling and JWT authentication. I prototype rapidly with Streamlit and Gradio before productionizing, and deploy with Docker, Kubernetes, and CI/CD pipelines with MLflow experiment tracking for zero-downtime releases.

I work independently, communicate clearly across technical and non-technical teams, and ship reliable AI products people use every day.

Experience

Work history, roles, and key accomplishments

VT

Generative AI Engineer

Vision Byte Technologies

Jan 2023 - Aug 2025 (2 years 7 months)

Designed and deployed production multi-agent AI systems using LangChain/LangGraph, improving workflow efficiency by 40% through parallel orchestration and tool integration. Built RAG pipelines for 2M+ documents (92% retrieval accuracy) and served 100K+ daily API requests with sub-200ms latency while reducing cloud costs 30% and inference time 25%.

DC

ML & Deep Learning Engineer

Dot Coder

Jan 2021 - Nov 2022 (1 year 10 months)

Built and deployed production ML/NLP systems for text classification and sentiment analysis, reaching 89% accuracy on real-world datasets. Developed ML pipelines with automated monitoring and REST APIs for reliable model serving, and optimized deployments using Docker for consistent CI/CD integration.

Education

Degrees, certifications, and relevant coursework

Kohat University of Science & Technology logoKT

Kohat University of Science & Technology

Bachelor of Science, Information Technology

2021 - 2025

Bachelor of Science in Information Technology at Kohat University of Science and Technology (Oct 2021–Jun 2025), covering machine learning, natural language processing, deep learning, data science, and healthcare informatics.

Interested in hiring Muhammad?

You can contact Muhammad and 90k+ other talented remote workers on Himalayas.

Message Muhammad

People also viewed

View all talent

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan