Zaafir Rizwan
@zaafirrizwan
Senior AI Engineer specializing in Voice AI, LLMs & RAG. 3+ years shipping production systems at sub-500ms latency across GCP, Azure & AWS.
What I'm looking for
I am a AI Engineer specializing in Voice AI, Large Language Models (LLMs), and Retrieval-Augmented Generation (RAG) with a strong focus on building production-grade AI systems. Over the past few years, I’ve worked in fast-paced startup environments where I’ve designed and deployed scalable AI infrastructure across AWS, GCP, and Azure, delivering real-world AI products with sub-500ms latency and high reliability. zaafir_rizwan
My expertise lies in real-time conversational AI, voice agents, and agentic AI systems. I have built end-to-end voice AI pipelines using technologies like LiveKit, WebRTC, Deepgram, ElevenLabs, and Cartesia, integrating them with telephony platforms and enterprise systems to create intelligent voice assistants. I also design advanced RAG architectures, multimodal AI workflows, and LLM-based applications, leveraging models such as Claude, Gemini, and Llama. zaafir_rizwan
Beyond model development, I focus heavily on MLOps and AI infrastructure, using tools like Docker, Kubernetes, OpenTelemetry, ClickHouse, and MLflow to ensure scalable deployment, monitoring, and continuous improvement of AI systems. My work has helped reduce operational costs by up to 40% and accelerate deployment cycles by 60%, while maintaining production reliability. zaafir_rizwan
I’m particularly interested in voice AI, agentic systems, and real-time multimodal AI, and I enjoy building systems that bridge cutting-edge research with practical applications. I’m always excited to collaborate with teams pushing the boundaries of AI-driven products and intelligent automation.
Experience
Work history, roles, and key accomplishments
AI Engineer
TrifusionTech
Dec 2025 - Present (4 months)
Built AI infrastructure (Docker/Kubernetes) on GCP/Azure/AWS, cutting costs 40%. Engineered voice AI pipelines with LiveKit, Deepgram & ElevenLabs at sub-500ms latency. Developed RAG systems, LLMOps observability, MCP integrations & NLP recruitment tools — reducing hiring time 45% and deployment cycles 60%.
AI Engineer
Quest Lab
Aug 2023 - Oct 2025 (2 years 2 months)
Built containerized AI infrastructure (Docker/Kubernetes) on GCP, Azure & AWS, cutting costs 40%. Architected RAG systems, fine-tuned LLMs, and developed NLP-powered recruitment tools reducing hiring time 45%. Automated image/video workflows with ComfyUI, slashing design production 60%.
AI Intern
Folio3
Jun 2022 - Present (3 years 10 months)
Developed computer vision models using PyTorch and OpenCV, contributing to a key proof-of-concept for an automated product tagging system. Gained hands-on experience in the complete data science workflow, from data preprocessing and model experimentation to cloud-based deployment.
Education
Degrees, certifications, and relevant coursework
National University of Computer and Emerging Sciences (NUCES)
Bachelor of Science, Artificial Intelligence
2019 - 2023
Activities and societies: Relevant Coursework: Natural Language Processing, Deep Learning, Data Analytics, Cloud Computing, MLOps, Machine Learning. Final Year Project: Real-time Pattern Detection using Deep Learning and Automated Analysis.
Completed a Bachelor of Science in Artificial Intelligence, focusing on key areas such as Natural Language Processing, Deep Learning, Data Analytics, Cloud Computing, MLOps, and Machine Learning. Undertook a final year project on Real-time Pattern Detection using Deep Learning and Automated Analysis.
Tech stack
Software and tools used professionally
AWS Glue
AWS IAM
Amazon EC2
Microsoft Azure
Kubernetes
Cloudflare
Azure Pipelines
Jupyter
NumPy
Pandas
Gmail
Node.js
OpenCV
Terraform
React
JavaScript
Python
AWS Elastic Load Balancing ...
TensorFlow
PyTorch
scikit-learn
Azure Active Directory
Gemini
AWS Lambda
Azure Functions
Deepgram
Docker
Airflow
Amazon Web Services (AWS)
SQL
Amazon SageMaker
Azure Blob Storage
Hugging Face
LiveKit
Google Cloud Vertex AI Workbench
Cartesia
Availability
Location
Authorized to work in
Website
zaafir-rizwan.techSocial media
Job categories
Interested in hiring Zaafir?
You can contact Zaafir and 90k+ other talented remote workers on Himalayas.
Message ZaafirFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
