Hafeez Iqbal
@hafeeziqbal1
Senior AI/ML Engineer turning research models into reliable, cost-effective production systems.
What I'm looking for
I’m a Senior AI/ML Engineer with over 8 years of experience moving complex machine learning models from research environments into high-scale production. I focus on designing MLOps frameworks, automated pipelines, and distributed infrastructure that keep systems reliable and cost-effective.
I specialize in the Generative AI transition—optimizing Large Language Models (LLMs) with retrieval-augmented generation (RAG) and model quantization. In my recent role, I built an automated MLOps framework with MLflow that streamlined model promotion from R&D experimentation to production deployment.
I also deliver performance at scale: I developed a high performance RAG pipeline using Pinecone and LLMs to improve accuracy for complex domain queries. I optimized large-scale transformer models for real-time use with TensorRT and quantization to significantly reduce inference latency, and I scaled training to billion-parameter levels using distributed protocols on multi-GPU clusters with DeepSpeed.
To keep models healthy long-term, I establish rigorous monitoring and observability standards to detect data drift and maintain production integrity. Previously, I architected automated ingestion and augmentation pipelines, built internal performance dashboards with Flask and Streamlit, and operationalized systems with Docker, CI/CD, Kubernetes, and large-scale Spark workflows—from training to deployment to iteration.
Experience
Work history, roles, and key accomplishments
Senior AI/ML Engineer
Nimer Tech
Aug 2022 - Present (3 years 7 months)
Owned the end-to-end ML lifecycle for global predictive services by designing scalable PyTorch and AWS SageMaker architectures. Built an automated MLflow-based MLOps framework, developed a Pinecone-based RAG pipeline to improve domain response accuracy, and reduced inference latency using TensorRT and model quantization.
Architected automated data ingestion and augmentation pipelines to improve robustness across diverse, large-scale datasets. Delivered end-to-end vision and regression model development, implemented Docker-based versioned model lifecycle management, and deployed containerized models via CI/CD into staging environments.
Productionalized ensemble-based anomaly detection systems for real-time industrial monitoring, improving availability and reliability for mission-critical workloads. Built terabyte-scale feature engineering workflows with Spark and deployed BERT/spaCy NLP microservices in a high-availability Kubernetes environment with full CI/CD and automated hyperparameter sweeping.
Education
Degrees, certifications, and relevant coursework
Hafeez hasn't added their education
Don't worry, there are 90k+ talented remote workers on Himalayas
Tech stack
Software and tools used professionally
GitHub
Kubernetes
Jenkins
GitHub Actions
Gmail
Databricks
Terraform
Azure Machine Learning
TensorFlow
PyTorch
MLflow
scikit-learn
Kubeflow
Streamlit
DataRobot
DeepSpeed
FastAPI
Grafana
Prometheus
Milvus
Airflow
SQL
XGBoost
LangChain
LlamaIndex
Weights & Biases
Pinecone
Tecton
Feast
Ray
Delta Lake
JAX
ONNX Runtime
Agentic
Modal
Keep
100ms
Availability
Location
Authorized to work in
Salary expectations
Job categories
Skills
Interested in hiring Hafeez?
You can contact Hafeez and 90k+ other talented remote workers on Himalayas.
Message HafeezFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
