Mehroz Azyaan
@mehrozazyaan
Senior AI/ML Engineer building scalable LLM and MLOps platforms that deliver production impact.
What I'm looking for
I’m a Senior AI/ML Engineer with 9+ years building scalable ML platforms, LLM applications, and AI analytics for healthcare, SaaS, and enterprise. I’ve shipped 14 production models serving 2M+ daily predictions, with a focus on reliability, speed, and measurable business outcomes.
At Axonic Technologies (Apr 2022 — Present), I lead a team of 4 engineers operating 14 production AI/ML models across predictive analytics, NLP automation, and intelligent workflows, serving 2M+ predictions per day. I architected an MLOps platform on AWS SageMaker, Kubernetes, MLflow, and Docker—cutting model release cycles from 6 weeks to 48 hours using canary deployment and automated retraining.
I build LLM-powered document intelligence and semantic search using Hugging Face Transformers plus pgvector/Pinecone, processing 40k+ documents per day at 91% retrieval precision. I also engineer real-time GPU inference serving 2M+ predictions daily at p99 latency under 120ms, reducing inference compute cost 38% via quantization and autoscaling.
My engineering ethos is observability and governance: I implemented AI observability with drift detection, feature validation, and SHAP explainability, catching 4 silent regressions before production impact. Previously at Tempus AI and Uptake Technologies, I delivered healthcare ML automation and production predictive systems—designing feature stores, monitoring dashboards, and distributed pipelines while mentoring junior engineers.
Experience
Work history, roles, and key accomplishments
Senior AI / ML Engineer
Axonic Technologies
Apr 2022 - Present (4 years 2 months)
Led a team of 4 engineers operating 14 production AI/ML models across predictive analytics and NLP automation, serving 2M+ predictions per day. Built an AWS SageMaker/Kubernetes/MLflow MLOps platform and LLM semantic search, cutting release cycles from 6 weeks to 48 hours and achieving p99 inference latency under 120ms with 38% lower compute costs.
ML Engineer
Tempus AI
Aug 2019 - Mar 2022 (2 years 7 months)
Built 9 ML pipelines for healthcare analytics, clinical NLP, and predictive modeling across 6 pharmaceutical collaboration studies, automating 75% of manual chart reviews. Trained and deployed models on 18M patient encounter records (up to 0.89 AUC-ROC), and implemented Feast/Delta Lake feature stores and production monitoring achieving 99.9% serving uptime.
Data Scientist
Uptake Technologies
Jun 2017 - Jul 2019 (2 years 1 month)
Developed predictive analytics and anomaly detection models for IoT across mining, energy, and transportation, processing 500M daily sensor readings. Built time-series failure prediction using isolation forests and autoencoders (92% TPR at <2% FPR) and engineered Spark pipelines that produced 140 features enabling 18-month datasets to train in under 4 hours, while reducing training time by 55%.
Education
Degrees, certifications, and relevant coursework
Wheelock College
Master of Science in Computer Science, Computer Science
Earned a Master of Science in Computer Science at Wheelock College in 2017.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Mehroz?
You can contact Mehroz and 90k+ other talented remote workers on Himalayas.
Message MehrozFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
