Mason Iqbal
@masoniqbal
Senior Generative AI engineer building scalable LLM and RAG systems that improve accuracy and reduce latency.
What I'm looking for
I’m a Senior Generative AI and Machine Learning Engineer with 8+ years of experience building and scaling production-grade AI systems. I focus on Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and end-to-end ML pipelines—translating business requirements into measurable impact in high-scale environments.
In recent roles, I designed and deployed LLM-based applications using RAG pipelines, improving information retrieval accuracy by 30% and building AI assistants that support thousands of users with sub-200ms response times. I’ve also implemented vector search systems (FAISS/Pinecone), reduced deployment time by 40%, optimized inference with quantization and batching (cutting compute costs by 25%), and strengthened reliability through CI/CD, automated retraining, and model monitoring.
Experience
Work history, roles, and key accomplishments
Senior Machine Learning Engineer
Istream Solution
Jan 2023 - Present (3 years 5 months)
Designed and deployed LLM-based RAG applications, increasing information retrieval accuracy by 30% and improving query performance by 35%. Built end-to-end ML pipelines, cutting deployment time by 40%, and reduced inference compute costs by 25% through quantization and batching.
AI/ML Engineer
Streamline Analytics
May 2020 - Dec 2022 (2 years 7 months)
Developed fraud detection models using XGBoost and LightGBM, improving detection accuracy by 25%, and built real-time Kafka streaming pipelines that reduced processing latency by 50%. Deployed ML models as REST APIs and implemented CI/CD with automated retraining, while prompt engineering and fine-tuning improved LLM response accuracy by 25%.
Machine Learning Engineer
NextEra Analytics
Aug 2017 - Apr 2020 (2 years 8 months)
Built healthcare NLP systems for entity recognition, improving accuracy by 20%, and developed predictive models for patient risk scoring and analytics. Designed secure, modular ML architectures and pipelines to improve maintainability by 20% and reduce API response times by 30% using caching and asynchronous processing, supported by A/B testing.
Education
Degrees, certifications, and relevant coursework
Mason hasn't added their education
Don't worry, there are 90k+ talented remote workers on Himalayas
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Mason?
You can contact Mason and 90k+ other talented remote workers on Himalayas.
Message MasonFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
