Open to opportunities

Hassan Pasha

@hassanpasha1

Senior AI Engineer building low-latency multimodal LLM systems with RAG and safety.

United States

What I'm looking for

I’m looking for a team where I can build and scale production LLM/AI systems end-to-end—RAG, low-latency multimodal inference, and rigorous evaluation—while partnering with Product and MLOps to ship safety-conscious features fast.

I’m a Senior AI Engineer with 5+ years designing, deploying, and scaling production AI/ML systems for real-time and multimodal applications. I specialize in LLM-based architectures—RAG pipelines, prompt-based workflows, fine-tuning—plus vision and ASR, with hands-on GPU optimization and low-latency inference.

Across my work, I own the full ML lifecycle, from data ingestion and training through deployment, monitoring, evaluation, and iteration. I build evaluation frameworks to measure faithfulness, relevance, and ranking quality, and I continuously improve accuracy while reducing bias across outputs.

I also operationalize agentic and tool-using LLM systems, including RAG, structured outputs, guardrails, fallback chains, and safety/bias mitigation strategies. In educational environments, I’ve implemented regulatory-compliance guardrails aligned with FERPA/COPPA and helped translate classroom needs into scalable, production-ready AI.

Partnering closely with Product, Engineering, and MLOps teams, I communicate tradeoffs between model complexity, performance, and cost, and I contribute to long-term AI roadmap and architecture decisions. My goal is to deliver reliable, safety-conscious AI systems that perform under real constraints—latency, cost, and deployment velocity.

Experience

Work history, roles, and key accomplishments

AbbVie
Current

Senior AI Engineer

May 2023 - Present (2 years 11 months)

Designed and deployed real-time voice and vision AI systems for educational R&D environments, reducing inference latency by 20–30% and improving system efficiency by 15%. Operationalized LLM-based RAG and agent workflows, accelerating deployment cycles by 25% and improving generative model accuracy by 10–15% while implementing FERPA/COPPA safety guardrails and bias mitigation.

Mass General Brigham

Machine Learning Engineer

Apr 2020 - Apr 2023 (3 years)

Built end-to-end machine learning pipelines in Python for large-scale healthcare datasets, enabling clinical decision-making workflows. Developed and optimized supervised and deep learning models for patient risk prediction and medical text classification, and implemented modular LLM RAG systems with cross-encoder reranking for improved clinical retrieval precision.

Data Scientist

7-Eleven

Feb 2017 - Mar 2020 (3 years 1 month)

Developed end-to-end machine learning models in Python and scikit-learn for customer behavior analysis and sales prediction, supporting retail business decisions. Engineered distributed ETL and forecasting workflows using Spark/PySpark and time-series methods, and built segmentation, recommendation, and NLP sentiment pipelines, measuring their business impact through A/B testing.

Education

Degrees, certifications, and relevant coursework

University of Illinois Chicago

Bachelor's degree, Computer Science

2014 - 2017

Earned a bachelor's degree in Computer Science at the University of Illinois Chicago from 2014 to 2017.
