Himalayas logo
HM
Open to opportunities

Harsh Mehta

@harshmehta

I am a data scientist and ML engineer specializing in cloud-native analytics.

United States
Message

What I'm looking for

I seek a collaborative, product-focused role building production ML and cloud-native data systems where I can lead model deployment, improve observability, and drive measurable business impact.

I am a data scientist and AI engineer with hands-on experience building cloud-native data architectures and production ML systems. I combine practical engineering with product-focused analytics to deliver measurable business outcomes.

I have built modular AI pipelines for large, unstructured documents, optimized LlamaIndex-based RAG systems, and benchmarked OCR and LLM performance for real-world data. I was selected for NSF I-Corps, conducted 90+ customer interviews to validate product-market fit, and led MVP development that produced a 35% uplift in booking conversions.

I bring end-to-end experience across data engineering, model training and deployment, ETL, and dashboards, with deep familiarity in AWS, Snowflake, SageMaker, and modern ML tooling. I seek roles where I can ship reliable ML products, improve observability, and drive data-informed decisions.

Experience

Work history, roles, and key accomplishments

OU

AI Engineering Extern

Outamation

Aug 2025 - Oct 2025 (2 months)

Built a modular AI pipeline combining Tesseract OCR and PyMuPDF to extract data from mortgage documents >200 pages and optimized a LlamaIndex-based RAG system to improve document retrieval precision. Established benchmarking for OCR and RAG performance and produced a technical report and stakeholder UI summarizing model trade-offs.

NL

Founder & AI Strategist

NSF I-Corps Great Lakes

Feb 2025 - May 2025 (3 months)

Selected by NSF I-Corps to validate Ensemble's product-market fit, conducting 90+ customer interviews to refine roadmap and strategy. Built autonomous AI agents and real-time scheduling via n8n and OAuth/PostgreSQL integrations, driving a 35% uplift in booking conversions for the MVP.

MG

Data Engineer

MSBA Financial Group

Sep 2024 - Oct 2024 (1 month)

Built an end-to-end data pipeline with AWS S3, Glue, and Redshift to centralize financial data and reduced processing time by 63%. Trained and deployed a SageMaker Canvas model achieving 99.19% accuracy and 0.981 AUC-ROC to predict bankruptcy risk and informed investment recommendations.

HV

Business Analyst Extern

HP Tech Ventures

Jun 2024 - Aug 2024 (2 months)

Evaluated 30+ startups and processed 50,000+ data points using Python to extract KPIs and inform investment recommendations. Built a lightweight ETL pipeline in Snowflake that improved data integration efficiency by 32% and wrote optimized SQL reducing key query times by 45%.

US

Data Scientist

UW Transportation Services

Jun 2024 - Aug 2024 (2 months)

Analyzed 11M+ parking transactions to identify usage and weather dependencies, reducing campus parking search time by 27% and improving stakeholder understanding via interactive dashboards. Applied clustering and predictive models to segment facilities and improve resource allocation by 18%.

PE

Business Analyst

Prayas Entertainment

Jan 2021 - Apr 2023 (2 years 3 months)

Improved operational efficiency by 35% using analytics and predictive models, built a financial forecasting model that supported strategic decisions and increased revenue by 10%. Identified high-value customer segments that raised lifetime value by 15% and retention by 10% while managing Oracle databases and ETL workflows.

IP

Data Analyst

Indigo Events & Promotions

Mar 2017 - Mar 2020 (3 years)

Led data-driven media strategy that increased customer satisfaction by 30% and improved lead conversions by 21%; optimized digital campaigns to raise CTR by 40% and cut acquisition costs by 20%. Used Google Analytics and BI tools to identify high-conversion opportunities, boosting conversions by 25% and ROI by 10% in three months.

Education

Degrees, certifications, and relevant coursework

University of Wisconsin–Madison logoUW

University of Wisconsin–Madison

Master of Science, Information (Data, ML, Cloud)

2023 - 2025

Master of Science in Information with emphasis on data, machine learning, and cloud technologies; completed coursework and applied projects from September 2023 to May 2025.

University of Mumbai logoUM

University of Mumbai

Bachelor of Management Studies, Management Studies

2013 - 2017

Completed a Bachelor of Management Studies covering core business and management coursework from June 2013 to July 2017.

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan
Harsh Mehta - AI Engineering Extern - Outamation | Himalayas