Skip to main content
HM
Open to opportunities

Harsh Mehta

@harshmehta

I am a data scientist and ML engineer specializing in cloud-native analytics.

United States
Message

What I'm looking for

I seek a collaborative, product-focused role building production ML and cloud-native data systems where I can lead model deployment, improve observability, and drive measurable business impact.

I am a data scientist and AI engineer with hands-on experience building cloud-native data architectures and production ML systems. I combine practical engineering with product-focused analytics to deliver measurable business outcomes.

I have built modular AI pipelines for large, unstructured documents, optimized LlamaIndex-based RAG systems, and benchmarked OCR and LLM performance for real-world data. I was selected for NSF I-Corps, conducted 90+ customer interviews to validate product-market fit, and led MVP development that produced a 35% uplift in booking conversions.

I bring end-to-end experience across data engineering, model training and deployment, ETL, and dashboards, with deep familiarity in AWS, Snowflake, SageMaker, and modern ML tooling. I seek roles where I can ship reliable ML products, improve observability, and drive data-informed decisions.

Experience

Work history, roles, and key accomplishments

OU

AI Engineering Extern

Outamation

Aug 2025 - Oct 2025 (2 months)

Built a modular AI pipeline combining Tesseract OCR and PyMuPDF to extract data from mortgage documents >200 pages and optimized a LlamaIndex-based RAG system to improve document retrieval precision. Established benchmarking for OCR and RAG performance and produced a technical report and stakeholder UI summarizing model trade-offs.

NL

Founder & AI Strategist

NSF I-Corps Great Lakes

Feb 2025 - May 2025 (3 months)

Selected by NSF I-Corps to validate Ensemble's product-market fit, conducting 90+ customer interviews to refine roadmap and strategy. Built autonomous AI agents and real-time scheduling via n8n and OAuth/PostgreSQL integrations, driving a 35% uplift in booking conversions for the MVP.

MG

Data Engineer

MSBA Financial Group

Sep 2024 - Oct 2024 (1 month)

Built an end-to-end data pipeline with AWS S3, Glue, and Redshift to centralize financial data and reduced processing time by 63%. Trained and deployed a SageMaker Canvas model achieving 99.19% accuracy and 0.981 AUC-ROC to predict bankruptcy risk and informed investment recommendations.

HV

Business Analyst Extern

HP Tech Ventures

Jun 2024 - Aug 2024 (2 months)

Evaluated 30+ startups and processed 50,000+ data points using Python to extract KPIs and inform investment recommendations. Built a lightweight ETL pipeline in Snowflake that improved data integration efficiency by 32% and wrote optimized SQL reducing key query times by 45%.

US

Data Scientist

UW Transportation Services

Jun 2024 - Aug 2024 (2 months)

Analyzed 11M+ parking transactions to identify usage and weather dependencies, reducing campus parking search time by 27% and improving stakeholder understanding via interactive dashboards. Applied clustering and predictive models to segment facilities and improve resource allocation by 18%.

PE

Business Analyst

Prayas Entertainment

Jan 2021 - Apr 2023 (2 years 3 months)

Improved operational efficiency by 35% using analytics and predictive models, built a financial forecasting model that supported strategic decisions and increased revenue by 10%. Identified high-value customer segments that raised lifetime value by 15% and retention by 10% while managing Oracle databases and ETL workflows.

IP

Data Analyst

Indigo Events & Promotions

Mar 2017 - Mar 2020 (3 years)

Led data-driven media strategy that increased customer satisfaction by 30% and improved lead conversions by 21%; optimized digital campaigns to raise CTR by 40% and cut acquisition costs by 20%. Used Google Analytics and BI tools to identify high-conversion opportunities, boosting conversions by 25% and ROI by 10% in three months.

Education

Degrees, certifications, and relevant coursework

University of Wisconsin–Madison logoUW

University of Wisconsin–Madison

Master of Science, Information (Data, ML, Cloud)

2023 - 2025

Master of Science in Information with emphasis on data, machine learning, and cloud technologies; completed coursework and applied projects from September 2023 to May 2025.

University of Mumbai logoUM

University of Mumbai

Bachelor of Management Studies, Management Studies

2013 - 2017

Completed a Bachelor of Management Studies covering core business and management coursework from June 2013 to July 2017.

Find your dream job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan