Harsh Mehta
@harshmehta
I am a data scientist and ML engineer specializing in cloud-native analytics.
What I'm looking for
I am a data scientist and AI engineer with hands-on experience building cloud-native data architectures and production ML systems. I combine practical engineering with product-focused analytics to deliver measurable business outcomes.
I have built modular AI pipelines for large, unstructured documents, optimized LlamaIndex-based RAG systems, and benchmarked OCR and LLM performance for real-world data. I was selected for NSF I-Corps, conducted 90+ customer interviews to validate product-market fit, and led MVP development that produced a 35% uplift in booking conversions.
I bring end-to-end experience across data engineering, model training and deployment, ETL, and dashboards, with deep familiarity in AWS, Snowflake, SageMaker, and modern ML tooling. I seek roles where I can ship reliable ML products, improve observability, and drive data-informed decisions.
Experience
Work history, roles, and key accomplishments
AI Engineering Extern
Outamation
Aug 2025 - Oct 2025 (2 months)
Built a modular AI pipeline combining Tesseract OCR and PyMuPDF to extract data from mortgage documents >200 pages and optimized a LlamaIndex-based RAG system to improve document retrieval precision. Established benchmarking for OCR and RAG performance and produced a technical report and stakeholder UI summarizing model trade-offs.
Founder & AI Strategist
NSF I-Corps Great Lakes
Feb 2025 - May 2025 (3 months)
Selected by NSF I-Corps to validate Ensemble's product-market fit, conducting 90+ customer interviews to refine roadmap and strategy. Built autonomous AI agents and real-time scheduling via n8n and OAuth/PostgreSQL integrations, driving a 35% uplift in booking conversions for the MVP.
Data Engineer
MSBA Financial Group
Sep 2024 - Oct 2024 (1 month)
Built an end-to-end data pipeline with AWS S3, Glue, and Redshift to centralize financial data and reduced processing time by 63%. Trained and deployed a SageMaker Canvas model achieving 99.19% accuracy and 0.981 AUC-ROC to predict bankruptcy risk and informed investment recommendations.
Business Analyst Extern
HP Tech Ventures
Jun 2024 - Aug 2024 (2 months)
Evaluated 30+ startups and processed 50,000+ data points using Python to extract KPIs and inform investment recommendations. Built a lightweight ETL pipeline in Snowflake that improved data integration efficiency by 32% and wrote optimized SQL reducing key query times by 45%.
Data Scientist
UW Transportation Services
Jun 2024 - Aug 2024 (2 months)
Analyzed 11M+ parking transactions to identify usage and weather dependencies, reducing campus parking search time by 27% and improving stakeholder understanding via interactive dashboards. Applied clustering and predictive models to segment facilities and improve resource allocation by 18%.
Business Analyst & Branding Extern
Beats by Dre
May 2024 - Jun 2024 (1 month)
Conducted competitive and consumer research that informed a revised pricing strategy increasing sales volume by 10% and a marketing approach that boosted engagement by 15%. Applied Python NLP to 2,000+ reviews and produced Tableau/Power BI visualizations to guide campaign targeting, improving targeting metrics by 11%.
Business Analyst
Prayas Entertainment
Jan 2021 - Apr 2023 (2 years 3 months)
Improved operational efficiency by 35% using analytics and predictive models, built a financial forecasting model that supported strategic decisions and increased revenue by 10%. Identified high-value customer segments that raised lifetime value by 15% and retention by 10% while managing Oracle databases and ETL workflows.
Data Analyst
Indigo Events & Promotions
Mar 2017 - Mar 2020 (3 years)
Led data-driven media strategy that increased customer satisfaction by 30% and improved lead conversions by 21%; optimized digital campaigns to raise CTR by 40% and cut acquisition costs by 20%. Used Google Analytics and BI tools to identify high-conversion opportunities, boosting conversions by 25% and ROI by 10% in three months.
Education
Degrees, certifications, and relevant coursework
University of Wisconsin–Madison
Master of Science, Information (Data, ML, Cloud)
2023 - 2025
Master of Science in Information with emphasis on data, machine learning, and cloud technologies; completed coursework and applied projects from September 2023 to May 2025.
University of Mumbai
Bachelor of Management Studies, Management Studies
2013 - 2017
Completed a Bachelor of Management Studies covering core business and management coursework from June 2013 to July 2017.
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Harsh?
You can contact Harsh and 90k+ other talented remote workers on Himalayas.
Message HarshFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
