Trupti Shriyan
@truptishriyan
Dynamic Data Analyst skilled in Python, SQL, and data-driven insights.
What I'm looking for
I am a dynamic Data Science Intern with a solid foundation in classification modeling and a passion for data-driven insights. My hands-on experience includes developing advanced logistic regression models and optimizing performance through hyperparameter tuning. I excel at visualizing results and collaborating with stakeholders to translate complex data into actionable strategies that align with business goals.
During my internship at Acmegrade Pvt. Ltd., I developed a multiclass classification model using One-vs-Rest logistic regression, achieving 85% accuracy and a 20% performance gain over the baseline. I collaborated with business stakeholders to define success metrics and align modeling goals with operational objectives, translating technical results into actionable insights.
My academic journey includes a Master of Science in Computer Science from the University of Texas at Arlington and a Bachelor of Technology in Computer Engineering. I have a strong background in machine learning, data analysis, and visualization, and I am eager to leverage my analytical skills and innovative thinking in a collaborative team environment to drive impactful outcomes in the field of data science.
Experience
Work history, roles, and key accomplishments
KSAT Quest Project
Github
Apr 2025 - Present (2 months)
Developed a Random Forest Regression model to predict KSAT from 40+ unstructured Excel soil datasets, achieving an R2 of 80% and reduced RMSLE by applying extensive feature engineering and cross-sample evaluation, demonstrating model scalability across diverse geotechnical data.
Hate Speech Detection Using RoBERTa
Github
Jan 2025 - Present (5 months)
Fine-tuned a 12-layer RoBERTa transformer on 74K+ text samples using 3-fold cross-validation, achieving 91.2% accuracy and 91.1% F1-score by merging 4 public datasets into a balanced MetaHate corpus, applying byte-level BPE tokenization, and accelerating training by 40% using FP16 precision on A100 GPUs.
MammoScanAI
Github
Nov 2024 - Present (7 months)
Built a multi-model classification pipeline using Random Forest, SVM, Logistic Regression, kNN, and Nave Bayes, achieving 95% accuracy on the Wisconsin Diagnostic dataset by applying Chi-square feature selection, scaling, hyperparameter tuning, and visualizing results with heatmaps and confusion matrices for enhanced interpretability.
Data Science Intern
Acmegrade Pvt. Ltd.
Jul 2022 - Present (2 years 11 months)
Developed a multiclass classification model using One-vs-Rest logistic regression, achieving 85% accuracy and a 20% performance gain over baseline by applying cross-validation, hyperparameter tuning, and result visualization with Matplotlib and Seaborn. Collaborated with business stakeholders to define success metrics, align modeling goals with operational objectives, and translate technical resul
Education
Degrees, certifications, and relevant coursework
University of Texas at Arlington
Master of Science, Computer Science
Currently pursuing a Master of Science in Computer Science, focusing on advanced topics in the field. Expected to graduate in May 2026.
K. J. Somaiya Institute of Technology
Bachelor of Technology, Computer Engineering
Grade: 3.75 / 4.00
Completed a Bachelor of Technology in Computer Engineering, gaining a strong foundation in computer science principles. Achieved a GPA of 3.75 out of 4.00.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Portfolio
github.com/usernameJob categories
Skills
Interested in hiring Trupti?
You can contact Trupti and 90k+ other talented remote workers on Himalayas.
Message TruptiFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
