Mehdi Smail
@mehdismail
Data Scientist focused on ML pipeline operations, predictive modeling, and automation that cuts reporting time and boosts data quality.
What I'm looking for
I’m a Data Scientist with a Master of Science in Data Science and 4+ years of experience supporting machine learning pipelines, predictive modeling, and data engineering for real-world decision systems. I’ve managed end-to-end ML data operations at scale, delivering 4.6M+ labeled data points to production models.
I build automated Python/SQL pipelines that turn messy, distributed sources into reliable reporting and operational insights. By consolidating Hive data into weekly underdelivery and quality reports, I reduced manual preparation time by 88%, and I improved visibility by removing data lag in SLA dashboards.
I also focus on quality and reliability: I’ve automated Root Cause Analysis pipelines, performed statistical QA and arbitration across cases, and supported SLA tracking and escalation with data-driven backlog forecasting. I’m comfortable deploying end-to-end data solutions on AWS and turning analytics into actionable workflows.
Experience
Work history, roles, and key accomplishments
Data Understanding Specialist
ByteDance / TikTok
Oct 2025 - Present (8 months)
Owned analytics and reporting for ML data operations across 416 labeling queues, improving delivery rate to 92.2% and accuracy to 98% on 1.7M+ labeled data points per quarter. Built Python/SQL reporting pipelines and SLA dashboards that cut manual reporting by 88% (10+ hours weekly) and reduced reporting lag from 3 days to 1 day, and developed an LLM-powered RCA tool reducing investigation time to
Data Understanding Specialist
ByteDance
Oct 2025 - Present (8 months)
Managed end-to-end ML data operations across 416+ queues and 11 ML products, delivering 4.6M+ labeled data points in Q4 to support production brand safety models tied to $7.8B in prebid inventory filtering. Built automated Python/SQL reporting and SLA visibility dashboards, reducing manual reporting by 88% and eliminating 3-day data lag while improving operational RCA turnaround to 30–60 minutes.
Trust & Safety Associate
Accenture Technology Solutions Sdn. Bhd.
Nov 2024 - Sep 2025 (10 months)
Resolved 400+ moderation cases daily using user report and business verification data, maintaining 99.9% accuracy and exceeding SLA targets. Defined technical requirements and process documentation for a policy-assistance tool, projecting 20% faster case resolution through workflow automation.
IT Trainer
Kidocode
Mar 2021 - Oct 2021 (7 months)
Led curriculum development for 15+ training modules by structuring complex programming and data concepts into clear learning materials. Achieved 95% learner satisfaction ratings through course design and delivery.
Education
Degrees, certifications, and relevant coursework
Universiti Teknologi Malaysia
Master of Science in Data Science, Data Science
Master of Science in Data Science at Universiti Teknologi Malaysia (completed 2024).
Universiti Teknologi Malaysia (UTM)
Master of Science (M.Sc.), Data Science
Earned an M.Sc. in Data Science from Universiti Teknologi Malaysia (UTM) in 2024.
International Islamic University Malaysia
Bachelor of Engineering, Mechatronics
Bachelor of Engineering in Mechatronics at International Islamic University Malaysia (completed 2022).
International Islamic University Malaysia (IIUM)
Bachelor of Engineering (B.Eng.), Mechatronics
Earned a B.Eng. in Mechatronics from International Islamic University Malaysia (IIUM) in 2022.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Skills
Interested in hiring Mehdi?
You can contact Mehdi and 90k+ other talented remote workers on Himalayas.
Message MehdiFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
