Himalayas logo
RM
Open to opportunities

Ruken Missonnier

@rukenmissonnier

Data scientist specializing in NLP, model fine-tuning, and production-ready ML pipelines.

France
Message

What I'm looking for

I seek NLP/ML roles where I can build and productionize models, work on RAG and LLM fine-tuning, and collaborate in data-driven teams focused on impact.

I am a data scientist focused on natural language processing, model optimization, and deploying robust ML systems to production. I curate and augment multi-source datasets and build retrieval-augmented generation pipelines to power downstream applications.

At Largo.ai I fine-tune and distill large language and transformer models (LLaMA2, RoBERTa, BERT variants, DistilBERT, Qwen-14B) using FP16, LoRA, 4-bit quantization and extensive hyperparameter sweeps to improve accuracy and F1 metrics. I also implemented FAISS HNSW ANN, cross-encoder fusion and async GPT-4 re-ranking for high-quality JSON outputs.

In banking roles I engineered and serialized logistic regression and XGBoost credit-risk models that increased AUC-ROC by 5%, built internal chatbots to query complex SQL data, and deployed ETL pipelines integrated with SAP BusinessObjects. I have experience prototyping anomaly detection models and delivering PowerBI visualizations from PL/SQL and Python pipelines.

I bring academic rigor as a current PhD candidate and prior teaching experience, combining strong statistical foundations with hands-on production experience to deliver impactful, reliable ML solutions.

Experience

Work history, roles, and key accomplishments

LA
Current

NLP Data Scientist

Largo.ai

Jan 2024 - Present (1 year 9 months)

Curated and augmented multi-source emotion/genre and movie/actor datasets and built RAG pipelines with FAISS HNSW, cross-encoder fusion and GPT-4 re-ranking to produce structured JSON outputs. Fine-tuned and distilled LLaMA2, RoBERTa, BERT variants and Qwen-14B using FP16, LoRA and 4-bit quantization with hyperparameter sweeps, improving retrieval and classification metrics (accuracy, F1, top-k).

TB

Data Specialist

Türkiye İş Bankası

Jan 2019 - Jan 2021 (2 years)

Built PL/SQL and Python data pipelines to extract and transform raw data and produced Power BI visualizations and reports for team consumption, improving data accessibility and reporting workflows.

MU

Statistics Teaching Assistant

Middle East Technical University

Jan 2016 - Jan 2017 (1 year)

Led recitation labs for Statistical Inference and Regression Analysis, supporting over 60 students and assisting with coursework and exam preparation.

Education

Degrees, certifications, and relevant coursework

Istanbul Technical University logoIU

Istanbul Technical University

Doctor of Philosophy, Computer Science

2024 -

PhD candidate in Computer Science conducting advanced research and coursework since 2024.

Ondokuz Mayis University logoOU

Ondokuz Mayis University

Master of Science, Data Science

2021 - 2023

Completed a Master of Science in Data Science with coursework and projects focused on statistical modelling and machine learning.

Marmara University Faculty of Engineering logoME

Marmara University Faculty of Engineering

Master of Science, Engineering Management

2019 - 2020

Completed a Master of Science in Engineering Management covering project management and engineering systems.

Galatasaray University logoGU

Galatasaray University

Master of Science, Economics

2018 - 2021

Completed a Master of Science in Economics with coursework in economic theory and quantitative methods.

Tech stack

Software and tools used professionally

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan
Ruken Missonnier - NLP Data Scientist - Largo.ai | Himalayas