Looking for a job

Leon Raymer

@leonraymer

Senior ML Engineer | 4+ yrs production NLP/LLMs at Yandex | RAG, fine-tuning, eval infra | Remote B2B from Yerevan (UTC+4)

Armenia

What I'm looking for

I’m looking to build and own scalable NLP/LLM systems—especially RAG, fine-tuning, and evaluation—while partnering closely with product and analytics to deliver measurable improvements in support, content, and conversion.

I’m a Senior ML Engineer with 4+ years of production experience delivering end-to-end NLP and LLM solutions at scale. I’ve built and owned RAG systems, LLM fine-tuning pipelines, and evaluation infrastructure serving millions of users. I translate business requirements into measurable outcomes, from support automation to content intelligence and conversion optimization.

At Yandex, I implemented a custom CUDA-accelerated loss function for PyTorch training, reducing GPU memory overhead by 9% and cutting cloud compute costs by ~$14K/month for domain-specific LLM fine-tuning. I also built an offline LLM evaluation framework with an automated benchmark suite of 3K labeled examples, LLM-as-a-Judge scoring, and Slack alerting on metric degradation, which reduced regression detection time from 2 weeks to 4 hours. In production, I shipped a review summarization feature that aggregates 50–500 reviews per product with a map-reduce approach; in an A/B test on 2M users, it achieved a 3.41% relative increase in add-to-cart rate.

Earlier at Yandex, I designed and deployed a RAG-based customer support automation system using BM25 retrieval (Elasticsearch), a cross-encoder reranker, and a fine-tuned, grounded LLM. The system automated 34% of incoming support tickets with a CSAT of 4.1/5.0 while keeping the hallucination rate at 2%. I replaced a legacy BERT classifier with an instruction-tuned LLM (SFT + LoRA) for ticket routing across 20+ categories, improving accuracy from 81% to 93%, with the largest gains on long, multi-issue tickets. I’ve mentored interns and junior engineers through onboarding, code reviews, and production deployments, and I’ve helped lead design reviews to align engineering effort with product roadmaps and business priorities.

Experience

Work history, roles, and key accomplishments

Current

Senior ML Engineer

Yandex

Feb 2025 - Present (1 year 4 months)

Implemented a custom CUDA-accelerated PyTorch loss function, reducing GPU memory overhead by 9% and cutting cloud compute costs by ~$14K/month for domain-specific LLM fine-tuning. Built an offline LLM evaluation framework (3K-example benchmark + LLM-as-a-Judge) that cut regression detection time from 2 weeks to 4 hours.
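A minimal sketch of the kind of regression gate an offline evaluation framework like this might run after each benchmark pass. It is an illustration only, not the production code: the function name `find_regressions`, the metric names, and the 0.02 absolute-drop threshold are all assumptions. The returned messages are what an alerting hook (for example, a Slack webhook) would post.

```python
def find_regressions(
    baseline: dict[str, float],
    current: dict[str, float],
    max_drop: float = 0.02,  # assumed tolerance: largest acceptable absolute drop
) -> list[str]:
    """Compare the current run's metrics to a baseline and describe each regression."""
    alerts = []
    for name, base in sorted(baseline.items()):
        if name not in current:
            # A metric that silently disappears is treated as a regression too.
            alerts.append(f"{name}: missing from current run")
            continue
        drop = base - current[name]
        if drop > max_drop:
            alerts.append(f"{name}: {base:.3f} -> {current[name]:.3f} (drop {drop:.3f})")
    return alerts


# Hypothetical metric names and values, for illustration only.
baseline_run = {"judge_score": 0.90, "exact_match": 0.75}
current_run = {"judge_score": 0.85, "exact_match": 0.745}
alerts = find_regressions(baseline_run, current_run)
```

Here only `judge_score` is flagged, because its drop exceeds the tolerance while the `exact_match` change stays within it.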

Middle ML Engineer

Yandex

Jul 2023 - Jan 2025 (1 year 6 months)

Designed and deployed a RAG-based customer support automation system using Elasticsearch BM25 retrieval, cross-encoder reranking, and grounded LLM generation. Automated 34% of incoming tickets with a CSAT of 4.1/5.0, maintained a 2% hallucination rate, and improved ticket routing accuracy from 81% to 93% by replacing a legacy classifier with an instruction-tuned LLM (SFT + LoRA).

Junior ML Engineer

Yandex

Jul 2022 - Jun 2023 (11 months)

Fine-tuned RuBERT for multi-class support ticket classification (20+ categories), achieving 81% accuracy and reducing ticket routing time from 4 minutes to under 1 second. Curated labeled datasets via Yandex Toloka (annotation guidelines + QC), reaching inter-annotator agreement of 0.74 (Fleiss' kappa), and developed FastAPI-based inference services for production NLP models.
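For readers unfamiliar with the agreement metric quoted above, here is a minimal sketch of the standard Fleiss' kappa formula. It is a generic illustration, not the tooling used on the project, and the input layout (one row per item, one column per category, each cell counting annotators) is an assumption.

```python
from typing import Sequence


def fleiss_kappa(counts: Sequence[Sequence[int]]) -> float:
    """counts[i][j] = number of annotators who assigned item i to category j."""
    n_items = len(counts)
    n_raters = sum(counts[0])
    if n_raters < 2 or any(sum(row) != n_raters for row in counts):
        raise ValueError("every item needs the same number (>= 2) of ratings")
    n_categories = len(counts[0])

    # Share of all ratings that fell into each category.
    p_cat = [
        sum(row[j] for row in counts) / (n_items * n_raters)
        for j in range(n_categories)
    ]
    # Per-item agreement: fraction of annotator pairs that chose the same category.
    p_item = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_observed = sum(p_item) / n_items
    p_expected = sum(p * p for p in p_cat)
    # Undefined (division by zero) if every rating falls into a single category.
    return (p_observed - p_expected) / (1 - p_expected)


# Toy data: 2 items, 2 categories, 3 annotators per item.
perfect = fleiss_kappa([[3, 0], [0, 3]])
split = fleiss_kappa([[2, 1], [1, 2]])
```

A value of 1 means complete agreement, while values at or below 0 mean agreement no better than chance.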

Intern ML Engineer

Yandex

Jan 2022 - Jun 2022 (5 months)

Worked on ETA prediction for Yandex Maps by performing feature engineering and offline experiments with CatBoost. Shipped weather-based features to production, reducing MAPE by 2.6pp on winter routes.

Education

Degrees, certifications, and relevant coursework

HSE University

Bachelor of Science, Applied Mathematics & Computer Science

BSc in Applied Mathematics & Computer Science at HSE University with coursework in machine learning, deep learning, and natural language processing.
