Leon Raymer
@leonraymer
Senior ML Engineer | 4+ yrs production NLP/LLMs at Yandex | RAG, fine-tuning, eval infra | Remote B2B from Yerevan (UTC+4)
What I'm looking for
I’m a Senior ML Engineer with 4+ years of production experience delivering end-to-end NLP and LLM solutions at scale. I’ve built and owned RAG systems, LLM fine-tuning pipelines, and evaluation infrastructure serving millions of users. I translate business requirements into measurable outcomes, from support automation to content intelligence and conversion optimization.
At Yandex, I implemented a custom CUDA-accelerated loss function within the PyTorch training kernel, reducing GPU memory overhead by 9% and cutting cloud compute costs by ~$14K/month for domain-specific LLM fine-tuning. I also built an offline LLM evaluation framework with an automated benchmark suite of 3K labeled examples, LLM-as-a-Judge scoring, and Slack alerting on metric degradation—reducing regression detection time from 2 weeks to 4 hours. In production, I shipped a review summarization feature, aggregating 50–500 reviews per product with a map-reduce approach and achieving a 3.41% relative increase in add-to-cart rate on 2M users via A/B testing.
Earlier at Yandex, I designed and deployed a RAG-based customer support automation system using BM25 retrieval (Elasticsearch), a cross-encoder reranker, and a fine-tuned, grounded LLM—automating 34% of incoming support tickets with CSAT of 4.1/5.0 and keeping hallucination rate at 2%. I replaced a legacy BERT classifier with an instruction-tuned LLM (SFT + LoRA) for ticket routing across 20+ categories, improving accuracy from 81% to 93%, especially on long, multi-issue tickets. I’ve mentored interns and junior engineers through onboarding, code reviews, and production deployments, and I’ve helped lead design reviews to align engineering effort with product roadmaps and business priorities.
Experience
Work history, roles, and key accomplishments
Senior ML Engineer
Yandex
Feb 2025 - Present (1 year 4 months)
Implemented a custom CUDA-accelerated PyTorch loss function, reducing GPU memory overhead by 9% and cutting cloud compute costs by ~$14K/month for domain-specific LLM fine-tuning. Built an offline LLM evaluation framework (3K-example benchmark + LLM-as-a-Judge) to reduce regression detection time from 2 weeks to 4 hours.
Middle ML Engineer
Yandex
Jul 2023 - Jan 2025 (1 year 6 months)
Designed and deployed a RAG-based customer support automation system using Elasticsearch BM25 retrieval, cross-encoder reranking, and grounded LLM generation. Automated 34% of incoming tickets with CSAT 4.1/5.0, maintained 2% hallucination rate, and improved ticket routing accuracy from 81% to 93% by replacing a legacy classifier with an instruction-tuned LLM (SFT + LoRA).
Junior ML Engineer
Yandex
Jul 2022 - Jun 2023 (11 months)
Fine-tuned RuBERT for multi-class support ticket classification (20+ categories) achieving 81% accuracy and reduced ticket routing time from 4 minutes to under 1 second. Curated labeled datasets via Yandex Toloka (annotation guidelines + QC), reaching inter-annotator agreement of 0.74 (Fleiss' kappa), and developed FastAPI-based inference services for production NLP models.
Intern ML Engineer
Yandex
Jan 2022 - Jun 2022 (5 months)
Worked on ETA prediction for Yandex Maps by performing feature engineering and offline experiments with CatBoost. Shipped weather-based features to production, reducing MAPE by 2.6pp on winter routes.
Education
Degrees, certifications, and relevant coursework
HSE University
Bachelor of Science, Applied Mathematics & Computer Science
BSc in Applied Mathematics & Computer Science at HSE University with coursework in machine learning, deep learning, and natural language processing.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Social media
Job categories
Skills
Interested in hiring Leon?
You can contact Leon and 90k+ other talented remote workers on Himalayas.
Message LeonFind your dream job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
