I am a Machine Learning Engineer focused on LLM orchestration and agentic AI, building production-grade systems that integrate models from OpenAI, Google, and Anthropic into scalable backend infrastructures.
I design agentic workflows, fine-tune large language models, and implement RAG and CAG systems; I have engineered SDK features, developed APIs for transcription and RAG-as-a-service, and automated evaluation pipelines using LLM-as-a-judge.
My background includes deep learning research published in IEEE, hands-on MLOps experience with multi-GPU training (DeepSpeed, FSDP, Accelerate), and deployment work across GCP, Azure, and AWS, improving operational efficiency and reducing manual processes.
I bring bilingual communication skills, multicultural adaptability, and a strong combination of academic research and practical engineering to help teams productionize advanced generative AI solutions.
