Open to opportunities

Ayan Sau

@ayansau

Message

Generative AI Data Analyst specializing in LLM evaluation, RLHF, and multimodal alignment.

India

Message

What I'm looking for

I’m looking for a role focused on LLM/GenAI evaluation, RLHF, and multimodal + multilingual alignment, where I can run QA/QC, response ranking, and safety checks, collaborate with global teams, and improve model quality through high-precision training data.

I’m a Generative AI Data Analyst with 3+ years of experience driving LLM evaluation and training quality through LLM evaluation, RLHF, and human preference evaluation. I work across text, image, audio, and video to assess accuracy, safety, reasoning, instruction-following, and user preference—turning findings into actionable improvements.

I’ve led QA/QC audits for annotations and model evaluations, performing response ranking and re-evaluation to maintain consistency and inter-annotator agreement. I actively identify hallucinations, bias, misinformation, and policy violations, providing structured feedback that supports model alignment and response reliability.

I also bring strong multilingual localization experience, conducting evaluations across English, Bengali, Hindi, and more to improve cultural relevance and alignment. Alongside GenAI work, I’ve built a technical support foundation and data labeling expertise—supporting high-quality training data for large-scale AI systems while meeting accuracy and productivity targets.

Experience

Work history, roles, and key accomplishments

Current

Generative AI Data Analyst

Current

Welocalize

Jun 2026 - Present (1 month)

Evaluated AI-generated content across text, image, audio, and video for accuracy, safety, and overall quality. Compared and ranked model responses, conducted multilingual localization evaluations, and performed QC/QA audits including hallucination and policy-violation checks.

LLM Evaluation Response Ranking Prompt Evaluation AI Safety Evaluation Hallucination Detection Bias Misinformation Policy Checks Localization QA

Current

AI Trainer - Bengali Language

Current

Welo Data

Apr 2026 - Present (3 months)

Evaluated AI-generated Bengali responses for accuracy, grammar, and cultural relevance, and provided rewritten improvements. Created Bengali prompts and responses to improve conversational datasets while collaborating with a global team on localization quality.

Grammar And Cultural Relevance Checks Prompt Engineering Response Rewriting Localization

Current

RLHF Generalist

Current

Mercor

Mar 2026 - Present (4 months)

Performed RLHF tasks to improve large language model reasoning, safety, and instruction adherence. Evaluated and ranked responses, wrote prompts across domains, identified model failure patterns, and provided structured feedback for alignment improvement.

Response Ranking Instruction Following Evaluation Reasoning Quality Assessment Hallucination Detection

Current

Generative AI Specialist

Current

Alignerr

Sep 2025 - Present (10 months)

Evaluated and compared AI-generated responses across domains for accuracy, reasoning, factuality, and instruction following. Performed response ranking and preference evaluation, conducted multimodal/text/audio evaluation, ran error analysis (hallucinations and safety issues), and completed QA/QC reviews to support multilingual model alignment.

Response Ranking Preference Models Quality Assessment Hallucination Detection Multimodal Analysis Text Evaluation QA

Current

Data Labelling Expert

Current

RWS Group

Jan 2024 - Present (2 years 6 months)

Conducted high-precision data annotation for machine learning applications including sentiment analysis, NER, content moderation, and text classification. Reviewed datasets for consistency, balance, and linguistic accuracy, supported QA/inter-annotator agreement processes, and delivered expert labeling for large-scale client projects.

Data Annotation Sentiment Analysis Content Moderation Inter Annotator Agreement

Team Lead & LLM Trainer

Turing

Sep 2025 - Feb 2026 (5 months)

Trained and fine-tuned large language models using RLHF. Created and improved prompts, evaluated outputs for accuracy/safety/instruction following, and performed annotations and quality checks using tools including Python, OpenAI API, and Hugging Face.

Prompt Engineering LLM Fine tuning Instruction Following Evaluation Data Annotation Python OpenAI API Hugging Face

Technical Support Executive

Gigmo Solutions

Feb 2024 - May 2026 (2 years 3 months)

Provided B2B and B2C technical support for Autodesk customers via chat and email, troubleshooting connectivity, serial activation, licensing errors, account verification, and installation issues. Supported Autodesk products (e.g., AutoCAD, Revit, Fusion 360, Navisworks, Inventor, 3ds Max, Maya, Civil 3D, BIM 360) and assisted with licensing management, multi-user network deployments, and complianc

Live Chat Technical Support Troubleshooting Connectivity Issues Verification Installation License Management Autodesk Product Support

Operations Executive - Support Coord

Startek

Jun 2023 - Feb 2024 (8 months)

Supported Zomato backend operations by resolving order-related and system-level discrepancies and acting as a key liaison between customer-facing teams and technical backend teams. Used Excel and Google Suite to maintain daily reporting for order tracking and escalation patterns and supported chatbot development through video/image annotation.

Back End Discrepancy Handling Excel reporting Video Annotation Image Annotation Chatbot Google Workspace

Operation and Audit Intern

PlanetSpark

Jan 2023 - May 2023 (4 months)

Evaluated over 10 demo sessions daily against predefined quality parameters and provided feedback to leadership on teacher skills and methodologies. Resolved teacher inquiries via phone and email and handled technical/platform-related concerns while collaborating with trainers and coaches.

Teamwork Phone Support Email Support QA Against Predefined Parameters

Customer Service Associate

Teleperformance

Jul 2022 - Jan 2023 (6 months)

Provided end-to-end technical support to Flipkart customers for electronics such as smartphones, laptops, desktops, smartwatches, IoT devices, and related products. Handled voice and chat inquiries, resolved hardware/software setup and usage issues, escalated complex cases to authorized service centers, and documented resolutions in internal CRM tools.

Technical Customer Support Troubleshooting Customer Escalation