Skip to main content
SA
Open to opportunities

Shahid Afridi

@shahidafridi2

AI Research Associate specializing in LLM and web-agent evaluation, prompt engineering, and robust safety testing.

India
Message

What I'm looking for

I’m looking to transition into an AI Engineer role by deepening Python fundamentals and building/validating LLM tooling. I want to work on production-ready AI agent training pipelines with strong evaluation, safety checks, and measurable quality benchmarks.

I’m an AI Research Associate with 1 year of hands-on experience contributing to production AI agent training pipelines for a major cloud provider’s web browser agent. My core work spans LLM agent evaluation, Chain-of-Thought trajectory annotation, and simulated environment testing, and I was promoted to Technical Research Associate within 3 months.

I lead evaluation efforts by detecting hallucination, reward hacking, and prompt injection/leakage, and by validating Human-in-the-Loop checkpoints using internal evaluation tooling. I also define JSON schemas within prompts for structured outputs, supervised 2–3 Research Associates, and onboarded 3–4 joiners on evaluation protocols—actively building toward an AI Engineer role.

Experience

Work history, roles, and key accomplishments

Keywords Studios logoKS

Technical Research Associate

Oct 2025 - Jul 2026 (9 months)

Led web-agent trajectory data collection by authoring multi-step task prompts and capturing screenshot-based step-level Chain-of-Thought annotations to train GUI-based LLM agents. Contributed to production web browser agent evaluation across synthetic web gyms, performing multimodal checks for hallucinations, prompt leakage, reward hacking, and HITL checkpoints; supervised 2–3 associates and onboa

Keywords Studios logoKS

Research Associate

Jul 2025 - Sep 2025 (2 months)

Collected web-agent trajectory datasets by executing prompt-driven UI interactions on live websites and writing step-level Chain-of-Thought annotations for screenshot-based action sequencing. Evaluated model outputs for correctness, instruction adherence, reasoning quality, and safety compliance, delivering consistently high-accuracy results that led to promotion within 3 months.

Education

Degrees, certifications, and relevant coursework

REVA University logoRU

REVA University

Bachelor of Technology (B.Tech), Electrical and Computer Engineering

Grade: CGPA: 7.7/10

Completed a B.Tech in Electrical and Computer Engineering at REVA University in 2024. Achieved a CGPA of 7.7/10.

Tech stack

Software and tools used professionally

Get matched with your dream remote job

Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan