Open to opportunities

Zihao Feng

@zihaofeng1

Senior Machine Learning Engineer building large-scale generative retrieval and representation learning systems.

United States

What I'm looking for

I want to build and scale generative retrieval and representation learning systems, especially safe, user-facing LLM pipelines and post-training alignment, while owning the distributed infrastructure that makes training and inference cheaper and more reliable.

I’m a Senior Machine Learning Engineer at Snap building large-scale representation learning and generative retrieval systems. My work spans GNNs and Transformer-based modeling, along with the distributed infrastructure required to train and deploy models at scale. I’m increasingly focused on generative modeling and post-training techniques.

At Snap, I led social graph representation learning using GNNs and Transformer encoders, improving friending quality and large-scale retrieval performance. I explored generative retrieval via semantic tokenization (Semantic IDs) and next-token approaches. I also built production LLM pipelines for safe user recommendation, curating high-quality human-labeled datasets and tuning precision-recall tradeoffs. In addition, I investigated GRPO-style preference optimization to align open-weight LLMs with policy objectives, and I rebuilt distributed graph training infrastructure to make training 3x cheaper and inference 20x cheaper, work that also contributed to large-scale open-source GNN systems.

Previously at Pinterest, I helped build the company’s first neural retrieval models for ads, evolving from heuristic interest matching to GraphSAGE-based user representations and a backend two-tower deep retrieval architecture. I migrated major recommendation surfaces to NN-based retrieval, driving substantial improvements in engagement quality and monetization efficiency. I also worked on an EfficientNet-based solution to remove NSFW content from Pins and improved comment sentiment analysis by combining image content with text using OpenAI CLIP.

Experience

Work history, roles, and key accomplishments

Current

Senior Machine Learning Engineer

Snap Inc.

Oct 2022 - Present (3 years 5 months)

Led social graph representation learning with GNNs and Transformer encoders, improving friending quality and large-scale retrieval performance. Built production LLM pipelines for safe user recommendation, curated high-quality labeled datasets, and rebuilt distributed graph training infrastructure to cut training cost by 3x and inference cost by 20x.

Software Engineer, Machine Learning

Pinterest

Aug 2019 - Oct 2022 (3 years 2 months)

Built Pinterest’s first neural retrieval model for Shopping Ads, evolving from heuristic matching to GraphSAGE-based user representations and a two-tower deep retrieval architecture. Migrated major recommendation surfaces to NN-based retrieval and improved engagement quality and monetization efficiency.

Education

Degrees, certifications, and relevant coursework

UC Berkeley School of Information

Information

2017 - 2019

Studied at the UC Berkeley School of Information from 2017 to 2019.
