At Netflix, our mission is to entertain the world. Together, we are writing the next episode - pushing the boundaries of storytelling, global fandom and making the unimaginable a reality. We are a dream team obsessed with the uncomfortable excitement of discovering what happens when you merge creativity, intuition and cutting-edge technology. Come be a part of what’s next.
Requirements
- Experience building high-traffic distributed services and infrastructure for online ML model inference
- Understanding scalable model-serving solutions for generative models and LLMs
- Proficiency in object-oriented programming (preferably Java)
- Experience deploying ML models using tools like Triton Inference Server, TensorRT, Docker
- Experience working with the public cloud like AWS, Azure, or GCP
- A BS/MS in Computer Science, Applied Math, Engineering, or a related field
Benefits
- Health Plans
- Mental Health support
- 401(k) Retirement Plan with employer match
- Stock Option Program
- Disability Programs
- Health Savings and Flexible Spending Accounts
- Family-forming benefits
- Life and Serious Injury Benefits
- Paid leave of absence programs
