We’re seeking a highly skilled, execution-focused Data Scientist with 4–10 years of experience to join our team. This role demands hands-on expertise in fine-tuning and deploying generative AI models across image, video, and audio domains, with a special focus on lip-sync, character consistency, and automated quality evaluation frameworks. You will be expected to run rapid experiments, test architectural variations, and deliver working model iterations quickly in a high-velocity R&D environment.
Responsibilities
- Run end-to-end fine-tuning experiments on state-of-the-art models (Flux family, LoRA, diffusion-based architectures, context-based composition).
- Develop and optimize generative AI models for audio generation and lip-sync, ensuring high fidelity and natural delivery.
- Extend current language models to support regional Indian languages beyond US/UK English for audio and content generation.
- Enable emotional delivery in generated audio (shouting, crying, whispering) to enhance realism.
- Integrate and synchronize background scores seamlessly with generated video content.
- Work towards achieving video quality standards comparable to Veo3/Sora.
- Ensure consistency in scenes and character generation across multiple outputs.
- Design and implement automated, objective evaluation frameworks to replace subjective human review of cover images, video frames, and audio clips. Implement scoring systems that standardize quality checks before editorial approval.
- Run comparative tests across multiple model architectures to evaluate trade-offs in quality, speed, and efficiency.
- Drive initiatives independently, showing high agency and accountability. Use first-principles thinking to tackle complex challenges.
- Apply a research-first approach with rapid experimentation in the fast-evolving Generative AI space.
Requirements
- 4–10 years of experience in Data Science, with a strong focus on Generative AI.
- Familiarity with state-of-the-art models in generative AI (e.g., Flux, diffusion models, GANs).
- Proven expertise in developing and deploying models for audio and video generation.
- Demonstrated experience with natural language processing (NLP), especially for regional language adaptation.
- Experience with model fine-tuning and optimization techniques.
- Hands-on exposure to ML deployment pipelines (FastAPI or equivalent).
- Strong programming skills in Python and relevant deep learning frameworks (e.g., TensorFlow, PyTorch).
- Experience in designing and implementing automated evaluation metrics for generative content.
- A portfolio or demonstrable experience in projects related to content generation, lip-sync, or emotional AI is a plus.
- Exceptional problem-solving skills and a proactive approach to research and experimentation.
Benefits
What you get
- Best-in-class salary: We hire only the best, and we pay accordingly.
- Proximity Talks: Meet other designers, engineers, and product geeks — and learn from experts in the field.
- Keep on learning with a world-class team: Work with the best in the field, challenge yourself constantly, and learn something new every day.
About us
Proximity is the trusted technology, design, and consulting partner for some of the biggest Sports, Media, and Entertainment companies in the world! We’re headquartered in San Francisco and have offices in Palo Alto, Dubai, Mumbai, and Bangalore. Since 2019, Proximity has created and grown high-impact, scalable products used by 370 million daily users, with a total net worth of $45.7 billion among our client companies.
Today, we are a global team of coders, designers, product managers, geeks, and experts. We solve complex problems and build cutting-edge tech, at scale. Our team of Proxonauts is growing quickly, which means your impact on the company’s success will be huge. You’ll have the chance to work with experienced leaders who have built and led multiple tech, product, and design teams.
Here’s a quick glimpse of Proximity and what it’s like to be a Proxonaut:
- Visit this YouTube link to listen to what our CEO, Hardik Jagda, has to say about Proximity.
- Meet some of our Proxonauts here: Know thy Proxonauts better
- Here are some quick links to the Careers page, Blog, and Studio Proximity (our design wing).
Follow our team's #BTS (behind-the-scenes) updates on our Instagram channels:
- @ProxWrks
- @H.Jagda