Monolith is seeking a Research Engineer to design, develop, and optimise multimodal generative models for text, images, and audio. The role involves working with domain experts, AI researchers, and engineers to translate academic breakthroughs into production systems. The team fosters a collaborative and inclusive environment supporting growth and innovation in AI-driven content and data generation.
Requirements
- Experience building and deploying machine learning models for multimodal data
- Deep understanding of generative AI (diffusion models, GANs, VAEs, vision-language models, large language models)
- Proficient programming skills in Python and ML frameworks
- Excellent analytical and problem-solving skills
- Collaborative mindset
Benefits
- Competitive compensation
- Comprehensive benefits package
- Continuous learning and professional development
- Remote-first working model
- Flexible work arrangements