We are seeking a Data Scientist to join our Data Operations team. The ideal candidate will have expertise in Generative AI, Machine Learning, and NLP, with a focus on customizing and optimizing existing RAG pipelines. This is a remote work opportunity, and we value diversity and inclusivity.
Responsibilities and Requirements
- Collect and analyze data, develop models, define quality metrics, assess model quality, and present results regularly to stakeholders.
- Create production-ready Python packages for each component of data science pipelines and coordinate their deployment with the technology team.
- Design, develop, and deploy Generative AI models and solutions that meet specific business needs.
- Expertise in optimizing and customizing existing Retrieval-Augmented Generation (RAG) pipelines to meet specific project needs.
- Proficiency in large-scale data ingestion, preprocessing, and transformation of multilingual content to ensure high-quality inputs for downstream models.
- Experience building Agentic RAG systems is a strong requirement.
- Experience with LangChain, AutoGen, Haystack, MCP, or similar AI agent frameworks and tools.
- Fine-tune large language models (LLMs) and transformer models to enhance accuracy and relevance.
- Implement guardrails and evaluation mechanisms to ensure responsible and ethical AI usage.
- Conduct rigorous testing and evaluation of AI models to ensure high performance and reliability.
- Integrate data science components and ensure end-to-end quality assessment.
- Maintain the robustness of data science pipelines against model drift and ensure consistent output quality.
- Establish a reporting process for pipeline performance and develop automatic re-training strategies for existing pipelines.
- Work collaboratively with cross-functional teams to integrate AI solutions into existing products and services.
- Mentor junior data scientists and contribute to the knowledge-sharing culture within the team.
- Stay up to date with the latest advancements in AI, machine learning, and NLP technologies.
Benefits
- Health insurance options for you and your family
- Group life and accident insurance for financial security
- Employee assistance programs and mental health resources
- Flexible working arrangements for work-life balance
- Paid time-off options, including sick leave, vacation, and public holidays
- Subsidized meals and free transportation in select locations
