While technology is the heart of our business, a global and diverse culture is the heart of our success. We love our people and we take pride in offering them a culture built on transparency, diversity, integrity, learning and growth.
If working in an environment that encourages you to innovate and excel, not just in your professional life but also in your personal life, interests you, you would enjoy your career with Quantiphi!
About Quantiphi:
Quantiphi is an award-winning Applied AI and Big Data software and services company, driven by a deep desire to solve transformational problems at the heart of businesses. Our signature approach combines groundbreaking machine learning research with disciplined cloud and data-engineering practices to create breakthrough impact at unprecedented speed.
Company Highlights:
Quantiphi has seen 2.5x growth YoY since its inception in 2013. We don’t just innovate; we lead. We are headquartered in Boston, with 4,000+ Quantiphi professionals across the globe. As an Elite/Premier Partner for Google Cloud, AWS, NVIDIA, Snowflake, and others, we’ve been recognized with:
- 17x Google Cloud Partner of the Year awards in the last 8 years.
- 3x AWS AI/ML award wins.
- 3x NVIDIA Partner of the Year titles.
- 2x Snowflake Partner of the Year awards.
- We have also garnered top analyst recognitions from Gartner, ISG, and Everest Group.
- We offer first-in-class industry solutions across Healthcare, Financial Services, Consumer Goods, Manufacturing, and more, powered by cutting-edge Generative AI and Agentic AI accelerators.
- We have been certified as a Great Place to Work for the third year in a row: 2021, 2022, and 2023.
Be part of a trailblazing team that’s shaping the future of AI, ML, and cloud innovation. Your next big opportunity starts here!
Work Location: Bedminster, NJ or Dallas, TX
Responsibilities:
- Lead AI/ML program execution, ensuring timely delivery of scalable, production-grade RAG/LLM/Agentic solutions.
- Define program roadmaps through PI planning sessions, milestones, and deliverables for AI-driven initiatives across multiple teams.
- Manage LLM infrastructure, GPU optimization, AI inferencing pipelines, and large-scale model deployment strategies.
- Oversee the implementation of RAG, Agentic Workflows, multi-agent LLM systems, and Retrieval-augmented QA pipelines.
- Manage client engagement and delivery in line with contract terms and expectations.
- Manage project delivery and the team, and ensure positive customer relations.
- Drive project margin optimization using Gen AI-based tools and accelerators.
- Collaborate with our diverse and global teams to deliver committed results to our clients.
- Lead AI-driven engagements, ensuring alignment with business goals, technical feasibility, and governance frameworks.
- Develop and execute strategic roadmaps for LLM-based solutions, including RAG (Retrieval-Augmented Generation), Agentic RAG, and Agent-driven workflows.
- Manage cross-functional teams, including ML engineers, data scientists, software developers, and consultants to deliver AI solutions.
- Collaborate with stakeholders to define technical architecture, infrastructure requirements, and optimization techniques.
- Implement scalable AI agent architectures, ensuring integration with LangChain, NVIDIA NeMo, and Triton Inference Server.
- Track project performance, set KPIs, and provide executive-level reporting on outcomes and ROI.
- Guide AI model evaluation, MLOps pipeline integration, and fine-tuning strategies for scalable AI solutions.
- Support AI compliance strategies, ensuring alignment with data privacy, security, and responsible AI practices.
Skill Set Required:
- More than 8 years of program management experience.
- Strong leadership and multi-stakeholder management skills.
- Multi-workstream project management experience, ensuring customer success and account growth.
- Ability to maintain a positive work environment and ensure the career growth of team members.
- Tight delivery execution and reporting to senior management at the client organization and at Quantiphi.
- Mentoring team members for career progression & upskilling to drive better solution outcomes.
- Team leadership experience and the ability to work as a project lead.
- Excellent communication, presentation, and storytelling skills.
- Must have experience with GCP, AWS, or Azure (LLM hosting, GPU-based inference, cost optimization).
- Experience managing large-scale AI projects leveraging LLMs (e.g., Llama, GPT, Claude, Mistral).
- Strong expertise in RAG, Agentic RAG, AI Agents, Vector DBs (e.g., FAISS, Pinecone, Weaviate, ChromaDB).
- Knowledge of LLM fine-tuning techniques, Low-Rank Adaptation (LoRA), and quantization (AWQ, GPTQ, FP8, INT4).
- Familiarity with Multi-GPU parallelization, model pruning, and knowledge distillation.
- Understanding of Governance frameworks (e.g., AI Ethics, Explainability, Risk Mitigation).
- Proficiency in NVIDIA NeMo, Triton Inference Server, and LangChain for agentic workflows.
What is in it for you:
- Be part of a team and company that has won NVIDIA's AI Services Partner of the Year three times in a row, with an unparalleled track record of building production AI applications on DGX and Cloud GPUs.
- Strong peer learning that will accelerate your growth across Applied AI, GPU computing, and softer skills such as technical communication.
- Exposure to working with highly experienced AI leaders at Fortune 500 companies and innovative market disruptors looking to transform their business with Generative AI.
- Access to state-of-the-art GPU infrastructure on the cloud and on-premise.
- Be part of the fastest-growing AI-first digital transformation and engineering company in the world.
If you like wild growth and working with happy, enthusiastic over-achievers, you'll enjoy your career with us!