AllCloudAL

LLM Architect

AllCloud specializes in tailored cloud solutions, optimizing AWS and Salesforce capabilities for organizations leveraging data and AI.

AllCloud

Employee count: 201-500

United States only

Description

LLM Architect

Location: US / Canada (Eastern Time) - Home based

Job Type: Full-time, Permanent

About AllCloud

AllCloud is a global professional services company providing organizations with cloud enablement and transformation tools. As an AWS Premier Consulting Partner and audited MSP, a Salesforce Platinum Partner, and a Snowflake Premier Partner, AllCloud helps clients connect their front and back offices by building a new operating model to harness the benefits of cloud technology and data and analytics.

Job Summary

We are looking for an innovative LLM Architect to lead the design and development of custom language models at AllCloud. This role will be responsible for architecting, training, and optimizing large language models based on modified transformer architectures. The ideal candidate will have deep expertise in NLP, transformer model design, and efficient training methodologies. You'll work alongside GPU Engineers and ML Engineers to create state-of-the-art language models that meet our customers' specific requirements, pushing the boundaries of what's possible with generative AI.

Responsibilities

  • Design custom transformer-based language model architectures tailored to specific use cases
  • Develop and implement modifications to transformer architectures to enhance performance, efficiency, or capabilities
  • Create and execute model pre-training, fine-tuning, and evaluation strategies
  • Implement techniques like quantization, pruning, and knowledge distillation to optimize model size and performance
  • Design and implement training data pipelines, including data selection, cleaning, and augmentation
  • Establish rigorous evaluation frameworks to assess model performance, fairness, and safety
  • Research and implement state-of-the-art techniques in LLM development
  • Create detailed documentation on model architectures, training methodologies, and performance characteristics
  • Collaborate with GPU Engineers to implement efficient training strategies across distributed systems
  • Work with customers to understand their unique requirements and translate them into model design decisions

Requirements

Summary of Key Requirements

  • 4+ years of experience in deep learning research or development with a focus on NLP and transformer models
  • Strong understanding of transformer architecture and its variants (GPT, BERT, T5, etc.)
  • Experience designing and training large language models from scratch
  • Expertise in PyTorch or TensorFlow for implementing custom model architectures
  • Knowledge of distributed training approaches for large models (DeepSpeed, Megatron, etc.)
  • Experience with model compression techniques (quantization, pruning, knowledge distillation)
  • Strong background in mathematics, particularly linear algebra, differential equations, probability, and statistics
  • Familiarity with current research in LLM development, including attention mechanisms, mixture of experts, and efficient training methods
  • Master's or PhD in Computer Science, Machine Learning, or related field
  • Publication record in NLP, LLMs, or transformer architecture (strongly preferred)

Certifications

  • AWS Machine Learning Specialty (Strongly Preferred)
  • NVIDIA-Certified Associate - Generative AI Multimodal (Preferred)

Why work for us?

Our team inspires progress in each other and in our customers through our relentless pursuit of excellence; you will work with leaders who promote learning and personal development.

AllCloud is an Equal Opportunity Employer and considers applicants for employment without regard to race, color, religion, sex, orientation, national origin, age, disability, genetics or any other basis forbidden under federal, provincial, or local law.

About the job

Apply before

Posted on

Job type

Full Time

Experience level

Senior

Location requirements

Hiring timezones

United States +/- 0 hours

About AllCloud

Learn more about AllCloud and their company culture.

View company profile

AllCloud is a leading global professional services company focused on providing organizations with tools for cloud enablement and transformation. By combining expertise with agility, AllCloud accelerates cloud innovation and helps maximize the value gained from cloud technology. Their specific focus on tailored solutions integrates AWS and Salesforce capabilities with data and analytics, enabling businesses to harness the full potential of the cloud.

The company prides itself on a goal-oriented approach that starts with an in-depth exploration of client objectives. AllCloud works closely with clients to craft customized roadmaps that ensure their cloud architecture evolves with their growing needs. This strategic partnership not only facilitates immediate results but also prepares organizations for future challenges. Recognized as an AWS Premier Consulting Partner and a Salesforce Platinum Partner, AllCloud has solidified its reputation by delivering successful cloud deployments for businesses of all sizes across various industries.

Claim this profileAllCloud logoAL

AllCloud

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

6 remote jobs at AllCloud

Explore the variety of open remote roles at AllCloud, offering flexible work options across multiple disciplines and skill levels.

View all jobs at AllCloud

Remote companies like AllCloud

Find your next opportunity by exploring profiles of companies that are similar to AllCloud. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 85,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan
AllCloud hiring LLM Architect • Remote (Work from Home) | Himalayas