Aleph AlphaAA

AI Inference Engineer - Large Language Models (f/m/d)

Aleph Alpha GmbH specializes in advanced generative AI technology to empower enterprises and public organizations, focusing on data sovereignty and ethical AI applications.

Aleph Alpha

Employee count: 51-200

Germany only

Overview:

You will join our product team in a position that sits at the intersection of artificial intelligence research and real-world solutions. We foster a highly collaborative work culture where you can expect to work closely with your teammates and have a high level of communication between teams through methodologies such as pair or mob programming.

Your responsibilities:

  • Model Inference: Focus on inference optimization to ensure rapid response times and efficient resource utilization during real-time model interactions.

  • Hardware Optimization: Run models on various hardware platforms, from high-performance GPUs to edge devices, ensuring optimal compatibility and performance.

  • Experimentation and Testing: Regularly run experiments, analyze outcomes, and refine the strategies to achieve peak performance in varying deployment scenarios.

  • Staying up to date with the current literature on MLSys

Your profile:

  • You care about making something people want. You want to ship something that will bring value to our users. You want to deliver AI solutions end-to-end and not finish building a prototype.

  • Bachelor's degree or higher in computer science or a related field.

  • You understand how multimodal transformers work.

  • You understand the characteristics of LLM inference (KV caching, flash attention, and model parallelization).

  • You have hands-on experience with large language models or other complex AI architectures.

  • You have experience in system design and optimization, particularly within AI or deep learning contexts.

  • You are proficient in Python and have deep understanding of deep learning frameworks such as PyTorch.

  • A deep understanding of the challenges associated with scaling AI models for large user bases.

Nice if you have:

  • Previous experience in a high-growth tech environment or a role focused on scaling AI solutions.

  • Expertise with CUDA and Triton programming and GPU optimization for neural network inference.

  • Experience with Rust.

  • Experience in adapting AI models to suit a range of hardware, including different accelerators.

  • Experience in model quantization, pruning, and other neural network optimization methodologies.

  • A track record of contributions to open-source projects (please provide links).

  • Some Twitter presence discussing ML Sys topics.

What you can expect from us:

  • Become part of an AI revolution!

  • 30 days of paid vacation

  • Access to a variety of fitness & wellness offerings via Wellhub

  • Mental health support through nilo.health

  • Substantially subsidized company pension plan for your future security

  • Subsidized Germany-wide transportation ticket

  • Budget for additional technical equipment

  • Flexible working hours for better work-life balance and hybrid working model

  • Virtual Stock Option Plan

  • JobRad® Bike Lease

About the job

Apply before

Posted on

Job type

Full Time

Experience level

Mid-level

Location requirements

Hiring timezones

Germany +/- 0 hours

About Aleph Alpha

Learn more about Aleph Alpha and their company culture.

View company profile

Aleph Alpha GmbH is a pioneering AI company located in Heidelberg, Germany. Our mission centers around providing sovereign generative AI technology designed to boost businesses and governments, ensuring a competitive edge in the rapidly evolving AI economy. We focus on the integration of cutting-edge AI models into practical applications that are secure, efficient, and aligned with ethical standards.

The platform we offer, known as PhariaAI, is a robust AI application development tool tailored to address the complex needs of enterprises while ensuring data reliability and compliance with ever-changing legal requirements. At Aleph Alpha, we believe in not just harnessing AI, but enabling organizations to thrive through the seamless collaboration between human expertise and advanced AI insights. Our commitment to ethical AI development ensures that every solution we provide is geared towards fostering trust, transparency, and sustainability, shaping a digital landscape that remains under the control of European values and principles.

Claim this profileAleph Alpha logoAA

Aleph Alpha

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

21 remote jobs at Aleph Alpha

Explore the variety of open remote roles at Aleph Alpha, offering flexible work options across multiple disciplines and skill levels.

View all jobs at Aleph Alpha

Remote companies like Aleph Alpha

Find your next opportunity by exploring profiles of companies that are similar to Aleph Alpha. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 85,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan