HimalayasHimalayas logo
SocotraSO

Staff ML Platform Engineer – Large Scale Training (LLMOps/MLOps)

Socotra is the first cloud-native core platform for insurance, enabling carriers to rapidly develop and deploy new products.

Socotra

Employee count: 51-200

United States only

Stay safe on Himalayas

Never send money to companies. Jobs on Himalayas will never require payment from applicants.

Build the Future of Scalable AI at TrueFoundry

At TrueFoundry , we’re redefining how ML teams train, deploy, and scale their models. Our LLMOps and MLOps platform empowers organizations to experiment faster, train large-scale models reliably, and deploy them seamlessly on Kubernetes—with the same muscle as Big Tech.

We're looking for ML Systems Engineers who are passionate about scaling deep learning workloads, optimizing multi-GPU training, and shipping production-grade solutions. If you live and breathe PyTorch, multi-node training, and love solving gnarly infra challenges—this is your place.

What You’ll Work On

  • Write clean, modular, and scalable Python code , with a strong emphasis on reliability and performance.
  • Build platform for training and finetuning large-scale ML models across multi-GPU, multi-node clusters with PyTorch, Kubeflow, and other orchestration tools.
  • Own the infrastructure and code that enables high-throughput, low-latency inference pipelines for state-of-the-art models.
  • Build platform for developing, deploying and evaluating agentic applications for our end customers.
  • Help shape internal standards and best practices across the engineering team for high-scale ML workloads.

What We’re Looking For

  • 5+ years of hands-on experience building and deploying ML systems at scale.
  • 5+ years of writing production quality high performance code.
  • Deep experience with multi-GPU/multi-node training , ideally with PyTorch as your primary framework.
  • Experience working with torch, high-level ML frameworks, and inference engines (vLLM or TensorRT).
  • Experience with Kubernetes is highly preferred; exposure to Kubernetes-native tools is a huge plus.
  • A pragmatic mindset—you know when to optimize and when to ship.
  • Bonus: Familiarity with open-source LLM training/fine-tuning.

Why Join TrueFoundry?

  • Work directly with ex-Facebook engineers and founders from IIT Kharagpur, UC Berkeley, and Y Combinator alumni .
  • First-hand exposure to building and scaling a deep-tech startup —insights you’ll carry if you want to start your own one day.
  • Be part of a fearlessly experimental culture focused on customer success and long-term impact.

Flexible hours, learning credits, and the opportunity to work shoulder-to-shoulder with the co-founders (Abhishek & Nikunj).

#J-18808-Ljbffr

About the job

Apply before

Posted on

Job type

Full Time

Experience level

Location requirements

Hiring timezones

United States +/- 0 hours

About Socotra

Learn more about Socotra and their company culture.

View company profile

We founded Socotra with a singular mission: to provide the insurance industry with a truly modern, cloud-native core platform that enables agility, speed, and innovation. We recognized that for too long, insurers were held back by rigid, legacy systems that made launching new products and adapting to market changes incredibly difficult and expensive. We believed that insurance—a vital industry that underpins the global economy—deserved better technology.

Our platform is built from the ground up using modern engineering principles. We offer the first truly cloud-native core system that supports the entire policy lifecycle, from underwriting and policy administration to billing and claims. By leveraging open APIs and a flexible data model, we empower insurers to integrate seamlessly with other best-in-class technologies and launch products in weeks, not years. We are passionate about transparency and quality, which is why our documentation is publicly available and our platform is designed to be intuitive for developers and business users alike. We are committed to helping insurers of all sizes, from innovative startups to global carriers, modernize their operations and deliver superior experiences to their customers.

Employee benefits

Learn about the employee benefits and perks provided at Socotra.

View benefits

Parental Leave

Paid maternity and paternity leave.

Life Insurance

Life insurance coverage for employees.

Company Events

Regular company social outings and events.

Equity

Competitive salary and meaningful equity packages.

View Socotra's employee benefits
Claim this profileSocotra logoSO

Socotra

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

Remote companies like Socotra

Find your next opportunity by exploring profiles of companies that are similar to Socotra. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan