Himalayas logo
PandaDocPA

Machine Learning Engineer - Document Intelligence & Applied GenAI

PandaDoc is an all-in-one document automation software that streamlines the process of creating, approving, and eSigning proposals, quotes, and contracts.

PandaDoc

Employee count: 501-1000

Stay safe on Himalayas

Never send money to companies. Jobs on Himalayas will never require payment from applicants.

The landscape of AI is evolving rapidly, and PandaDoc is investing heavily in machine learning to power the next generation of intelligent document workflows. Our goal is to build scalable, production-grade AI systems that automate document understanding, extract structured data at scale, and enable new AI-first product experiences for tens of thousands of businesses.

As an ML Engineer focused on Document Intelligence and GenAI, you will design, train, evaluate, and optimize models that transform unstructured documents into high-quality structured data. You’ll work across the full stack of model development—datasets, training, inference, deployment pipelines—and help bring cutting-edge research into real production systems at scale.

What makes this role unique?

  • Document Intelligence at Scale: Your work will directly power PandaDoc’s core AI capabilities—from layout detection and OCR to structured extraction, retrieval, and document-based reasoning.
  • High Ownership, High Impact: You will design end-to-end ML systems, influence roadmap decisions, and work closely with product, engineering, and design to define requirements and ship production AI features.
  • Real-World ML Challenges: You’ll tackle model robustness, evaluation, latency, observability, RAG quality, model routing, and the complexities of deploying AI systems that must perform reliably on millions of documents.
  • Deep GenAI Integration: You’ll experiment with frontier and open-source models, integrate vision–language systems, and build efficient pipelines for inference, guardrails, fine-tuning, and document-aware reasoning.

In this role, you will:

  • Model Development Evaluation
    • Build and maintain evaluation frameworks for document models, LLMs, OCR, and structured extraction.
    • Define metrics, benchmarks, and validation strategies for real-world document workloads.
  • Dataset Pipeline Creation
    • Design and curate high-quality datasets for supervised training, fine-tuning, and validation.
    • Create scalable preprocessing pipelines for PDFs, scans, images, forms, and semi-structured documents.
  • Model Training Fine-Tuning
    • Train and fine-tune transformer-based OCR, VLMs, layout models, and open-source LLMs for document understanding tasks.
    • Optimize models for reliability, accuracy, and cost efficiency in production environments.
  • Inference Deployment
    • Deploy ML models with modern inference runtimes (vLLM, TGI, TensorRT, ONNX Runtime).
    • Build guardrails, monitoring, and fallback mechanisms to ensure safe and predictable model behavior.
  • RAG Document Reasoning
    • Develop retrieval and chunking strategies tailored to document structures (tables, forms, multi-page PDFs).
    • Optimize end-to-end RAG pipelines for semantic search, QA, and workflow automation.
  • Cross-Functional Collaboration
    • Partner with PMs, backend engineers, and product designers to define AI opportunities and translate requirements into technical solutions.

About you:

We are expanding our AI/ML function with an ML Engineer who specializes in document intelligence, vision–language models, and LLM-based extraction and reasoning. You should be comfortable with both traditional document AI approaches and cutting-edge GenAI workflows. You thrive in fast-moving environments, are self-directed, and enjoy solving practical ML problems that directly impact customers.

We’re looking for someone with experience in:

  • Vision transformers, layout models, and OCR systems
  • Structured extraction from complex documents
  • RAG for document-heavy workloads
  • Optimizing LLM pipelines for cost, accuracy, and throughput
  • Deploying and benchmarking models in real production systems

Required Experience

  • 5+ years of Python experience
  • Experience training, fine-tuning, and deploying traditional computer vision models for document intelligence tasks (layout detection, table extraction, OCR, information extraction)
  • Hands-on experience with document understanding frameworks and models:
    • Traditional document AI models (LayoutLM, Donut, DocFormer)
    • Modern vision-language models with OCR capabilities (DeepSeek-OCR, LightOnOCR-1B, etc.)
    • Experience deploying and optimizing models using inference frameworks such as vLLM (preferred), TGI, TensorRT, or ONNX Runtime
    • Experience applying LLMs to document intelligence workflows, including both frontier models and open-source alternatives
    • Strong understanding of coordinate systems and spatial reasoning for absolute positioning and field detection in forms/documents

It would be awesome if you had:

  • Familiarity with PDF parsing libraries and document preprocessing pipelines
  • Experience fine-tuning open-source models for domain-specific document tasks
  • Knowledge of evaluation metrics for document understanding tasks (F1, exact match, etc.)

Company Overview:

PandaDoc empowers more than 67,000 growing organizations to thrive by taking the work out of document workflow. PandaDoc provides an all-in-one document workflow automation platform that helps fast scaling teams accelerate the ability to create, manage, and sign digital documents including proposals, quotes, contracts, and more. For more information, please visit https://www.pandadoc.com.

Company Culture:

We're known for our work-life balance, kind co-workers, creative virtual team-bonding events. And although our Pandas are located across the globe, we stay connected with the help of technology and ensure that everyone on our team feels, well, like a team.

Pandas work best when they're happy. We retain our talent by upholding our values of integrity transparency, and selling a product that changes the lives of our customers.

Check out our LinkedIn to learn more.

Benefits:

  • An honest, open culture that emphasizes feedback and promotes professional and personal development
  • An opportunity to work from anywhere — our team is distributed worldwide, from Lisbon to Manila, from Florida to California
  • 6 self care days
  • A competitive salary
  • And much more!

PandaDoc is an Equal Opportunity Employer. We are committed to equal treatment of all employees without regard to race, national origin, religion, gender, age, sexual orientation, veteran status, physical or mental disability or other basis protected by law.

EXTERNAL RECRUITERS

Approval Requirement

The use of external recruiters/staffing agencies requires prior approval from our HR Team. The HR Team at PandaDoc requests that external recruiters/staffing agencies not to contact PandaDoc employees directly in an attempt to present candidates. Complying with this request will be a factor in determining future professional relationships with PandaDoc.

About the job

Apply before

Posted on

Job type

Full Time

Experience level

Mid-level
Senior

Location requirements

Open to candidates from all countries.

Hiring timezones

Worldwide

About PandaDoc

Learn more about PandaDoc and their company culture.

View company profile

At the heart of PandaDoc lies a vibrant, remote-first culture built on a foundation of empowering teams and simplifying the complexities of modern work. The company's journey began with a simple yet powerful goal: to make work easier. This initial vision, born from the founders' desire to escape inefficient processes, has blossomed into a global platform trusted by tens of thousands of businesses. The core mission is to deliver transformative technology that streamlines how companies of all sizes operate and achieve success. PandaDoc is committed to helping its customers build trust with every interaction by making business agreements faster, more transparent, and less of a hassle. This allows teams to shift their focus from cumbersome paperwork to the strategic initiatives that truly drive them forward.

The company's culture is defined by a set of core values encapsulated in the acronym L.I.F.E.: Learn, Impact, Fun, and Empathy. These aren't just words on a wall; they are the principles that guide how the team shows up, collaborates, and grows together. 'Learn' fosters a continuous growth mindset, encouraging curiosity and personal development. 'Impact' drives a passion for building a product that makes a real difference for customers and the community. 'Fun' is about enjoying the journey and celebrating successes together, creating a positive and engaging work environment. Finally, 'Empathy' is central to how they interact with colleagues and customers, ensuring a supportive and understanding atmosphere. This people-first approach is evident in their commitment to a diverse, international team that spans dozens of countries and cultures. PandaDoc believes that this diversity is a superpower, leading to stronger products, more innovative solutions, and a richer, more inclusive workplace where every 'Panda' feels valued and can thrive.

Employee benefits

Learn about the employee benefits and perks provided at PandaDoc.

View benefits

Commuter benefits

PandaDoc offers commuter benefits.

401K Plan

PandaDoc offers a 401K plan to its employees.

Company Equity

PandaDoc offers company equity to its employees.

Generous PTO

20 days PTO for years 1-3, 25 days after year 3.

View PandaDoc's employee benefits
Claim this profilePandaDoc logoPA

PandaDoc

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

50 remote jobs at PandaDoc

Explore the variety of open remote roles at PandaDoc, offering flexible work options across multiple disciplines and skill levels.

View all jobs at PandaDoc

Remote companies like PandaDoc

Find your next opportunity by exploring profiles of companies that are similar to PandaDoc. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan
PandaDoc hiring Machine Learning Engineer - Document Intelligence & Applied GenAI • Remote (Work from Home) | Himalayas