Himalayas logo
Unstructured TechnologiesUT

Software Engineer - Public Sector

Unstructured.io is a company that specializes in transforming unstructured data from various formats into LLM-ready data, enabling enterprises to leverage their internal data for AI applications.

Unstructured Technologies

Employee count: 51-200

United States only
Ready to shape the future of AI infrastructure and build systems that power the most advanced unstructured data pipelines in the world? At Unstructured, we’re building the backbone of generative AI—enabling federal agencies, defense organizations, and public sector partners to transform PDFs, HTML, Word docs, images, and more into high-performance data pipelines that scale securely and reliably.
Our tools already power mission-critical workloads for half of the Fortune 500, and our open-source package has been downloaded 36+ million times. Now, we’re entering our next chapter—bringing these same innovations to the US government—and we’re hiring a Software Engineer - Public Sector to help lead the charge.
If you’re energized by solving hard technical problems that truly matter—and you want to build solutions that empower agencies and support national missions—this is your moment.

Active SECRET clearance required

What You’ll Own & Drive:

  • Own the technical vision for some of the most complex, high-impact systems powering federal and public sector data pipelines.
  • Drive architectural strategies that prioritize scale, performance, and security compliance.
  • Write production-ready code and lead projects that directly support mission-critical workloads.
  • Solve deep technical challenges like data orchestration at scale, secure data flow optimization, and AI-first infrastructure for sensitive workloads.
  • Align engineering, product, and go-to-market teams to deliver solutions that meet public sector compliance and security needs.
  • Guide architectural decisions that balance innovation with risk mitigation.

What You Bring:

  • 5–9 years building software for US government or Department of Defense (DOD) networks.
  • Active SECRET clearance required; TS/SCI strongly preferred
  • Deep expertise in Python, distributed architectures, and AWS, Azure, or GCP.
  • Proven success leading mission-critical technical initiatives and mentoring engineering teams.
  • Passion for performance, reliability, and elegant, compliant solutions.
  • Unstructured values service and encourages veterans of the US military and civilian agencies to apply to this role.
  • Ability and willingness to travel up to 20%

Bonus Points:

  • Experience with AI/ML systems, unstructured data, or real-time pipelines.
  • Familiarity with FedRAMP, IL5, or other compliance frameworks.
  • Expertise with Kubernetes, IaC, or scaling SaaS infrastructure.
  • Startup DNA—you thrive in fast-moving, high-impact environments.

Why You’ll Love It Here

  • Real Impact: Your work will directly support national missions and critical agency workloads.
  • Big Challenges: You won’t be bored—our problems are meaty, meaningful, and novel.
  • Elite Team: Work with sharp, low-ego builders obsessed with quality and execution.
  • Fast Growth: Help shape foundational tech and a growing federal practice at a high-growth AI company.

About the job

Apply before

Posted on

Job type

Full Time

Experience level

Mid-level

Location requirements

Hiring timezones

United States +/- 0 hours

About Unstructured Technologies

Learn more about Unstructured Technologies and their company culture.

View company profile

Unstructured addresses a significant challenge many enterprises face: leveraging their vast amounts of unstructured data for use with large language models (LLMs) and other AI applications. Customers often struggle with data in various formats like PDFs, Word documents, PowerPoint presentations, HTML files, images, and more, which are not readily usable by machine learning models. This is where Unstructured steps in, providing solutions to automate the preprocessing of this messy, human-generated data. Our platform transforms raw data into clean, structured formats, making it compatible with LLMs for tasks such as fine-tuning, pre-training, and Retrieval Augmented Generation (RAG).

Our customers need to unlock the potential of their internal data to enhance productivity, drive innovation, and gain actionable intelligence. Unstructured offers open-source libraries and commercial API products designed to simplify and accelerate this data transformation process. We enable organizations to connect their enterprise data, regardless of file type or layout, to LLMs efficiently. This means data scientists and engineers no longer need to spend the majority of their time on the laborious task of data preprocessing, which traditionally involves building custom, brittle pipelines for each data type. By providing robust tools for data ingestion, partitioning, cleaning, and staging, Unstructured empowers businesses to build powerful AI applications based on their own specific, high-quality data, rather than relying solely on generic, pre-trained models. This allows for more accurate, relevant, and secure AI-driven insights and workflows.

Claim this profileUnstructured Technologies logoUT

Unstructured Technologies

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

7 remote jobs at Unstructured Technologies

Explore the variety of open remote roles at Unstructured Technologies, offering flexible work options across multiple disciplines and skill levels.

View all jobs at Unstructured Technologies

Remote companies like Unstructured Technologies

Find your next opportunity by exploring profiles of companies that are similar to Unstructured Technologies. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan