Himalayas logo
TetraScienceTE

Lead Platform Engineer - Search Platform

TetraScience is replatforming and reengineering the world’s scientific data in the cloud to solve humanity’s grand challenges.

TetraScience

Employee count: 51-200

United States only

Stay safe on Himalayas

Never send money to companies. Jobs on Himalayas will never require payment from applicants.

Who We Are

TetraScience is the Scientific Data and AI company. We are catalyzing the Scientific AI revolution by designing and industrializing AI-native scientific data sets, which we bring to life in a growing suite of next gen lab data management solutions, scientific use cases, and AI-enabled outcomes.

TetraScience is the category leader in this vital new market, generating more revenue than all other companies in the aggregate. In the last year alone, the world’s dominant players in compute, cloud, data, and AI infrastructure have converged on TetraScience as the de facto standard, entering into co-innovation and go-to-market partnerships: Latest News and Announcements | TetraScience Newsroom:

In connection with your candidacy, you will be asked to carefully review the Tetra Way letter, authored directly by Patrick Grady, our co-founder and CEO. This letter is designed to assist you in better understanding whether TetraScience is the right fit for you from a values and ethos perspective.

It is impossible to overstate the importance of this document and you are encouraged to take it literally and reflect on whether you are aligned with our unique approach to company and team building. If you join us, you will be expected to embody its contents each day.

The Role

We are seeking a Lead Software Engineer to help expand our scientific search platform beyond traditional keyword search and unlock new capabilities in chemical search, semantic search, and natural language search. In this role, you will work at the intersection of AI/ML, cheminformatics, knowledge representation, and distributed systems, enabling scientists to retrieve and reason over complex experimental datasets, chemical entities, assay results, and unstructured lab documents.

As a technical leader of the Search Platform team, you will guide, design and implement new search capabilities while also owning the underlying infrastructure that powers them. Lead by example for building the systems, mentor engineers, and help shape the roadmap for search capabilities and platform evolution. This includes maintaining scalable, reliable services and continuously improving the platform as we expand our search offerings. You will collaborate closely with Applied AI Scientists, platform engineers, and product teams to deliver high performance search services that drive discovery, analysis, and decision making across the bio-pharma R&D lifecycle.

If you are passionate about building scalable search systems, advancing scientific retrieval, and supporting production scale AI workloads, we’d love to talk to you.

What You will Do

  • Lead by example to architect and code the next-generation scientific search engine, building a system that can reason over billions of scientific data points—from chemical structures (SMILES) to unstructured lab documents and instrument data.
  • Engineer sophisticated hybrid search pipelines that blend sparse (keyword), structured (metadata), and dense (vector) retrieval. You will go beyond out-of-the-box OpenSearch to design custom ranking logic, reciprocal rank fusion, and relevance tuning that surfaces the exact "needle in the haystack" for drug discovery.
  • Own and operate the Search Platform infrastructure, ensuring high availability, scalability, performance, and observability across indexing, embedding generation, and query execution.
  • Develop and maintain backend services and APIs in Python and TypeScript that power search capabilities for scientists, data engineers, and AI applications.
  • Collaborate with Applied AI Scientists to integrate embeddings, transformer models, and chemical fingerprints into production search workflows.
  • Architect and implement scientific entity resolution and knowledge graph pipelines to transform raw text into interconnected knowledge. You will design systems that extract and link chemical and biological entities (NER/NED) from unstructured documents, enabling the search engine to "understand" relationships between compounds, targets, and assays.
  • Continuously improve search quality through evaluation metrics such as precision@K, recall@K, MRR, and relevance testing with real scientific use cases.
  • Ensure security, compliance, and tenant isolation as part of operating search services in enterprise bio-pharma environments.
  • Contribute to architectural decisions, technical strategy, and platform-wide improvements to accelerate scientific insight generation.

Requirements

  • 10+ years of backend or platform engineering experience building distributed, production grade systems.
  • Hands-on experience with search technologies such as Elasticsearch/OpenSearch, Lucene, or vector databases
  • Strong understanding of semantic search concepts embeddings, transformers, similarity scoring, ranking logic, relevance tuning, hybrid retrieval.
  • Expert-level coding skills in TypeScript and Python building robust APIs and backend services.
  • Experience building and operating microservices or search infrastructure on cloud platforms (AWS preferred), including containerization, CI/CD, observability, and performance tuning.
  • Familiarity with scientific or unstructured data processing, such as documents, tables, analytical results, or experimental datasets.
  • Strong problem solving skills, with the ability to navigate ambiguous scientific workflows and translate them into engineered systems.
  • Excellent communication and collaboration skills comfortable working alongside scientists, AI researchers, and product teams.
  • Exposure to NLP, LLMs, embedding generation, or retrieval-augmented workflows.
  • Experience with large-scale data platforms such as Databricks, Lakehouse architectures, or distributed indexing systems.

Nice to Have

  • Experience with cheminformatics tools and libraries (e.g., RDKit), including molecular fingerprints, similarity metrics, or substructure search.
  • Prior experience implementing chemical search systems, such as SMILES parsing, normalization, or chemical indexing.
  • Knowledge of vector databases / embeddings stores (e.g., OpenSearch) to support semantic search and RAG.

Benefits

  • 100% employer-paid benefits for all eligible employees and immediate family members
  • Unlimited paid time off (PTO)
  • 401K
  • Flexible working arrangements - Remote work
  • Company paid Life Insurance, LTD/STD
  • A culture of continuous improvement where you can grow your career and get coaching
  • We are not currently providing visa sponsorship for this position.

About the job

Apply before

Posted on

Job type

Full Time

Experience level

Senior
Manager

Location requirements

Hiring timezones

United States +/- 0 hours

About TetraScience

Learn more about TetraScience and their company culture.

View company profile

TetraScience is replatforming and reengineering the world’s scientific data in the cloud to solve humanity’s grand challenges.

The world’s first and only open R&D Data Cloud

The Tetra R&D Data Cloud provides global life sciences companies the flexibility, scalability, and data-centric capabilities to enable easy access to centralized, harmonized, and actionable scientific data.

Our Story

TetraScience is the R&D Data Cloud company with a mission to transform life sciences R&D, accelerate discovery, and improve and extend human life. While the company was founded in 2014, we began our R&D Data Cloud journey in 2019 with the origin of the Tetra Data Platform (TDP). We encourage you to read more about our company history in this blog post: Tetra 1.0 and 2.0 | The Noise and the Signal.

The Tetra R&D Data Cloud provides life sciences companies with the flexibility, scalability, and data-centric capabilities to enable easy access to centralized, harmonized, and actionable scientific data and is actively deployed across enterprise pharma and biotech organizations. As an open platform, TetraScience has built the largest integration network of lab instruments, informatics applications, CRO/CDMOs, analytics, and data science partners, creating seamless interoperability and an innovation feedback loop that will drive the future of life sciences R&D.

Our Mission

To be the most trusted and valuable partner to the world’s scientific community, fueling innovation, facilitating collaboration, accelerating discovery, and enhancing and extending human life.

Employee benefits

Learn about the employee benefits and perks provided at TetraScience.

View benefits

Disability insurance

We'll cover your disability insurance.

Retirement benefits

401(k) plans to help you invest in your future.

Healthcare benefits

100% company-paid medical, dental, and vision insurance.

Life insurance

We'll cover your life insurance so you don't have to worry.

View TetraScience's employee benefits
Claim this profileTetraScience logoTE

TetraScience

Company size

51-200 employees

Founded in

2019

Chief executive officer

Patrick Grady

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

6 remote jobs at TetraScience

Explore the variety of open remote roles at TetraScience, offering flexible work options across multiple disciplines and skill levels.

View all jobs at TetraScience

Remote companies like TetraScience

Find your next opportunity by exploring profiles of companies that are similar to TetraScience. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan