Data Engineer (Python)
Company
Orcrist is building a next-generation data intelligence platform that handles petabyte-scale data with sub-second queries. Our product is a Kubernetes‑based platform delivered as B2B SaaS or as a self‑hosted on‑prem solution, including air‑gapped deployments. We enable customers across defense, law enforcement, and enterprise to turn mission-critical data into actionable intelligence.
Role
Join our team to build the unified data platform that powers search, graph, analytics, and AI workloads. You will design and operate batch and streaming pipelines that keep our lake/lakehouse reliable, versioned, and high quality. You’ll work with Apache NiFi, Kafka, Spark, Hudi, and Python to deliver trustworthy data to application, ML, and analytics teams.
What You'll Do
- Build and operate data ingestion pipelines using Apache NiFi and Kafka.
- Implement scalable batch and streaming transformations in Python and SQL.
- Design and maintain lakehouse data models powering search, graph, and analytics.
- Set up data versioning with Apache Hudi, along with data quality checks and monitoring.
- Maintain the data catalog and lineage so teams can discover and trust datasets.
About You
- 3+ years of production data engineering experience, with strong Python (pandas/PySpark) and SQL skills.
- Experience building large-scale batch and streaming pipelines across multiple data stores.
- Comfortable with Kafka, data integration tools (NiFi or similar), Docker, and Kubernetes.
- Familiar with data governance, lineage, and testing frameworks (e.g. Great Expectations).
- Collaborative, self-directed, diligent about documentation, and happy to work remotely from within Germany.
Nice‑to‑haves
- German language skills (B1+) and exposure to government/defense or law-enforcement domains.
- Experience with graph databases and search systems (e.g. Elasticsearch, OpenSearch).
- Familiarity with Apache Flink, Apache Beam, or ML engineering fundamentals.
- Background as a military reservist or related experience.
What We Offer
- Modern architecture & stack.
- Remote‑first in Germany with occasional team events in Berlin.
- Home office budget and great equipment.
- 30 days vacation.
- Direct impact on critical missions across private and public‑sector customers.
