Himalayas logo
CASTCA

Data Engineer / Data Enablement with AI for AI

CAST Software pioneers the software intelligence domain, enabling organizations to gain automated insights into their complex software systems for improved efficiency and control.

CAST

Employee count: 201-500

France only

Stay safe on Himalayas

Never send money to companies. Jobs on Himalayas will never require payment from applicants.

CAST, a Software Company based in Meudon , is the market leader in Software Intelligence.

Working at CAST R&D means being an important part of a highly-talented, fast-paced, multicultural and Agile team .

Overview

Were building the foundation to ground AI with AAA Software Intelligence Aggregated,

Accurated, and Augmented sourced from real-world software and technology projects. This

role goes beyond manual curation: it's about using AI to empower AI. You will leverage LLMs,

embeddings, and NLP tools to clean, enrich, and validate data, enabling AI systems and

autonomous agents to rely on it for training and contextual understanding.

Responsibilities

Aggregate and structure data from software ecosystems (codebases, APIs, tickets,

documentation, architecture specs).

Apply LLMs, embeddings, and NLP tools to automate: data cleaning, entity extraction,

metadata tagging, and semantic annotation.

Build and maintain semantic pipelines for LLM fine-tuning and RAG (Retrieval-Augmented

Generation).

Organize datasets into formats suitable for Agent-to-Agent (A2A) interactions: APIs, vector

DBs, knowledge graphs, etc.

Collaborate with AI teams to evolve schemas, prompts, labeling strategies, and evaluation

data.

Ensure strong data lineage, reproducibility, and version control.

Requirements

3+ years in data engineering, ML data ops, or structured data curation.

Proficient in Python, with strong data pipeline skills (Pandas, PyArrow, regex, Airflow).

Experience with LLMs or NLP tools (e.g., Hugging Face, spaCy, LangChain).

Ability to use AI to clean, enrich, classify, and organize technical content.

Strong understanding of tokenization, chunking, and model input preparation.

Experience working with software project data: Git repos, APIs, technical documentation, etc.

Bonus Skills

Knowledge of vector DBs (FAISS, Qdrant, Weaviate) or knowledge graphs (Neo4j, RDF,

SPARQL).

About the job

Apply before

Posted on

Job type

Full Time

Experience level

Mid-level
Senior

Location requirements

Hiring timezones

France +/- 0 hours

About CAST

Learn more about CAST and their company culture.

View company profile

CAST Software is at the forefront of software intelligences, delivering innovative solutions that transform how businesses understand their proprietary applications. With a mission to navigate the increasing complexity of custom-built software, CAST provides tools that automatically analyze and offer insights into application inner workings. Founded in 1990, CAST has grown from its humble beginnings to operate in nine countries across three continents.

The company's flagship products, CAST Highlight and CAST Imaging, act as dynamic control towers and provide deep insights into software architecture and performance, respectively. These tools allow organizations to streamline operations, reduce maintenance costs, and enhance software quality, ensuring that digital transformation efforts are both efficient and effective. With extensive investment in research and development exceeding $200 million, CAST’s technology is utilized by leading global enterprises and top consulting firms, including BCG and Accenture. CAST is committed to empowering businesses to adapt quickly to market changes while ensuring robust control over application lifecycles.

Claim this profileCAST logoCA

CAST

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

11 remote jobs at CAST

Explore the variety of open remote roles at CAST, offering flexible work options across multiple disciplines and skill levels.

View all jobs at CAST

Remote companies like CAST

Find your next opportunity by exploring profiles of companies that are similar to CAST. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan
CAST hiring Data Engineer / Data Enablement with AI for AI • Remote (Work from Home) | Himalayas