- Work with Apache Spark batch and real-time streaming to process data at scale
- Work with Scala microservices hosted on K8s (GKE) to support our products
- Work with MLlib and TensorFlow on Vertex AI to solve forecasting and simulations in addition to a variety of machine learning challenges
- Deploy new models and engines for new ideas and analysis
- At least 5 years of commercial experience with Big Data
- Advanced knowledge of different non-relational schema models (Column Family, Graph, Document, Object)
- Experience with advanced topologies and architecture design such as Kappa and Lambda architecture
- Experience with GCP Dataflow
- Experience with Scala
- Experience with Python
- Experience with high-throughput systems
- Experience with LLM
- At least Advanced level of English
PERSONAL PROFILE
- Excellent communication and interpersonal skills, with the ability to collaborate effectively with cross-functional teams and stakeholders
- Strong problem-solving and decision-making skills, with a focus on driving results and meeting deadlines
- Self-motivated, adaptable, and eager to learn new technologies and frameworks
We are seeking a Data Engineer with strong Scala expertise to support and enhance the company’s data infrastructure and machine learning models. The ideal candidate is passionate about building scalable, resilient pipelines and is eager to innovate using cutting-edge technology in a high-impact environment.
You will contribute to the core systems and data services by applying state-of-the-art solutions to solve complex and large-scale engineering challenges.
CUSTOMER
Our client is a fast-growing technology company focused on solving real-world challenges. Since day one, they have been committed to building highly resilient and scalable systems that allow for experimentation and innovation. Their data platform is designed to be both powerful and flexible, offering engineers the freedom to introduce new ideas and experiment with the latest technologies.
PROJECT
The project involves enhancing the client’s large-scale data processing systems and supporting machine learning infrastructure. You’ll be working with a team of experienced engineers to design, build, and maintain core data pipelines and services using Scala and other advanced technologies. Your work will directly impact the efficiency and intelligence of the platform powering.