
Senior Data Engineer (LLM)

Sunscrapers is a Warsaw-based software development company founded in 2010, specializing in data-driven software solutions, data engineering, DevOps, and custom web application development using Python, JavaScript, and AWS.


Employee count: 11-50

Poland only


Sunscrapers is a technology consultancy that empowers finance and healthcare leaders to succeed by leveraging cutting-edge software, data, and AI.

We combine world-class engineering, deep industry expertise, and proprietary know-how to deliver innovative, high-impact solutions. Specializing in software engineering, DevOps, data engineering, and data science, we design and build AI-powered data platforms and web applications tailored to each client’s unique needs.

Trusted by over 60 clients across the US, UK, and beyond, we consistently maintain a 4.9/5 client satisfaction rating, with partnerships averaging five years or more.

The project:

We are carrying out the project for our client, an American private equity and investment management fund - listed on the Forbes 500 list - based in New York.

We support them with infrastructure and the data platform, and more recently we have also been building and experimenting with Gen AI applications. The client operates broadly across finance, loans, investments, and real estate.

As a Senior Data Engineer, you’ll design and implement core systems that enable data science and data visualization at companies that use data-driven decision processes to create a competitive advantage.

You’ll build a data platform for data and business teams, including internal tooling, a data pipeline orchestrator, data warehouses, and more, using:

Technologies: Python, Terraform, SQL, Pandas, Shell scripts

Tools: Git, Docker, Snowflake, Pinecone, Neo4j, Jenkins, Jupyter Notebook, OpenAI API, Apache Airflow / Astronomer, Kubernetes, Artifactory, Windows with WSL, Linux, GitLab

AWS: EC2, ELB, IAM, RDS, Route53, S3, and more

Best Practices: Continuous Integration, Code Reviews

The ideal candidate will be well organized, eager to constantly improve and learn, driven and, most of all, a team player!

Your responsibilities will include:

  • Developing PoCs using the latest technologies and experimenting with third-party integrations
  • Delivering production-grade applications once PoCs are validated
  • Creating solutions that enable data scientists and business analysts to be as self-sufficient as possible
  • Finding new ways to leverage Gen AI applications and the underlying vector and graph data stores
  • Designing datasets and schemas for consistency and easy access
  • Contributing to data technology stacks, including data warehouses and ETL pipelines
  • Building data flows for fetching, aggregating, and modeling data using batch and streaming pipelines
  • Documenting design decisions before implementation

Requirements

What's important for us?

  • At least 5 years of professional experience in a data-related role
  • Undergraduate or graduate degree in Computer Science, Engineering, Mathematics, or similar
  • Expertise in Python and SQL
  • Experience with data warehouses (Snowflake)
  • Experience with different types of database technologies (RDBMS, vector, graph, document-based, etc.)
  • Expertise in the AWS stack and services
  • Proficiency in using Docker
  • Experience with infrastructure-as-code tools, like Terraform
  • Great analytical skills and attention to detail - asking questions and proactively searching for answers
  • Excellent command of spoken and written English, at least C1
  • Creative problem-solving skills
  • Excellent technical documentation and writing skills
  • Ability to work with both Windows and Unix-like operating systems as the primary work environments

You will score extra points for:

  • Experience integrating LLMs (OpenAI as well as others, including open-source models)
  • Understanding of LLM fine-tuning, embeddings, and vector semantic search
  • Experience with Pinecone or Neo4j
  • Familiarity with data visualization in Python using either Matplotlib, Seaborn or Bokeh
  • Proficiency in statistics and machine learning, as well as Python libraries like Pandas, NumPy, Matplotlib, Seaborn, scikit-learn, etc.
  • Experience in building ETL processes and data pipelines with platforms like Airflow or Luigi
  • Knowledge of any Python web framework, like Django or Flask with SQLAlchemy
  • Experience operating within a secure networking environment, such as behind a corporate proxy
  • Experience working with a repository manager, for example JFrog Artifactory

Benefits

What do we offer?

  • Working alongside a talented team of software engineers who are changing the image of Poland abroad
  • Culture of teamwork, professional development and knowledge sharing (https://www.youtube.com/user/sunscraperscom)
  • Flexible working hours and the possibility of remote work
  • Comfortable office in central Warsaw, equipped with all the necessary tools for conquering the universe (MacBook Pro/Dell, external screen, ergonomic chairs)

Sounds like a perfect place for you? Don’t hesitate to click apply and submit your application today!

About the job

Job type: Full Time

Experience level: Senior

Hiring timezones: Poland +/- 0 hours

About Sunscrapers


Sunscrapers is a software development company dedicated to assisting clients in their growth and innovation by harnessing the power of data-driven software. Established in 2010 and with its headquarters in Warsaw, Poland, the company was founded on the vision of merging the skills and talents of Polish engineers with a Western business mindset, creativity, and entrepreneurial spirit to deliver world-class software solutions. The company operates under the guiding principles of ambition, technical excellence, and trust-based partnerships. Their core service offerings encompass data engineering, which includes data sourcing, processing, and storage; DevOps, focusing on the establishment of modern cloud infrastructure; and custom software development, which involves the design and creation of robust web applications. These services primarily utilize Python, JavaScript, and AWS technologies.

Over the years, Sunscrapers has collaborated with more than 60 clients globally, ranging from Fortune 500 enterprises and small to medium-sized businesses (SMBs) to startups and scaleups. Client satisfaction is a key metric for the company, with an average rating of 4.9 out of 5 and an average partnership duration of five years, often extending longer. Sunscrapers is also an active participant in the developer community, operates an in-house R&D lab, and prides itself on employing highly talented and experienced Polish engineers. The company emphasizes a culture of teamwork, professional development, and knowledge sharing. Their approach combines business-savvy software craftsmanship with agile project management to build solutions that adhere to the highest industry standards. Sunscrapers aims to empower finance and healthcare leaders by leveraging cutting-edge software, data, and AI, designing and building AI-powered data platforms and web applications tailored to each client's unique requirements.
