Himalayas logo
Xenon7XE

Site Reliability Engineer (SRE)-Mobile and Internet Platform

Xenon7 is a consulting and professional services company specializing in integrating Artificial Intelligence (AI) and Machine Learning (ML) into businesses, offering services that leverage cutting-edge technology to optimize operations, enhance customer experiences, and drive innovation.

Xenon7

Employee count: 11-50

Germany only

Stay safe on Himalayas

Never send money to companies. Jobs on Himalayas will never require payment from applicants.

Description

About us:

Where elite tech talent meets world-class opportunities!

At Xenon7, we work with leading enterprises and innovative startups on exciting, cutting-edge projects that leverage the latest technologies across various domains of IT including Data, Web, Infrastructure, AI, and many others. Our expertise in IT solutions development and on-demand resources allows us to partner with clients on transformative initiatives, driving innovation and business growth. Whether it's empowering global organizations or collaborating with trailblazing startups, we are committed to delivering advanced, impactful solutions that meet today’s most complex challenges.

About the Client:

Join one of Egypt’s premier financial institutions, renowned for its extensive suite of banking services, including Institutional Banking, Personal Banking, and Islamic Banking. With a global presence through over 50 branches and correspondents, we serve a diverse and dynamic clientele. As we embark on a groundbreaking digital transformation journey, we are committed to leveraging the latest technologies to establish a state-of-the-art data architecture that will redefine our performance and service delivery.

Requirements

Position Overview

The Site Reliability Engineer (SRE) is responsible for ensuring the stability, performance, and reliability of Bank's critical applications, particularly Mobile Banking and Internet Banking platforms. This role bridges development and operations teams, implementing automation solutions, monitoring system health, and providing 24/7 operational support to maintain seamless banking services for customers on on-premise infrastructure.

Key Responsibilities

  • Monitor and maintain the reliability and performance of Mobile Banking and Internet Banking applications using Prometheus and Grafana dashboards
  • Manage and support OpenShift/Kubernetes infrastructure for containerized banking applications on on-premise servers
  • Respond to and resolve production incidents with minimal mean time to resolution (MTTR)
  • Implement and maintain centralized logging solutions using ELK Stack (Elasticsearch, Logstash, Kibana) for application troubleshooting
  • Develop and execute runbooks and automation scripts to reduce manual operational toil in OpenShift environments
  • Provide 24/7 production support and on-call rotation for critical banking services
  • Analyze logs and metrics from Prometheus and EFK to identify performance bottlenecks and reliability issues
  • Conduct root cause analysis (RCA) on incidents and implement preventive measures
  • Optimize Kubernetes/OpenShift deployments, pod management, and resource allocation on-premise
  • Implement alerting strategies and threshold management in Prometheus and Grafana
  • Support infrastructure scaling, capacity planning, and load balancing in production environments
  • Implement security best practices and compliance requirements for financial systems in containerized environments
  • Manage on-premise data center infrastructure and server resources
  • Document operational procedures, troubleshooting guides, and create knowledge base articles

Qualifications

  • BSc in Computer Science, Information Technology, Software Engineering, or related field
  • 2+ years of hands-on experience in SRE, DevOps, or Production Engineering roles
  • Hands-on experience supporting production applications in Kubernetes/OpenShift environments
  • Strong experience with OpenShift container platform administration and troubleshooting on on-premise infrastructure
  • Proficiency with Prometheus for metrics collection and monitoring
  • Proficiency with Grafana for dashboard creation and visualization
  • Experience with ELK Stack (Elasticsearch, Logstash, Kibana) for centralized logging
  • Strong understanding of Linux/Unix operating systems and networking fundamentals
  • Practical experience with CI/CD tools and automation frameworks
  • Proficiency in at least one programming/scripting language (Python, Go, or Bash)
  • Experience with database management (SQL and NoSQL) on-premise
  • Excellent troubleshooting and analytical skills for production support
  • Strong communication skills and ability to work in cross-functional teams
  • Experience in 24/7 production support environments
  • Experience with on-premise data center infrastructure management
  • Previous experience in financial services or banking sector is a plus

About the job

Apply before

Posted on

Job type

Full Time

Experience level

Senior

Location requirements

Hiring timezones

Germany +/- 0 hours

About Xenon7

Learn more about Xenon7 and their company culture.

View company profile

At Xenon7, the core belief is in the transformative power of Data, AI, and ML. The company is dedicated to guiding organizations through every stage of their data journey, whether they are just beginning to understand the value of data-driven strategies or are looking to implement advanced AI solutions. Xenon7 positions itself as an 'inferno where skill, dedication, and passion run together,' inviting curious minds passionate about pushing technological boundaries to join their mission. They aim to empower businesses to navigate the complexities of AI with confidence, revolutionizing how organizations approach AI challenges by leveraging intelligent solutions to unlock new possibilities. The company emphasizes applying AI ethics and Infosec Regulations and Principles in all its endeavors, ensuring responsible and secure technological advancements.

Xenon7's culture is built on values of integrity, collaboration, and a relentless pursuit of excellence, which guide every decision. Their teams are a cooperative practice of AI scientists and business leaders, blending expertise from diverse disciplines to tackle complex challenges with creativity, agility, and a commitment to continuous improvement. They work with leading enterprises and innovative startups on cutting-edge projects across various IT domains, including Data, Web, Infrastructure, and AI. This collaborative approach, combined with a visionary mindset, allows Xenon7 to deliver advanced, impactful solutions that meet today's most complex challenges and exceed client expectations. They operate free from the bloat and hierarchical structure of legacy consulting firms, enabling them to help clients make better human and technology decisions and ethically achieve more with less. Xenon7 encourages individuals, whether professors, technologists, scientists, or those brimming with creative potential, to explore career opportunities at the forefront of AI innovation and contribute to shaping the future.

Claim this profileXenon7 logoXE

Xenon7

View company profile

Similar remote jobs

Here are other jobs you might want to apply for.

View all remote jobs

10 remote jobs at Xenon7

Explore the variety of open remote roles at Xenon7, offering flexible work options across multiple disciplines and skill levels.

View all jobs at Xenon7

Remote companies like Xenon7

Find your next opportunity by exploring profiles of companies that are similar to Xenon7. Compare culture, benefits, and job openings on Himalayas.

View all companies

Find your dream job

Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!

Sign up
Himalayas profile for an example user named Frankie Sullivan
Xenon7 hiring Site Reliability Engineer (SRE)-Mobile and Internet Platform • Remote (Work from Home) | Himalayas