Job Description:
Flexential is hiring a Platform Engineering leader in the IT organization to plan roadmaps, establish requirements, develop and operationally manage platform technologies including Observability, DevOps, ITSM and Integrations. Current platform initiatives include building a next-gen OpenTelemetry observability platform for 40+ data center facilities and platforms using a LGTM stack (Loki, Grafana, Tempo, Mimir); and enabling secure high-velocity SDLC capability enabling paved pathways, engg excellence measurements and devsecops across multiple development teams.This role sits at the intersection of engineering management and hands-on technical work. You will lead a team of platform engineers, create/capture requirements, establish and own technical planning and implementation, and be accountable for platform reliability, security, and delivery timelines. This is a high-visibility, high-impact role — the platforms you build will be foundational to Flexential's IT services, as well as enable AIOps and AI infrastructure.
Key Responsibilities and Essential Job Functions:
Lead the design, development, deployment and operational management of automated, resilient, high availability, self-healing, secure platforms with native-AI capabilities for IT needs, serving both internal as well as customer business capabilities.
Lead, Build and manage the Platform Engineering teamand function— hiring, mentoring, performance management, and technical roadmap ownership.
Plan, build and operate an OpenTelemetry Observability platform with technologies including Grafana, Mimir, Loki, Tempo, Alertmanager on Kubernetes/RKE2 using Helm and ArgoCD.
Build an automatedfederatedObservabilityEdge Stack — Prometheus +OTelcollector nodes deployed persiteandZabbix auto-discovery configuration and Prometheus scrape profile library for 10+ device classes (Cisco, Juniper, Dell, NetApp, etc.).
Design,developand manageengineering lifecycleplatforms for high-velocity secure SDLCusing Gitlab and similar / related technologies.
Buildand operateiaCand CI/CDplatformsincludingGitLab CI/CD, Terraform, Ansible AWX, Helm, andArgoCDforautomatedprovisioning and application deployment.
Own, enhance andoperatecriticalITplatform technologiese.gBoomifor integrations, AWS for Cloud environments,includingtheirhostedinfrastructure.
Establish and enforce platform security posture:secretsmanagement via CyberArk/Conjur, RBAC,mTLS,compliance boundary design, and zero inbound telemetry architecture.
Build and integrateITSM capabilitiesforvarious platformse.gautomated incident creation,CIenrichment, and CMDB correlation
Defineand implementextensibility patternsincluding AIOps:e.ganomaly detection hooks, event correlation pipeline design, and integration with future ML/AI tooling.
Partner with other IT and business teams for App Dev, requirements capture, delivery validation and integrationneeds.
Represent platform engineering in cross-functional architecture reviews and executive-level program updates.
Perform othermanagement and technicalduties asrequiredand assignedfor team and operational resiliencee.gteam building,on -callrotation,etc
Travelmayberequiredto team or project events
Required Qualifications
12+ years ofrelevanttechnicalexperiencewith 4+ years in a management(or Principal-level)role leadingaengineeringteam
DevOps / Platform Engineering - 8+ years, End-to-end ownership of developer/infrastructure platforms; Kubernetes, Helm,ArgoCD, service-mesh,containerized workloads
GitOps/ CI-CD-5+ yearsGitLab CI/CD, pipeline authoring, infrastructure-as-code delivery
8+ years of expert level automationframeworksexperience with Python, Terraform, Ansible,etc.
Infrastructure (Linux/VM) - 8+ years Linux systems administration, VM lifecycle (VMware vCenter/VCF),Netappstorage and compute provisioning
Working knowledge ofNetworking -3+ years, TCP/IP, BGP/OSPF, SNMP protocol
AI tooling–Strong understanding(or1+yearsexperience) withMCP, Agentic workflows,SRE workflowse.gAIOpsforAnomaly detection, event correlation, alert noise reductionon Prometheus and Grafana stack
Experience withSecrets & Security -4+ years, CyberArk,Conjur, Vault, or equivalent; RBAC design, compliance boundary architecture
Engineering Management -4+ years, Hiring, team building, performance management, roadmap ownership for teams of5+engineers
Other training and experience may be substituted for the job requirements at the discretion of the manager
Preferred Skills
Hands-onexperience or working knowledge of Boomi integrations PaaS(iPaaS) technologies
Experiencewithdesign anddevelopmentof DR test application/automation and process workflows forcorporateBCPexecution.
Hands-on experience working with AWS products in a Well-architected Framework and multi-account model to develop various compute, storage, networkiaaSand PaaS services for IT applications.
Hands-on experience working with BAS / BMS systems in a Datacenter / OTenvironment.
Base Pay Range: Annualized salary range offered for this position is estimated to be $170,000 - $200,000. However, the actual pay range depends on each candidate’s experience, location, and qualifications.
Variable Pay: Discretionary annual bonus, based on personal and company performance.
Not meeting every single requirement? No problem! We are looking for candidates who possess unique skills that set them apart from the rest. If you're enthusiastic about this role and believe you have the skills and abilities that would make you successful, don't hesitate to apply today!
Benefits of working at Flexential:
• Medical, Telehealth, Dental and Vision
• 401(k)
• Health Savings Accounts (HSA) and Flexible Spending Accounts (FSA)
• Life and AD&D
• Short Term and Long-Term disability
• Flex Paid Time Off (PTO)
• Leave of Absence
• Employee Assistance Program
• Wellness Program
• Rewards and Recognition Program
Benefits are subject to change at the Company's discretion.
Flexential participates in the E-Verify program. Please click here for more information.
EEOC Statement: Flexential is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity or expression, pregnancy, age, national origin, disability status, genetic information, protected veteran status, or any other characteristic protected by law.
