Daniel Frank
@danielfrank
Senior Systems Engineer specializing in reliability, observability, and automation.
What I'm looking for
I’m a Senior Systems Engineer focused on turning complex infrastructure into dependable, observable systems through automation. I architected a Dynatrace SaaS migration for 300+ VMs, 100+ Lambdas, and 10 ROSA clusters to deliver enterprise-wide observability.
I’ve built CI/CD-integrated monitoring configuration using Terraform (and Monaco), and I create Python automation to reduce operational cost and streamline reliability work. I’ve also engineered automated remediation workflows for deployment failures to reduce MTTR and improve production reliability across AWS environments.
As a Reliability/Observability Engineer Lead, I established golden signals, improved MTTD and MTTR with APM-driven alerting and documented playbooks, and ran chaos engineering experiments with Gremlin. I train teams on Dynatrace and Splunk monitoring, define error budgets, and deliver RCA and architecture deep dives so product teams and leadership can act quickly and confidently.
Experience
Work history, roles, and key accomplishments
Senior Systems Engineer
Delta Air Lines
Oct 2024 - Present (1 year 8 months)
Architected Dynatrace SaaS migration for 300+ VMs, 100+ Lambdas, and 10 ROSA clusters to deliver enterprise-wide observability. Automated monitoring configuration via CI/CD, built Python remediation to disable monitoring on inactive clusters, and reduced MTTR through automated remediation workflows.
Site Reliability Engineer
Center for Medicare & Medicaid Services (VITG)
Feb 2024 - Oct 2024 (8 months)
Led RCA deep dives and observability consulting across CMS cloud product teams. Deployed and configured Datadog APM, and advocated SRE practices including golden signals, SLAs, error budgets, and blameless post-incident reviews.
Improved reliability for tier I and II applications by establishing golden signals, reducing MTTD by up to 50% with automated APM alerting, and reducing MTTR with documented playbooks. Conducted chaos engineering with Gremlin, trained SRE teams on Dynatrace and Splunk, and built deployment pipelines.
Developed 50+ REST APIs in C# on the ASP.NET platform to support automation of software security scanning and compliance. Built containerized dev environments, implemented JWT authentication between microservices, designed ScyllaDB data models, and integrated ELK logging via Serilog.
Provided Tier III support for full stack web applications on Google Cloud Platform, troubleshooting errors and data anomalies and reporting weekly metrics. Managed monitoring for performance and SLAs, supported applications using C#/TypeScript/JavaScript, and decommissioned unused VMs and pods.
Provided technical consultations to business and hosting customers, supporting WordPress, MySQL, JavaScript, server APIs, FTP, and Apache. Performed UX research and QA analytics, reported bugs to Jira, and administered web hosting accounts on FreeBSD and Red Hat instances in AWS.
Provided IT support for hardware and software for business clients, diagnosing and repairing POS terminals, printers, scan guns, card readers, and signature capture devices. Monitored backup power supplies and maintained network traffic load balancing.
Managed inventory and bill of materials data via a Unix-based order processing system and SSH. Tested data integrity, managed SQL Server schemas, and prepared databases for migration from OpenEdge to Microsoft SQL Server while leading weekly scrum meetings with offshore engineering.
Seasonal Product Support
Garmin International
May 2016 - Sep 2016 (4 months)
Responded to customer inquiries for Garmin fitness products and provided B2B support services for fitness watches. Diagnosed and resolved technical issues, escalating tickets to product engineers and providing operational support via phone and email.
Web Developer
Western Oregon University
Nov 2014 - Jun 2015 (7 months)
Migrated WOU websites from legacy systems to WordPress and taught web designers the platform. Designed and managed WordPress sites and social media channels used to coordinate student media publications.
IT Support Team Member
Bandon School District
Sep 2006 - Jun 2010 (3 years 9 months)
Responded to service request tickets and supported district network administration. Performed routine maintenance on Open Enterprise Server and deployed Windows workstations across the district.
Computer Restoration Technician
Free Geek
Sep 2006 - Jun 2010 (3 years 9 months)
Restored damaged and old computers for donation and installed Ubuntu and educational software. Tested hardware for functionality prior to deployment for educational institutions.
.NET Developer Intern
Charles Schwab & Co
Built business software solutions using C# and VB.NET for financial derivatives trading in an Agile environment. Migrated applications from .NET 1.1 to .NET 4.0 and developed server-side apps targeting SQL Server 2012 on Windows Server 2008 using TFS.
Education
Degrees, certifications, and relevant coursework
Western Oregon University
Bachelor of Science, Computer Information Systems
2011 - 2016
Earned a Bachelor of Science in Computer Information Systems from Western Oregon University. Completed a minor in Mathematics.
Tech stack
Software and tools used professionally
Postman
Splunk
Google Cloud Platform
GitHub
GitLab
Kubernetes
Jenkins
CircleCI
DB
MySQL
Microsoft SQL Server
Cassandra
Gmail
Spring Boot
.NET Core
.NET
Terraform
Jira
JavaScript
Java
ASP.NET
PowerShell
Serilog
Kafka
RabbitMQ
xMatters
Ubuntu
Windows
Windows Server
FreeBSD
Datadog
Elasticsearch
WordPress
Ansible
SQL
OpsGenie
Gremlin
Harness
Dynatrace
Bash
Depot
Remote
Bend
Amazon Q
Blameless
