Sam Merwin
@smerwin
Senior Site Reliability Engineer building automation and incident response tooling.
What I'm looking for
I’m a Technical Support and Infrastructure Engineer with 10+ years of hands-on experience across escalation support, incident management, and cloud platform operations. I’ve progressed from frontline systems administration through promotions at DigitalOcean to senior engineering and management roles at NS1 and FanDuel.
I’m at my best in high-severity moments: I build tooling that reduces resolution times, automate manual operational workflows, and serve as a calm, reliable escalation point during production incidents. I’ve also strengthened reliability by maintaining AWS infrastructure and improving proactive monitoring and alerting in Datadog.
Across DNS, DDI, and platform operations, I partner with application and customer-facing teams to diagnose performance bottlenecks, reproduce complex issues in lab environments, and accelerate troubleshooting. At NS1, I standardized incident response procedures and built custom REST API debugging and integration tooling to streamline both internal and customer workflows.
My work spans the full stack—from DNS and networking to Kubernetes orchestration—and I prioritize clear communication and dependable execution. Outside of employee roles, I continue providing hands-on IT consulting and advise clients on AI/ML toolchain integration for practical, adoption-ready outcomes.
Experience
Work history, roles, and key accomplishments
Proprietor
Low Key Internet Concerns
Nov 2021 - Present (4 years 8 months)
Provided hands-on IT consulting for small businesses, including code review, refactoring, and technical debt remediation. Advised clients on AI/ML toolchain integration and adoption of tools like Claude Code, Copilot, and MCP-based workflows.
Maintained reliability of AWS production infrastructure supporting a platform with millions of active users during peak sporting events. Built Datadog monitoring/alerting pipelines and participated in 24/7 on-call for critical P1/P2 incidents.
Senior Manager of Customer Success Eng
NS1
Mar 2022 - Dec 2022 (9 months)
Managed a team of customer success engineers responsible for incident response and escalation handling for enterprise DNS customers. Standardized incident response procedures and developed REST API debugging/integration tooling to accelerate troubleshooting workflows.
Senior Customer Success Engineer
NS1
Mar 2021 - Mar 2022 (1 year)
Served as primary escalation point for high-severity incidents involving managed DNS and DDI services for enterprise accounts. Reproduced customer-reported issues in lab environments and maintained documentation to reduce repeated escalations.
Customer Success Engineer II
NS1
Apr 2020 - Mar 2021 (11 months)
Triaged and resolved customer issues with managed DNS services as part of a 24/7 on-call support rotation. Provided DNS configuration guidance (including DNSSEC and traffic management) and contributed to platform operations supporting Nomad-based deployments and Docker pipeline maintenance.
Designed and built custom Kubernetes tooling to improve platform observability and reduce time-to-diagnosis for cluster issues. Operated and maintained large-scale Kubernetes clusters supporting thousands of customer workloads.
Built and maintained deployment orchestration and configuration management systems supporting millions of customer virtual machines. Developed automation to eliminate repetitive manual operational workflows, reducing human error and improving incident response times.
Provided 24/7 escalation support as part of the Linux administration team, diagnosing issues across compute, storage, and networking layers. Developed troubleshooting approaches for recurring issues and collaborated on documenting infrastructure patterns affecting customer reliability.
Junior DevOps Engineer
New York Public Radio (WNYC)
Feb 2014 - Oct 2014 (8 months)
Managed AWS infrastructure supporting live audio streaming, web properties, and archival systems for a public radio organization. Supported production infrastructure operations across these platforms.
Provided systems administration across three office locations, maintaining Windows and Linux server environments. Resolved cross-platform infrastructure issues.
Software Developer
Internap Network Services
May 2012 - Nov 2012 (6 months)
Developed PHP and JavaScript features for the Ubersmith datacenter management platform used for billing, ticketing, and device tracking. Contributed application features within the platform.
Development & Operations
Alley
Apr 2011 - Apr 2012 (1 year)
Built custom content management solutions using Drupal and managed deployment pipelines for client web properties. Supported delivery and deployment workflows for client sites.
Systems Administrator
Voxel Dot Net
Jun 2009 - Apr 2011 (1 year 10 months)
Provided frontline technical support and implementation services for managed hosting customers as the first point of contact for server, networking, and application issues. Onboarded new customers and translated hosting configurations into guidance for non-technical stakeholders.
Course Developer
Noble Desktop, LLC
Sep 2008 - May 2009 (8 months)
Developed and tested curriculum workbook exercises for training courses in web and print design software. Supported course content creation and validation through testing.
Education
Degrees, certifications, and relevant coursework
Skidmore College
Bachelor of Science in Graphic Design, Graphic Design
2003 - 2008
Earned a Bachelor of Science in Graphic Design at Skidmore College from 2003 to 2008.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Website
smerwin.comJob categories
Skills
Interested in hiring Sam?
You can contact Sam and 90k+ other talented remote workers on Himalayas.
Message SamGet matched with your dream remote job
Sign up now and join over 250,000+ remote workers who receive personalized job alerts, curated job matches, and more for free!
