Shailesh Soni
@shaileshsoni
Site Reliability Engineer focused on automating cloud infrastructure for high availability and reduced toil.
What I'm looking for
I am a Site Reliability Engineer with over 13 years of experience designing, automating, and optimizing cloud infrastructure across AWS, GCP, and Azure. I specialize in Kubernetes, Terraform, observability (Datadog, Splunk, Prometheus), and building CI/CD pipelines to improve reliability, reduce MTTR, and drive SRE best practices.
I have delivered end-to-end observability solutions, defined and managed SLOs/SLIs/Error Budgets, implemented DR and resiliency patterns, and automated provisioning with Terraform and scripting in Python, Bash, and PowerShell. I thrive in Agile teams, prioritize reducing operational toil, and continuously up-skill in cloud, AI, and automation to improve system resilience and delivery outcomes.
Experience
Work history, roles, and key accomplishments
Site Reliability Engineer
Genpact Headstrong Capital Markets
Jan 2021 - Present (4 years 11 months)
Designed and implemented end-to-end observability and SRE practices across infrastructure and applications, reducing MTTR and enforcing SLO/SLI/error budget management; automated provisioning and CI/CD to improve deployment velocity.
Sr. Site Reliability Engineer
Arrow Core Technologies Private Limited
Sep 2019 - Sep 2020 (1 year)
Built multi-cloud infrastructure across AWS/Azure/GCP, automated deployments with Terraform, and implemented SLO/SLI frameworks and DR solutions to improve resilience and operational stability.
Assistant IT Manager
Macmillan Publishers India Pvt. Ltd.
Mar 2016 - Sep 2019 (3 years 6 months)
Managed AWS and on-prem infrastructure, automated cloud operations with scripting to reduce manual effort, and ensured high availability for production and staging environments while managing Office 365 services.
Datacenter Server Engineer
Wipro Limited
Sep 2014 - Mar 2016 (1 year 6 months)
Managed cloud services and data center operations including server deployment, monitoring, and maintenance, supporting cloud implementations in AWS and Azure to meet operational SLAs.
System Administrator
ATOS India Pvt. Ltd.
Dec 2012 - May 2014 (1 year 5 months)
Administered Windows Server environments and managed ITIL-based incident and change processes, handling backup and storage solutions to meet SLA requirements.
Education
Degrees, certifications, and relevant coursework
CDAC ACTS Pune
Post Graduate Diploma, IT Infrastructure and Security
Completed a P.G. Diploma in IT Infrastructure Systems & Security focusing on infrastructure and security practices.
Bansal College of Engineering
Bachelor of Engineering, Electronics and Communication
Earned a Bachelor of Engineering in Electronics & Communication from R.G.P.V. University, Bhopal.
Tech stack
Software and tools used professionally
Availability
Location
Authorized to work in
Job categories
Interested in hiring Shailesh?
You can contact Shailesh and 90k+ other talented remote workers on Himalayas.
Message ShaileshFind your dream job
Sign up now and join over 100,000 remote workers who receive personalized job alerts, curated job matches, and more for free!
