UKG
Production Reliability & Application Behavior Responsible for reliability outcomes across a large, heterogeneous application portfolio, including availability, performance, scalability, and recoverability Ensure applications meet defined reliability expectations as they operate on both on-prem and cloud platforms Lead and participate in major incident response, acting as a senior escalation point and ensuring effective executive communication Drive post-incident learning and systemic improvements to reduce repeat issues Establish standards for operational readiness, release safety, capacity planning, and disaster recovery across platforms Apply Site Reliability Engineering principles pragmatically across both legacy and cloud-native systems, including: Ensure SRE practices are consistent in intent but adapted in implementation across different technologies and environments Lead and develop SRE managers and engineers across a global organization Inherit existing teams and improve clarity of ownership, execution discipline, and engagement Hire and develop senior SRE leaders capable of operating across both cloud and enterprise platforms 10+ years of experience in software engineering, systems engineering, SRE, or related disciplines Proven experience leading established, globally distributed engineering organizations Strong understanding of production systems and application behavior at scale Experience operating and leading teams across hybrid environments (on-prem and public cloud) Demonstrated ability to influence outcomes in a matrixed enterprise environment Experience owning incident response, operational reviews, and executive-level communication Excellent communication skills, with the ability to clearly articulate technical and operational concepts to varied audiences Experience supporting large-scale application portfolios across both Windows/.NET and cloud-native environments Familiarity with Google Cloud Platform and enterprise-scale cloud operations Strong understanding of observability practices across application, platform, and infrastructure layers Prior experience partnering closely with Product, Infrastructure, and Cloud leadership
Verified Listing
This role has been verified for authenticity, market-rate compensation, and remote eligibility.
Get the latest updates on AI-powered hiring, career growth, and technical deep-dives delivered to your inbox.