LanceSoft, Inc.
Pay Range: CAD 40-45/hr
Responsible for developing and leading the companys enterprise observability and reliability capability. The SRE and Observability Lead will collaborate across multiple teams to ensure comprehensive monitoring of all environmental components. This role will designate Dynatrace as the system of record for platform health and apply SRE practices to improve availability| performance| and incident outcomes across applications| infrastructure| and integrations.A Typical Day
Own enterprise observability using Dynatrace across cloud| on-prem| ERP| WMS| e
Commerce| APIs| and integrations
Design service topology| dashboards| alerts| and health indicators that reflect business impact
Apply SRE principles (SLIs| SLOs| error budgets where appropriate) to reduce incidents and improve resilience
Accelerate incident detection and root-cause analysis lead post-incident reviews focused on systemic fixes
Identify reliability| performance| and capacity risks before they impact the business
Define observability and SRE standards and enable teams to use them effectively
To Land This Opportunity
You have 5 years in infrastructure| platform| operations| or reliability engineering
You demonstrate hands-on experience implementing and operating Dynatrace
You have a strong understanding of distributed systems| cloudhybrid environments| and integrations
You have practical experience with SRE or reliability engineering concepts
Youre comfortable operating in high-impact incident and production environments
Verified Listing
This role has been verified for authenticity, market-rate compensation, and remote eligibility.
Get the latest updates on AI-powered hiring, career growth, and technical deep-dives delivered to your inbox.