People Prime Worldwide
About Company :
Our client is a trusted global innovator of IT and business services. They help clients transform through consulting, industry solutions, business process services, digital & IT modernization and managed services. Our client enables them, as well as society, to move confidently into the digital future. We are committed to our clients’ long-term success and combine global reach with local client attention to serve them in over 50 countries around the globe.
· Job Title: Cloud Observability SRE
· Location: PAN INDIA
· Experience: 5+ yrs
· Job Type : Contract to hire.
· Notice Period:- Immediate joiners.
Mandatory Skills:
This role requires working one shift aligned to US business hours, preferably 10 AM CST – 6 PM CST (+/- 2 hrs. before or after).
New Relic, Azure Cloud & Dev
Ops
The Cloud Observability SRE is responsible for implementing, maintaining, and evolving cloud observability services for Abbott’s Enterprise Cloud environments. This role operates within a Dev
Sec
Ops model, enabling continuous integration, deployment, and scaling of observability practices. The engineer will deliver end‑to‑end observability solutions including requirements gathering, design, integration using Infrastructure-as-Code, and operational support. The role also involves evaluating vendor technologies, participating in proofs of concept, and ensuring adherence to observability best practices with a focus on resiliency, scalability, security, and automation. The ideal candidate is self-driven, highly collaborative, an effective communicator, and passionate about innovation, cost-efficient solutions, and bringing new ideas to the team.
CORE JOB RESPONSIBILITIES
Deploy and configure observability platforms using best practices, reusable IaC templates, SOPs, and standardized configurations.
Build dashboards for performance, availability, service health, synthetic monitoring, alerting, and SLO/SLI management across applications, platforms, and infrastructure.
Apply AI/ML‑enabled observability tools and data-driven approaches to detect anomalies, generate predictive insights, and perform deep analysis on latency, reliability, error rates, MTTD, MTBF, MTTR, and other key metrics.
Provide SME-level support to cross-functional teams for incident triage and troubleshooting across the full cloud application/infrastructure stack.
Assist Dev
Ops teams with troubleshooting cloud-native applications and infrastructure issues using observability insights, including end‑user experience metrics.
MINIMUM EXPERIENCE / TRAINING / SKILLS
Bachelor’s degree in technology, information systems, or related discipline (preferred).
6+ years of Observability/Monitoring SRE experience.
AWS and/or Azure cloud certifications (required).
Certified Observability Expert (required).
MUST HAVE SKILLS
Monitor.
Now ITSM for ticketing and event correlation.
Strong knowledge of AWS/Azure IaaS/PaaS services including compute, storage, and networking.
Proficiency with Terraform (IaC), Ansible (Configuration Management), Python, JSON.
Experience with Git/Git
Hub Copilot, CI/CD platforms (Jenkins, Azure Dev
Ops), and Big Panda (Event Management).
Familiarity with cloud-native backup/recovery and site-recovery solutions for AWS/Azure.
Basic understanding of database platforms including AWS RDS, Azure SQL, PostgreSQL, MongoDB Atlas, Cosmos DB.
Exposure to RAG applications and LLM model evaluation platforms such as Azure OpenAI and AWS Bedrock.
PREFERRED (NICE-TO-HAVE) SKILLS
Experience with Jira (Scrum/Sprint Management) and Confluence (Documentation).
Understanding of Kubernetes, Docker, and CaaS platforms on Azure/AWS.
Verified Listing
This role has been verified for authenticity, market-rate compensation, and remote eligibility.
Get the latest updates on AI-powered hiring, career growth, and technical deep-dives delivered to your inbox.