In this role, you will operate within multi-cloud environments, primarily AWS and Alibaba Cloud. Your responsibility will be to ensure our online platforms—used by millions daily—are reliable, fast, and scalable, particularly during peak traffic and breaking news events.
Key responsibilities
- Design and manage cloud infrastructure across AWS and Alibaba Cloud, ensuring high availability and cost-efficiency
- Build and maintain reliable systems by setting performance targets (SLIs/SLOs), managing incidents, and minimizing downtime
- Use Infrastructure as Code (IaC) tools like Terraform or CloudFormation to create consistent and manageable environments
- Develop automated deployment pipelines to speed up releases and reduce errors
- Optimize system performance to handle large traffic spikes smoothly
- Set up monitoring, logging, and security measures to keep systems secure and transparent
- Lead and mentor junior engineers, make key technical decisions, and promote best practices for operational excellence
Candidate profile
- At least 5 years’ experience in DevOps, SRE, or cloud architecture
- Deep knowledge of AWS (EC2, EKS, Lambda, RDS) and Alibaba Cloud services (ECS, ACK, OSS, SLB) Experience with CDN solutions is a plus
- Hands-on experience with IaC tools like Terraform or Ansible
- Strong skills with CI/CD tools such as Jenkins, GitLab CI, or GitHub Actions
- Proficiency in container orchestration (Kubernetes) and observability tools (Prometheus, Grafana)
- Experience working in online media, news, or high-traffic digital platforms is highly preferred
- Fluent in Cantonese and English; Mandarin skills are a bonus for regional collaboration
What’s next
Shape the future of cloud reliability and scalability for high-traffic digital platforms. Apply now!