
Site Reliability Engineer
6 days ago
Job Summary: We are seeking a Senior Site Reliability Engineer (SRE) with 10–15 years of proven experience in building, managing, and maintaining highly available, scalable, and secure infrastructure across multi-cloud and hybrid cloud environments—including on-premises data centers . The ideal candidate will have deep knowledge of SRE principles , strong hands-on experience in automation , observability , incident response , and infrastructure resilience , and the ability to architect solutions that span cloud and traditional data center environments. Key Responsibilities: Design, implement, and manage reliable and scalable systems across public clouds (AWS, Azure, GCP) and on-premises data centers . Apply SRE best practices —including SLIs, SLOs, error budgets, incident management, and postmortems —across cloud and non-cloud environments. Develop and maintain Infrastructure as Code (IaC) using tools like Terraform, Ansible, or CloudFormation. Drive automation for deployment, scaling, monitoring, and infrastructure management. Implement and enhance observability practices (monitoring, logging, tracing) using tools like Prometheus, Grafana, ELK, Datadog, New Relic, etc. Work with application teams to ensure high availability , performance , and cost optimization across hybrid environments. Lead and participate in on-call rotations and improve overall incident response processes. Collaborate with security and compliance teams to enforce best practices in data protection , access control, and system hardening in hybrid setups. Evaluate and recommend emerging tools and technologies for resilience engineering , disaster recovery , and infrastructure modernization . Required Qualifications: 10–15 years of experience in SRE, DevOps, or infrastructure engineering roles. Proven experience managing infrastructure in multi-cloud (AWS, Azure, GCP) and hybrid cloud/on-prem environments . Solid understanding of networking, load balancing, storage, virtualization, and container orchestration (Kubernetes, Docker). Strong scripting and programming skills (e.g., Python, Go, Bash). Experience with CI/CD pipelines , tools like Jenkins, GitLab CI, ArgoCD, etc. In-depth knowledge of SRE methodologies and real-world application of SLAs, SLOs, and error budgets. Hands-on experience with monitoring and observability stacks . Strong analytical and troubleshooting skills for production incidents across complex, distributed systems. #J-18808-Ljbffr
-
Site Reliability Engineer
2 weeks ago
Singapore RigNet Full timeAbout us One team. Global challenges. Infinite opportunities. At Viasat, we’re on a mission to deliver connections with the capacity to change the world. For more than 35 years, Viasat has helped shape how consumers, businesses, governments and militaries around the globe communicate. We’re looking for people who think big, act fearlessly, and create an...
-
Site Reliability Engineer
2 weeks ago
Singapore ABAXX SINGAPORE PTE. LTD. Full timeSite Reliability Engineer - Networking We are seeking competent candidate joining our Infrastructure Team for the mission building and operating MAS regulated marketplace and clearing house. This role is ideal for someone with a strong foundation in AWS services, infrastructure as code, and cloud security, who is passionate about building scalable, secure,...
-
Site Reliability Engineer
1 week ago
Singapore ABAXX SINGAPORE PTE. LTD. Full timeSite Reliability Engineer - Networking We are seeking competent candidate joining our Infrastructure Team for the mission building and operating MAS regulated marketplace and clearing house. This role is ideal for someone with a strong foundation in AWS services, infrastructure as code, and cloud security, who is passionate about building scalable, secure,...
-
Site Reliability Engineer
2 days ago
Singapore Abaxx Commodity Futures Exchange and Clearinghouse Full timeSite Reliability Engineer - Networking We are seeking a competent candidate joining our Infrastructure Team for the mission building and operating a MAS regulated marketplace and clearing house. This role is ideal for someone with a strong foundation in AWS services, infrastructure as code, and cloud security, who is passionate about building scalable,...
-
Site Reliability Engineer
6 days ago
Singapore NetEase Games Full timeOverview Join to apply for the Site Reliability Engineer role at NetEase Games . As a leading internet technology company based in China, NetEase provides premium online services centered around content creation and operates a broad gaming ecosystem. Job Description Site Reliability Engineering (SRE) refers to using software engineering methods to manage...
-
Site Reliability Engineer
6 days ago
Singapore NetEase Games Full timeOverview Join to apply for the Site Reliability Engineer role at NetEase Games . As a leading internet technology company based in China, NetEase provides premium online services centered around content creation and operates a broad gaming ecosystem. Job Description Site Reliability Engineering (SRE) refers to using software engineering methods to manage...
-
Site Reliability Engineer
2 weeks ago
Singapore Point72 Full timeJoin to apply for the Site Reliability Engineer role at Point72 About the role As part of Point72’s Technology Team, you will focus on developing and maintaining complex, distributed, real-time systems that support our Global Macro business. Your responsibilities will include optimizing operations through automation, building foundational SRE components,...
-
Site Reliability Engineer
1 week ago
Singapore Point72 Full timeJoin to apply for the Site Reliability Engineer role at Point72About the role As part of Point72’s Technology Team, you will focus on developing and maintaining complex, distributed, real-time systems that support our Global Macro business. Your responsibilities will include optimizing operations through automation, building foundational SRE components,...
-
Site Reliability Engineer
6 days ago
Singapore APPLE SOUTH ASIA PTE. LTD. Full timeSummary At Apple, new ideas have a way of becoming excellent products, services, and customer experiences very quickly. Bring passion and dedication to your job and there’s no telling what you could accomplish. The people here at Apple don’t just build products - they craft the kind of wonder that’s revolutionized entire industries. It’s the...
-
Site Reliability Engineer
6 days ago
Singapore APPLE SOUTH ASIA PTE. LTD. Full timeSummary At Apple, new ideas have a way of becoming excellent products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. The people here at Apple don't just build products - they craft the kind of wonder that's revolutionized entire industries. It's the diversity of...