Site Reliability Engineer

3 days ago


Singapur, Singapore Trulyyy Full time

Global Leader in Connectivity and Smart Technology

Responsibilities
  • Act as the technical Subject Matter Expert (SME) for the deployment and management of Microservices on Kubernetes-based cloud platforms.
  • Collaborate with Cloud Technical Development and DevOps teams to facilitate the deployment of services across Multi-Cloud Environments.
  • Conduct Load Testing and Chaos Engineering exercises to validate the scalability and resilience of microservices.
  • Develop observability solutions for Microservices and cloud platforms such as AWS, OCI, Azure, and GCP.
  • Create and implement Disaster Recovery plans in partnership with Development and DevOps teams.
  • Analyze and troubleshoot production risks stemming from resource limitations, including node groups, CPU, memory, HPA scheduling, JVM pre-warming, etc.
  • Write and maintain automation scripts using languages like Python, Go, or Bash.
  • Define and monitor KPIs (SLA/SLO/SLI) for all cloud microservices in collaboration with development teams to enhance business insights.
  • Produce and maintain comprehensive technical documentation, including architecture diagrams, design specifications, and operational procedures.
  • Lead incident response efforts to swiftly diagnose and resolve production issues.
  • Conduct post-incident reviews to identify root causes and recommend solutions or mitigations.
  • Support product and technology selection processes, including Proof of Concepts (POCs).
Requirements
  • Bachelor’s degree in Computer Science, Information Technology, or a related discipline.
  • At least 1 year of experience as a Site Reliability Engineer.
  • Proficiency in programming and scripting languages such as Java, Python, Bash, or PowerShell.
  • Practical experience in SRE, DevOps, cloud operations, and cloud security best practices.
  • Strong understanding of security technologies, including Identity and Access Management, Network Security, Application Security, and Data Protection.
  • Excellent problem-solving and analytical capabilities, with the ability to work both independently and collaboratively.
Seniority level
  • Mid-Senior level
Employment type
  • Full-time
#J-18808-Ljbffr

  • Singapur, Singapore IDEMIA Full time

    Join to apply for the Site Reliability Engineer role at IDEMIA Join to apply for the Site Reliability Engineer role at IDEMIA Get AI-powered advice on this job and more exclusive features. PurposeThis role plays a critical part in ensuring reliability, scalability, and performance of our systems and services. You will work closely with development and...


  • Singapur, Singapore Beijing Foreign Enterprise Management Consultants Co.,Ltd. Full time

    Direct message the job poster from Beijing Foreign Enterprise Management Consultants Co.,Ltd. On behalf of Huawei, a world-renowned information and communication technology company, we are seeking passionate and talented individuals to join our team as Site Reliability Engineer Overview On behalf of Huawei, a world-renowned information and communication...


  • Singapur, Singapore Point72 Full time

    Join to apply for the Site Reliability Engineer role at Point72 About the role As part of Point72’s Technology Team, you will focus on developing and maintaining complex, distributed, real-time systems that support our Global Macro business. Your responsibilities will include optimizing operations through automation, building foundational SRE...


  • Singapur, Singapore WeChat International Pte. Ltd. Full time

    Site Reliability Engineer page is loadedSite Reliability Engineer Apply remote type Onsite locations Singapore-CapitaSky time type Full time posted on Posted 30+ Days Ago job requisition id R Business Unit Technology Engineering Group (TEG) is responsible for supporting the company and its business groups on technology and operational platforms, as well as...

  • Site Reliability

    3 days ago


    Singapur, Singapore Canonical Full time

    Join to apply for the Site Reliability / Gitops Engineer role at Canonical 1 day ago Be among the first 25 applicants Join to apply for the Site Reliability / Gitops Engineer role at Canonical Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely...


  • Singapur, Singapore Apple Inc. Full time

    There is a lot that goes into building the most secure yet user-friendly devices in the world. We are a unique Software Development group with a charter to secure our platforms, which include iOS software, iOS Devices, and Mac. We build solutions that are used by our customers, engineering teams, and manufacturing environments.We are lookng for Site...


  • Singapur, Singapore IDEMIA Full time

    Join to apply for the Site Reliability Engineer role at IDEMIA Overview This role plays a critical part in ensuring reliability, scalability, and performance of our systems and services. You will work closely with development and operations teams to build and maintain robust infrastructure and tools that support high availability, monitoring and rapid...


  • Singapur, Singapore RigNet Full time

    About us One team. Global challenges. Infinite opportunities. At Viasat, we’re on a mission to deliver connections with the capacity to change the world. For more than 35 years, Viasat has helped shape how consumers, businesses, governments and militaries around the globe communicate. We’re looking for people who think big, act fearlessly, and create an...


  • Singapur, Singapore Tower Research Capital Full time

    Join to apply for the Site Reliability Engineer role at Tower Research Capital Join to apply for the Site Reliability Engineer role at Tower Research Capital Tower Research Capital is a leading quantitative trading firm founded in 1998. Tower has built its business on a high-performance platform and independent trading teams. We have a 25+ year track...


  • Singapur, Singapore AvePoint Full time

    Site Reliability Engineer (SRE) (GovTech) We are seeking a skilled and passionate Engineer to join our team to build and operate a Whole-of-Government (WoG) runtime platform.As a Site Reliability Engineer, you will be responsible for designing and operating GitLab, AWS and Kubernetes-based infrastructure and solutions that power our platform, to ensure the...