Current jobs related to Site Reliability Engineer Iii, Infrastructure - Singapore - NodeFlair


  • Singapore ByteDance Full time

    [About ByteDance] Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create...


  • Singapore Rapsys Technologies Full time

    **Experience**: 4+ Years **Location**: Changi, Singapore **Roles and Responsibilities**: 2. Set up and operate the server infrastructure and software (Linux, Elasticsearch, Logstash, Grafana, Kibana, Kafka, Nginx) based on bank’s security standards and industry’s security standards. 3. Perform continuous improvement for the platform covering areas...


  • Singapore JPMorgan Chase & Co Full time

    **JOB DESCRIPTION** **Job responsibilities** - Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate - Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines -...


  • Singapore beBee Careers Full time

    About the PositionThis Infrastructure Technician III role requires a skilled individual to contribute to the daily site operation. The ideal candidate will have a strong background in mission critical facilities operating/engineering or equivalent equipment experience.Key DutiesCreation and modification of site operating procedures.Contribution to change...


  • Singapore f5 Full time

    Everything we do centers around people. That means we obsess over how to make the lives of our customers, and their customers, better. And it means we prioritize a diverse F5 community where each individual can thrive. Role Overview: Join a growing team securing both leading-edge protection solutions and enterprise infrastructure. As a Senior Site...


  • Singapore Imperva Full time

    **Site Reliability Engineer**:** About the role** Imperva’s Infrastructure and Cloud team is looking for a highly technical Site Reliability Engineer to drive innovation, scale, and create operational excellence for the Imperva globally distributed network. As an SRE in the ICO organization, you approach solving, supporting, and optimizing the...


  • Singapore Retentia technology private limited Full time

    **3+ years of experience in Site Reliability Engineering, DevOps**, or a related field. - **Strong knowledge of cloud platforms (AWS, GCP, Azure) and containerization technologies (Docker, Kubernetes).** - Experience with automation and configuration management tools (e.g., T**erraform, Ansible, Chef, or Puppet).** - Proficiency in at least **one programming...


  • Singapore Experis Full time

    **About the Team** The Datacenter Infrastructure Engineering team supports the company's fast growth by building and operating hyperscale datacenters. The team manages the end to end lifecycle of server fleet, providing cloud solutions and various infrastructure services ensuring that they are scalable and are reliable. **Responsibilities** - Operate basic...


  • North-East Singapore PERSOLKELLY Full time

    The Site Reliability Engineer is responsible for ensuring the reliability, scalability, and efficiency of our systems and infrastructure. This role involves monitoring, troubleshooting, and resolving issues to maintain optimal performance. The engineer will also collaborate with cross-functional teams to automate processes and improve system reliability....


  • Singapore The Edge Asia Full time

    Our client is a US hedge fund and their Technology group is constantly improving the company’s IT infrastructure, positioning them at the forefront of a rapidly evolving technology landscape. They are a team of experts experimenting, discovering new ways to harness the power of open-source solutions, and embracing enterprise agile methodology. Their...

Site Reliability Engineer Iii, Infrastructure

2 weeks ago


Singapore NodeFlair Full time

**Job Summary**:
**Salary**
S$8,000 - S$16,000 / Monthly

**Job Type**

**Seniority**

Mid

**Years of Experience**
At least 5 years

**Tech Stacks**
AppDynamics GitLab Terraform Jenkins Datadog Dynatrace SOAP Puppet Grafana Prometheus Splunk Ansible

**Job responsibilities**
- Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate
- Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines
- Collaborates with technical experts, key stakeholders, and team members to resolve complex problems
- Understands service level indicators and utilizes service level objectives to proactively resolve issues before they impact customers
- Improve aspects of network products related to reliability related nonfunctional requirements such as logging, monitoring, observability, performance, scalability, capacity, resiliency, etc.
- Perform research and discovery on industry tools and lead build versus buy
- Collaborate with other network and software engineering teams to automate processes, reduce toil and modernize operations
- Participate in on-call rotation as an escalation contact for production issues
- Turn theory into practice, navigate through ambiguity to build a plan
- Accomplish common goals using SCRUM practices

**Required qualifications, capabilities, and skills**
- Bachelor’s degree in computer science or related fields
- Minimally 5 years of site reliability engineering or related experience
- Ability to contribute to large and collaborative teams by presenting information in a logical and timely manner with compelling language and limited supervision
- Ability to proactively recognize road blocks and demonstrates interest in learning technology that facilitates innovation
- Experience in observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others
- Familiarity with troubleshooting common networking technologies and issues
- Ability to initiate and implement ideas to solve business problems
- Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform
- Experience with one or more infrastructure automation technologies (Ansible, Terraform, Puppet, building APIs and services using REST, SOAP, etc.)

**Preferred qualifications, capabilities, and skills**
- Certifications in networking are a plus