Reliability Engineer

18 hours ago


Singapore NE Digital Full time

COMPANY DESCRIPTION

NE Digital is the digital, data and technology organization that serve as a center of excellence to drive digital transformation for our group of NTUC Social Enterprises to meet the critical social needs of Singapore's community. Delivering innovative products and solutions, we empower our people to lead a better and meaningful life through digital services in the area of daily essentials, health and community care, childcare and education as well as financial services.

**The Team**

We believe that diversity is key to driving an innovative, cohesive, productive and fun workplace Hence, at NE Digital our people join us from all around the world. Be sure to be soaked in an environment with different ethnic groups driving innovation and injecting some creative juice as one

Contributing to a social purpose through technology, our team of passionate and dedicated folks are spread into different social enterprises such as NTUC Fairprice Group, NTUC First Campus, NTUC Health and among others

**Creating technologies that impacts**

DESIGNATION : Reliability Engineer

RESPONSIBILITIES

NE Digital is currently hiring for Reliability Engineer to join Digital Product Development

organization. The team combines software and system engineering to architect and run large-scale,

distributed, and fault-tolerant systems. The primary team’s goal is to ensure sustainably achieve

product reliability through software engineering practices, architecture patterns, culture

embracement, process standardization, automation framework, education, and sharing. The

team practices industry reliability frameworks such as Service Level Objectives (SLOs) and

Service Level Indication (SLIs), release engineering, IaC, and operations automation. The team

will empower our product developers in the Product Development Life Cycle to ensure product

reliability, it is not limited to building self-serve tools/processes, and an infrastructure foundation

that allows the product team to constantly deliver a high-reliability system.

mindset or a highly skilled system administrator with knowledge of programming and operations

automation. You must be the person who likes to solve complex problems with simplicity in

mind, work around the clock to ensure system reliability, enjoy collaborating with other teams to

embrace reliability discipline and frameworks.

As a Reliability Engineer in NE Digital, you have the opportunity to manage the complex

challenges of the Social Enterprise System that are unique to NE Digital, while using your

expertise in coding, algorithm, complexity analysis, and large-scale system design.

You will be reporting to the Architecture & Reliability Lead.
- Work with product developers to ensure that the software delivery pipeline is as reliable
- as possible.
- Responsible to drive practices that ensure reliability of the product.
- Collaborate closely with product developers to ensure that the designed solution
- responds to non-functional requirements such as availability, performance, security, and
- maintainability.
- Responsible for availability, latency, performance, efficiency, monitoring, emergency
- response, and system capacity planning.
- To improve the whole lifecycle of services from inception and design, through
- deployment, operation, and refinement.
- Support services before they go live through activities such as system design consulting,
- developing software platforms and frameworks, system capacity planning and
- post-mortems.
- Maintain services once they are launched by measuring and monitoring availability,
- latency, and overall system health.
- Scale systems sustainably through mechanisms like automation; evolve systems by
- pushing for changes that improve reliability and velocity.
- Practice sustainable incident response and blameless postmortems.
- Documenting “tribal” knowledge.
- Advocate for Reliability Engineering practices

QUALIFICATIONS
- Experience in analyzing and troubleshooting systems.
- Understanding of Infrastructure monitoring, logging, alerting release, and configuration
- management.
- Understanding of networking (e.g. TCP/IP, routing, network topology, load balancers,
- DNS, NTP).
- Experience in one of the following: Python, Java, Go, Perl, Ruby, or shell scripting.
- Experience in Public Cloud, AWS, and/or GCP.
- Experience with software deployment and/or orchestration technologies, e.g., Puppet,
- Chef, Salt, Ansible, Docker, Kubernetes, Terraform.
- Experience in CI/CD (e.g., JIRA, Git, Jenkins, Nexus,...)
- Experience in standard IT security practices (e.g., encryption, certificates, key
- management)
- Excellent communication, and problem-solving skills with strong attention to detail.
- Flexibility to work non-business hours that may include weekends and/or holidays
- Self-starter who is able to identify and perform tasks with mínimal supervision


  • Reliability Engineer

    18 hours ago


    Central Singapore Chevron Full time

    All interested applicants, please read the Data Privacy Notice Responsibilities for this position may include but are not limited to: - Facilitates & stewards the roll out of global reliability initiatives, such as Facility Integrity & Reliability Management (FIRM), within SMP by engaging cross functional stakeholder - Responsible for plant reliability KPI...


  • Singapore GLOBALFOUNDRIES Full time

    **About GlobalFoundries** **Introduction** **Your Job** - SRAM/Flash/NVM/OTP/MTP/eFUSE/CPI reliability setup & analysis, and handle PRM (Periodic reliability monitoring) - Work with customer/vendor to design & bring in hardware & software for reliability characterization. - Establish wafer and/or package level test methodologies and test program. - Support...


  • Singapore Amazon Asia-Pacific Resources Private Limited (Singapore) Full time

    Bachelor's or Master’s degree in Reliability Engineering, Physics, Electrical, Mechanical or Materials Engineering or related field - 6+ years of Reliability Engineering work experience in high reliability industry - 4+ years experience with failure analysis activities and root cause analysis - 4+ years experience with accelerated life testing, stress...

  • Reliability Engineer

    2 weeks ago


    Singapore Cognizant Full time

    **About the role** The Reliability Engineer ensures stability of the manufacturing plant, systems health, lifecycle management, user satisfaction. Prioritizing digital capabilities and infrastructure's reliability, performance, and efficiency is a must. All employees involved in the development and maintenance of these services must work collaboratively to...

  • Reliability Engineer

    2 weeks ago


    Singapore Cognizant Full time

    **About the role** The Reliability Engineer ensures stability of the manufacturing plant, systems health, lifecycle management, user satisfaction. Prioritizing digital capabilities and infrastructure's reliability, performance, and efficiency is a must. All employees involved in the development and maintenance of these services must work collaboratively to...


  • Singapore Flowserve Full time

    Flowserve is presently recruiting a Reliability Engineer to support our Innovation & Product Development initiatives reporting to the Director Application Development & Emerging Technologies. This is an opportunity to take a lead role on supporting engineering design and consultancy activities on all aspects related to technical risks and reliability...

  • Associate Engineer

    1 week ago


    Singapore Systems on Silicon Manufacturing Co. Pte. Ltd. Full time

    **Responsibilities**: - Reliability Engineering and Analysis - Conduct process reliability tests, analysis and reliability risk assessments - Perform reliability monitoring and resolve monitoring issues - Lead by engineers on reliability qualification projects and to resolve qualification issues - Support lab operations **Requirements**: - Diploma in...


  • Singapore Systems on Silicon Manufacturing Co. Pte. Ltd. Full time

    Position Detail - Reliability Engineer/ Senior Engineer- Posting Date : 03 Jul 2025 | Closing Date :01 Oct 2025_SSMC (Systems on Silicon Manufacturing Company Pte. Ltd.), is a Joint Venture between NXP and TSMC. We offer flexible and cost effective semiconductor fabrication solutions by maintaining fully equipped SMIF cleanroom environment, 100% equipment...


  • Singapore NodeFlair Full time

    **Job Summary**: **Salary** S$3,500 - S$6,800 / Monthly **Job Type** **Seniority** Mid **Years of Experience** At least 3 years **Tech Stacks** MODE **Purpose and Scope** The Reliability Engineer plays a crucial role in ensuring the optimal performance, availability, and lifespan of assets. The purpose of this role is to develop and implement...


  • Singapore ETEAM WORKFORCE PTE. LTD. Full time

    Position: Site Reliability Engineer (SRE) Work Mode - Onsite/Hybrid Timing - 9am to 6 pm Duration – 1 Year (Highly extendable) Salary: 6018 SGD Work Location: Robinson Road, Singapore About the Role We are looking for a seasoned Site Reliability Engineer (SRE) with 5+ years of experience to join our Platform Engineering team. This role is ideal for someone...