Reliability Engineer
2 hours ago
COMPANY DESCRIPTION
NE Digital is the digital, data and technology organization that serves as a center of excellence to drive digital transformation for our group of NTUC Social Enterprises to meet the critical social needs of Singapore's community. Delivering innovative products and solutions, we empower our people to lead better and more meaningful lives through digital services in the areas of daily essentials, health and community care, childcare and education, as well as financial services.
**The Team**
We believe that diversity is key to driving an innovative, cohesive, productive and fun workplace. Hence, at NE Digital, our people join us from all around the world. Expect to be immersed in an environment where people of different ethnic backgrounds drive innovation and inject creative energy as one.
Contributing to a social purpose through technology, our team of passionate and dedicated folks is spread across different social enterprises such as NTUC Fairprice Group, NTUC First Campus and NTUC Health, among others.
**Creating technologies that impact**
DESIGNATION: Reliability Engineer
RESPONSIBILITIES
NE Digital is currently hiring a Reliability Engineer to join the Digital Product Development organization. The team combines software and system engineering to architect and run large-scale, distributed, and fault-tolerant systems. The team's primary goal is to achieve product reliability sustainably through software engineering practices, architecture patterns, a reliability-minded culture, process standardization, automation frameworks, education, and knowledge sharing. The team practices industry reliability frameworks such as Service Level Objectives (SLOs) and Service Level Indicators (SLIs), release engineering, Infrastructure as Code (IaC), and operations automation. The team empowers our product developers throughout the Product Development Life Cycle to ensure product reliability. This includes, but is not limited to, building self-serve tools and processes and an infrastructure foundation that allows product teams to consistently deliver highly reliable systems.
As a Reliability Engineer at NE Digital, you will have the opportunity to manage the complex challenges of the Social Enterprise System that are unique to NE Digital, while using your expertise in coding, algorithms, complexity analysis, and large-scale system design.
You will be reporting to the Architecture & Reliability Lead.
- Work with product developers to ensure that the software delivery pipeline is as reliable as possible.
- Drive practices that ensure the reliability of the product.
- Collaborate closely with product developers to ensure that the designed solution responds to non-functional requirements such as availability, performance, security, and maintainability.
- Take responsibility for availability, latency, performance, efficiency, monitoring, emergency response, and system capacity planning.
- Improve the whole lifecycle of services, from inception and design through deployment, operation, and refinement.
- Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, system capacity planning and post-mortems.
- Maintain services once they are launched by measuring and monitoring availability, latency, and overall system health.
- Scale systems sustainably through mechanisms like automation; evolve systems by pushing for changes that improve reliability and velocity.
- Practice sustainable incident response and blameless postmortems.
- Document “tribal” knowledge.
QUALIFICATIONS
- Experience in analyzing and troubleshooting systems.
- Understanding of infrastructure monitoring, logging, alerting, release, and configuration management.
- Understanding of networking (e.g. TCP/IP, routing, network topology, load balancers, DNS, NTP).
- Experience in one of the following: Python, Java, Go, Perl, Ruby, or shell scripting.
- Experience with public cloud platforms such as AWS and/or GCP.
- Experience with software deployment and/or orchestration technologies, e.g., Puppet, Chef, Salt, Ansible, Docker, Kubernetes, Terraform.
- Experience with CI/CD tooling (e.g., JIRA, Git, Jenkins, Nexus).
- Experience in standard IT security practices (e.g., encryption, certificates, key management).
- Excellent communication and problem-solving skills with strong attention to detail.
- Flexibility to work outside business hours, which may include weekends and/or holidays.
- Self-starter who is able to identify and perform tasks with minimal supervision.