Sr. Infra Reliability Engineer

3 days ago


Singapore AMAZON ASIA-PACIFIC RESOURCES PRIVATE LIMITED Full time
Roles & Responsibilities

Our Senior Reliability Engineers have experience using a Physics-of-Failure based approach to develop and implement both analytical and empirical methods for identifying and assessing product quality and reliability risks during the product design, manufacture, and deployment stages. They drive AWS application-specific requirements in carrying out risk analysis driven by lifecycle environmental and operational stresses, including thermal, electrical, chemical, and mechanical stresses, so as to identify overstress and fatigue-related product weaknesses. They also evaluate product design quality and reliability risks and assess quality and reliability issues related to the electronics manufacturing process.


They drive critical component identification and the associated vendor selection and qualification requirements. Using their knowledge of process capability for electronic component production, as well as system-level performance requirements, they establish critical-to-quality and reliability metrics. They also develop datacenter system-level reliability models, together with the related reliability quantification and risk analysis, for datacenter configuration optimization.


During the sustaining stage, you will be responsible for monitoring product performance in the field and for driving root cause analysis of any critical failures, along with the associated corrective and preventive actions. You will drive effective vendor audits and a quarterly review process to continuously improve datacenter availability.


As a subject matter expert (SME) in reliability engineering, with skills in product reliability leadership, business negotiation, and program management, you will analyze and solve problems and communicate with vendors.

In this role, you will be required to travel within APAC and internationally.

Key job responsibilities
The key responsibilities for this role are:

  • Proactively drive reliability risk identification, assessment, and mitigation for critical data center infrastructure equipment
  • Conduct comprehensive root cause analysis of any critical equipment failures in the field
  • Collaborate cross-functionally with internal and external partners to influence product specification, design, and reliability qualification
  • Develop and maintain data center infrastructure reliability models and quantify reliability risks
  • Monitor field performance and drive ongoing reliability improvements
  • Serve as a subject matter expert and provide technical leadership on reliability engineering best practices


Basic qualifications

  • Bachelor's or Master's degree in Reliability Engineering, Physics, Electrical, Mechanical, or Materials Engineering, or a related field
  • 8+ years of Reliability Engineering work experience in a high-reliability industry
  • 5+ years of experience with failure analysis activities and root cause analysis
  • 5+ years of experience with accelerated life testing, stress analysis and finite element analysis

Skills

Finite Element Analysis
Product Design
Ubuntu
Process Capability
Root Cause Analysis
Reliability
Administration Management
Reliability Engineering
Materials Engineering
Quantification
Technical Consultation
Technical Engineering
Stress Analysis
Electronics
Failure Analysis

  • Singapore AMAZON ASIA-PACIFIC HOLDINGS PRIVATE LIMITED Full time

    Roles & Responsibilities: AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access...

  • Reliability Engineer

    1 month ago


    Singapore Cognizant Full time

    About the role: The Reliability Engineer ensures stability of the manufacturing plant, systems health, lifecycle management, and user satisfaction. Prioritizing digital capabilities and infrastructure's reliability, performance, and efficiency is a must. All employees involved in the development and maintenance of these services must work collaboratively to...

  • Infra Engineer

    3 days ago


    Singapore NCS PTE. LTD. Full time

    Roles & Responsibilities: NCS is a leading technology services firm that operates across the Asia Pacific region in over 20 cities, providing consulting, digital services, technology solutions, and more. We believe in harnessing the power of technology to achieve extraordinary things, creating lasting value and impact for our communities, partners, and people....


  • Singapore ZENITH INFOTECH (S) PTE LTD. Full time

    Job Summary: Zenith Infotech (S), a leading ICT consulting firm, is seeking a skilled Infra Backup Lead to join our team. The ideal candidate will have expertise in hypervisor technologies, storage systems, and backup management. This is a 12-month contract role with a focus on operational responsibilities, including shift work. The successful candidate will be...


  • Singapore ADDVALUE INNOVATION PTE LTD Full time

    Roles & Responsibilities: Reliability Engineer. Responsibilities: Work with product development teams to develop reliability requirements, establish a reliability / test program, and perform appropriate analyses to ensure that new products meet all the reliability targets. Perform risk / reliability analysis (FMEA, FMECA, MTBF) for existing and new products. Able...


  • Singapore Singtel Group Full time

    Primary Purpose: The Cloud Engineer is responsible for developing cloud software based on design requirements and for ensuring that software and subroutines work to specification, that program code conforms to standards, and that deliverables meet quality, schedule, and requirements. Responsibilities: Experience in Design and building AWS...


  • Singapore VISA WORLDWIDE PTE. LIMITED Full time

    Roles & Responsibilities: Company Description: Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative,...


  • Singapore AVAGO TECHNOLOGIES INTERNATIONAL SALES PTE. LIMITED Full time

    Roles & Responsibilities: Minimum qualifications: Bachelor’s degree in electrical engineering, or an engineering field. 10 years of experience as a Semiconductor IC, Quality, Reliability or Product Engineer. Experience in a technical team and/or people management. Experience with quality control, test coverage, and IC qualification. Preferred...

  • Reliability Engineer

    1 month ago


    Singapore Cognizant Full time

    About the Role: The Reliability Engineer plays a crucial role in ensuring the stability and reliability of the manufacturing plant's systems, infrastructure, and digital capabilities. This position requires a proactive approach to prevent downtime, optimize system performance, and rapidly deploy new features and capabilities. Key Responsibilities: System...


  • Singapore Celanese Corporation Full time

    Electrical Reliability Engineer Role At Celanese Corporation, we seek an experienced Electrical Reliability Engineer to join our team. In this position, you will play a critical role in ensuring the reliability and efficiency of our electrical systems. Key Responsibilities: Provide technical expertise to enhance electrical reliability and ensure all...


  • Singapore JOHN CRANE SINGAPORE PTE LTD Full time

    Roles & Responsibilities. Purpose of role: To provide on-site support and manage reliability contract of mechanical seals for customers in Singapore. Roles and Responsibilities: Assist and guide seal installation / removal on site. On-site pump / seal initial assessment and inspection. Commissioning of seal system. Perform 5-point checks on the equipment’s prior...


  • Singapore BYTEPLUS PTE. LTD. Full time

    Role Overview: At ByteDance, we're seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you'll be responsible for ensuring the reliability and normal operation of multiple core systems for big data and online computing. This includes building automated operation solutions for large-scale systems, cooperating with the...


  • Singapore BYTEDANCE PTE. LTD. Full time

    About the Job: At ByteDance, we are looking for a talented Site Reliability Engineer to join our team. In this role, you will be responsible for ensuring the reliability and normal operation of multiple core systems for big data and online computing, while paying attention to system capacity and stability. Key Responsibilities: Ensure the reliability and normal...


  • Singapore Aptitude Asia Full time

    At Aptitude Asia, we're seeking a skilled Site Reliability Engineer to join our team. This role is crucial in ensuring the high reliability, availability, and performance of our applications throughout their lifecycle. Key Responsibilities: Develop and implement automation scripts to streamline repetitive tasks and address recurring issues. Collaborate with...


  • Singapore AMGEN SINGAPORE MANUFACTURING PTE. LTD. Full time

    Roles & Responsibilities: Sr Mfg Systems Engineer, Amgen Singapore Manufacturing. HOW MIGHT YOU DEFY IMAGINATION? Amgen is one of the world’s leading independent biotechnology companies. For over 4 decades, Amgen has pioneered biotechnology breakthroughs to bring state-of-the-art medicines from laboratory to the patient. Amgen has not only discovered and developed...


  • Singapore Squarepoint Capital Full time

    Job Role: Reliability Software Engineer. Team: Market Access. As a Reliability Software Engineer at Squarepoint Capital, you will play a critical role in ensuring the performance, stability, and availability of our software systems, as well as their day-to-day operations. The team requires a high software development capacity, along with strong analytical...


  • Singapore Aptitude Asia Full time

    Job Summary: Aptitude Asia seeks a skilled Site Reliability Engineer to ensure the high reliability, availability, and performance of applications throughout their lifecycle. Key Responsibilities: Reliability and Performance: Ensure applications operate with high reliability, availability, and performance. Automation and Innovation: Automate repetitive tasks and...

  • Database Administrator

    2 months ago


    Singapore D L RESOURCES PTE LTD Full time

    Roles & Responsibilities: Job Description: Functionally reporting to the Service Build Engineering Lead under Group Infrastructure Platform Services, the Build Engineer will be responsible for detailed design, build, and post-provision support, and will continuously optimize and automate the end-to-end build and provisioning process in accordance with the bank standard. This...


  • Singapore GOCODE GEEK PTE. LTD. Full time

    Roles & Responsibilities: Collaborate with various teams, including Development/Infra/Products, to ensure successful delivery, maintenance planning and correction of build errors. Day-to-day monitoring, backup, deployment and maintenance of systems. Knowledgeable in industry best practices for security and security incidents monitoring and alert, so as...


  • Singapore Squarepoint Capital Full time

    Squarepoint Capital is a global investment management firm that utilizes a diversified portfolio of systematic and quantitative strategies across financial markets to achieve high-quality, uncorrelated returns for our clients. We have deep expertise in trading, technology, and operations and attribute our success to rigorous scientific research. As a...