Reliability Engineering Specialist

2 weeks ago


Singapore ADVANCED MICRO DEVICES (SINGAPORE) PTE LTD Full time
Roles & Responsibilities

THE ROLE:

Join a dynamic global team dedicated to advanced reliability testing of module and system boards of AMD's cutting-edge products. Collaborate closely with cross-functional teams across AMD Global Operations & Quality, and Data Center organizations on accelerator-product system setup and reliability testing.

KEY RESPONSIBILITIES:

  • System-level setup and testing:
    • Plan, execute, and optimize system-level setups for accelerator products, including server rack and system configurations.
    • Ensure seamless integration and functionality of server systems with advanced cooling solutions and environmental management systems.
    • Validate and maintain reliability test scripts for automated and manual testing processes.
  • Reliability assessment and testing:
    • Conduct comprehensive reliability assessments of accelerator systems, focusing on mechanical, thermal, and electrical stress factors.
    • Design and implement environmental stress tests to simulate data center conditions, including operational stress, thermal cycling, signal, and power integrity.
    • Evaluate material interactions and their impact on product reliability, ensuring robustness in diverse operating environments.
    • Analyze results to identify potential reliability risks and areas for design improvement.
  • Functional testing and fault isolation:
    • Perform detailed functional testing to evaluate system performance under various operational conditions.
    • Identify, isolate, and troubleshoot faults using advanced diagnostic tools and methodologies.
  • Failure analysis and reporting:
    • Perform root cause analysis for identified reliability failures and develop corrective actions for design and process enhancement.
    • Collaborate with cross-functional teams to conduct root cause analysis of reliability testing failures.
  • Collaboration and documentation:
    • Work closely with design, manufacturing, and quality teams to align reliability goals with overall product requirements.
    • Generate comprehensive reports detailing reliability test results, analysis, and recommendations.
    • Maintain meticulous records of testing methodologies and outcomes for future reference and continuous improvement initiatives.
  • Mentorship:
    • Effectively mentor junior engineers, providing guidance in both technical domains and professional skill development to foster growth and team success.

PREFERRED EXPERIENCE:

  • Knowledge of reliability engineering principles, product lifecycle, and standards in high-performance computing environments.
  • Proven experience in system-level setup and testing for accelerator products or similar technologies.
  • Proficiency in developing and executing reliability test scripts and protocols.
  • Familiarity with reliability standards and best practices in high-performance computing environments.
  • Familiarity with data center environmental management, server rack/system configurations, and integrated cooling solutions.
  • Strong understanding of environmental stress factors, including thermal, mechanical, and electrical stresses, in server systems (L6–L10).
  • Expertise in failure analysis techniques, including root cause analysis and fault isolation methodologies.
  • Excellent written and verbal communication skills for clear reporting and collaboration.
  • Strong analytical, problem-solving, and communication skills.
  • Experience with reliability testing tools, simulation software and statistical tools is an added advantage.
  • Knowledge in project and risk management is an added advantage.
  • Self-starter and able to independently drive tasks to completion.
  • Ability to structure and execute complex analysis, draw insights, and communicate summary conclusions/recommendations to senior management and AMD customers/partners.
  • Ability to network, build relationships, and collaborate to drive effective decision-making across multiple functions and levels within AMD.

ACADEMIC CREDENTIALS:

  • Bachelor's or Master's degree in Electrical/Electronics Engineering (EE) or a related field.

LOCATION:

Singapore

Tell employers what skills you have
Cycling
Manual Testing
Budget Management
Ubuntu
Root Cause Analysis
Reliability
Administration Management
Reliability Engineering
Infrastructure Architecture
RedHat
Technical Consultation
Environmental Management Systems
Technical Engineering
Failure Analysis

  • Singapore beBeePerformance Full time

    Job Title: Reliability Engineering Specialist This is a challenging role that involves ensuring the reliability, scalability, and performance of our Service to Enterprise Customers. As a Reliability Engineering Specialist, you will identify and investigate customer performance problems, recommend remediation actions, and drive initiatives to improve system...


  • Singapore beBeePerformance Full time $100,000 - $150,000

    Job Title: Reliability Engineering SpecialistThis is a challenging role that involves ensuring the reliability, scalability, and performance of our Service to Enterprise Customers. As a Reliability Engineering Specialist, you will identify and investigate customer performance problems, recommend remediation actions, and drive initiatives to improve system...


  • Singapore beBeePerformance Full time

    Job Title: Reliability Engineering Specialist This is a challenging role that involves ensuring the reliability, scalability, and performance of our Service to Enterprise Customers. As a Reliability Engineering Specialist, you will identify and investigate customer performance problems, recommend remediation actions, and drive initiatives to improve system...


  • Singapore beBeeEngineering Full time $90,000 - $120,000

    Reliability Engineering SpecialistWe are seeking an exceptional Reliability Engineering Specialist to join our team. As a key member, you will play a pivotal role in ensuring the reliability and robustness of our products through advanced engineering practices and comprehensive testing.The ideal candidate will have a strong background in electrical...


  • Singapore ONE STOP ENGINEERING PTE. LTD. Full time

    **Job Scope**: - Assisting in the development and execution of **reliability test plans **for new gaming peripherals during the NPI phase. - Supporting the **failure analysis **process by helping identify and root-cause product defects. - Assisting with the development of **stress testing protocols **, including mechanical, thermal, and environmental...


  • Singapore AMD Full time

    Overview Join to apply for the Reliability Engineering Specialist role at AMD . Role Join a dynamic global team dedicated to advanced reliability testing of module and system boards of AMD's cutting-edge products. Collaborate closely with cross-functional teams across AMD Global Operations & Quality, and Data Center organizations on accelerator-product...


  • Singapore AMD Full time

    Overview Join to apply for the Reliability Engineering Specialist role at AMD . Role Join a dynamic global team dedicated to advanced reliability testing of module and system boards of AMD's cutting-edge products. Collaborate closely with cross-functional teams across AMD Global Operations & Quality, and Data Center organizations on accelerator-product...


  • Singapore beBeeReliability Full time $150,000 - $250,000

    Job Title: Senior Reliability SpecialistJob DescriptionWe are seeking a highly skilled Senior Reliability Specialist to join our team. In this role, you will be responsible for ensuring the long-term reliability and robustness of high-voltage semiconductor devices and systems.As a Senior Reliability Specialist, you will design and execute reliability test...


  • Singapore beBeeReliability Full time $80,000 - $120,000

    Job Title: Reliability Specialist We are seeking a skilled reliability specialist to join our team. As a key member of our organization, you will be responsible for ensuring the reliability and integrity of our products. Your expertise in reliability engineering will play a critical role in identifying and addressing potential issues before they become...

  • Reliability Engineer

    2 weeks ago


    Singapore beBeeMaintenance Full time $90,000 - $120,000

    Reliability Specialist PositionThe Maintenance and Reliability Division is seeking a skilled professional to ensure the reliability and efficiency of our manufacturing operations. This role will play a vital part in driving key performance indicators (KPIs) and supporting the reliability program in the maintenance department.Your Responsibilities:Ensure all...