Reliability Engineering Specialist

2 days ago


Singapore ADVANCED MICRO DEVICES (SINGAPORE) PTE LTD Full time
Roles & Responsibilities

THE ROLE:

Join a dynamic global team dedicated to advanced reliability testing of module and system boards of AMD's cutting-edge products. Collaborate closely with cross-functional teams across AMD Global Operations & Quality, and Data Center organizations on accelerator-product system setup and reliability testing.

KEY RESPONSIBILITIES:

  • System-level setup and testing:
    • Plan, execute, and optimize system-level setups for accelerator products, including server rack and system configurations.
    • Ensure seamless integration and functionality of server systems with advanced cooling solutions and environmental management systems.
    • Validate and maintain reliability test scripts for automated and manual testing processes.
  • Reliability assessment and testing:
    • Conduct comprehensive reliability assessments of accelerator systems, focusing on mechanical, thermal, and electrical stress factors.
    • Design and implement environmental stress tests to simulate data center conditions, including operational stress, thermal cycling, signal, and power integrity.
    • Evaluate material interactions and their impact on product reliability, ensuring robustness in diverse operating environments.
    • Analyze results to identify potential reliability risks and areas for design improvement.
  • Functional testing and fault isolation:
    • Perform detailed functional testing to evaluate system performance under various operational conditions.
    • Identify, isolate, and troubleshoot faults using advanced diagnostic tools and methodologies.
  • Failure analysis and reporting:
    • Perform root cause analysis for identified reliability failures and develop corrective actions for design and process enhancement.
    • Collaborate with cross-functional teams to conduct root cause analysis of reliability testing failures.
  • Collaboration and documentation:
    • Work closely with design, manufacturing, and quality teams to align reliability goals with overall product requirements.
    • Generate comprehensive reports detailing reliability test results, analysis, and recommendations.
    • Maintain meticulous records of testing methodologies and outcomes for future reference and continuous improvement initiatives.
  • Mentorship:
    • Effectively mentor junior engineers, providing guidance in both technical domains and professional skill development to foster growth and team success.

PREFERRED EXPERIENCE:

  • Knowledge of reliability engineering principles, product lifecycle, and standards in high-performance computing environments.
  • Proven experience in system-level setup and testing for accelerator products or similar technologies.
  • Proficiency in developing and executing reliability test scripts and protocols.
  • Familiarity with reliability standards and best practices in high-performance computing environments.
  • Familiarity with data center environmental management, server rack/system configurations, and integrated cooling solutions.
  • Strong understanding of environmental stress factors, including thermal, mechanical, and electrical stresses, in server systems (L6–L10).
  • Expertise in failure analysis techniques, including root cause analysis and fault isolation methodologies.
  • Excellent written and verbal communication skills for clear reporting and collaboration.
  • Strong analytical, problem-solving, and communication skills.
  • Experience with reliability testing tools, simulation software and statistical tools is an added advantage.
  • Knowledge in project and risk management is an added advantage.
  • Self-starter and able to independently drive tasks to completion.
  • Ability to structure and execute complex analysis, draw insights, and communicate summary conclusions/recommendations to senior management and AMD customers/partners.
  • Ability to network, build relationships, and collaborate to drive effective decision-making across multiple functions and levels within AMD.

ACADEMIC CREDENTIALS:

  • Bachelor's or Master's degree in Electrical/Electronics Engineering (EE) or a related field.

LOCATION:

Singapore

Tell employers what skills you have

Cycling
Manual Testing
Budget Management
Ubuntu
Root Cause Analysis
Reliability
Administration Management
Reliability Engineering
Infrastructure Architecture
RedHat
Technical Consultation
Environmental Management Systems
Technical Engineering
Failure Analysis

  • Singapore ONE STOP ENGINEERING PTE. LTD. Full time

    Title**:Reliability Engineer Purpose Statement (2-3 Sentences): - Ensures reliability and maintainability of equipment, processes, utilities, facilities and controls with an objective to constantly improve site production and cost performance. - Develops engineering solutions to repetitive failures and all other problems that adversely affect plant...


  • Singapore Advanced Micro Devices Full time

    **Reliability Engineering Specialist**: - Singapore, Singapore - Engineering - 66974 **Job Description**: **WHAT YOU DO AT AMD CHANGES EVERYTHING** - We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences...


  • Singapore beBeeMechanical Full time $80,000 - $120,000

    Static Equipment Specialist Job Summary">We are seeking a highly skilled and experienced mechanical engineer to support our engineering division in Singapore. The successful candidate will be responsible for providing specialist mechanical engineering support for static equipment, ensuring compliance with statutory obligations and engineering codes of...


  • Singapore ST Engineering Group Full time $90,000 - $120,000 per year

    ResponsibilitiesResponsible to perform regular reliability reviews of components for our contracted customers.Develop reliability reports and presentation for customers / internal stakeholders to improve fleet reliability.Support any technical queries of department through liaising with original equipment manufacturer, if required.Able to work strategically...


  • Singapore beBeeReliability Full time $4,000 - $8,000

    System Reliability SpecialistWe are seeking a highly skilled and experienced System Reliability Specialist to join our team. The successful candidate will be responsible for ensuring the optimal performance and availability of our systems.Key Responsibilities:Troubleshoot system issues and resolve them in a timely mannerConduct root cause analysis for...


  • Singapore beBeeMechanicalEngineer Full time

    Job Description: This role involves leading reliability studies and analysis, including root caused failure analysis (RCA) and failure modes & effect analysis (FMEA). The incumbent will review and update maintenance procedures for mechanical equipment, ensuring best practices are maintained. Additionally, they will be responsible for the predictive...


  • Singapore beBeeReliability Full time $4,500 - $6,000

    Job Opportunity: Reliability Test Specialist">We are seeking a highly skilled and detail-oriented individual to join our team as a reliability test specialist. In this role, you will be responsible for conducting static and dynamic reliability tests in a controlled environment. Your attention to detail and ability to analyze complex data will be essential in...


  • Singapore Flowserve Full time

    Flowserve is presently recruiting a Reliability Engineer to support our Innovation & Product Development initiatives reporting to the Director Application Development & Emerging Technologies. This is an opportunity to take a lead role on supporting engineering design and consultancy activities on all aspects related to technical risks and reliability...


  • Singapore Singapore Technologies Engineering Ltd Full time

    Job ID: 19205 - Location: Aero - 505A Airport Road, SG - Description: - Responsibilities - Responsible to perform regular reliability reviews of components for our contracted customers. - Develop reliability reports and presentation for customers / internal stakeholders to improve fleet reliability. - Support any technical queries of department through...


  • Singapore beBeeMaintenance Full time

    Reliability Engineering Specialist This role is crucial in ensuring the efficiency and dependability of our manufacturing processes. The Maintenance & Reliability Engineer will be responsible for driving key performance indicators (KPIs) and supporting the reliability program within the maintenance department. About the Position Ensure all maintenance...