
Senior System Reliability Engineer
2 weeks ago
Reliability Engineering Specialist
This position involves working within a dynamic, global team that is dedicated to advanced reliability testing of cutting-edge products. The specialist will collaborate closely with cross-functional teams across various organizations on setup and testing of accelerator-product systems.
Main Responsibilities
- Setup and Testing:
- Plan, execute, and optimize system-level setups for accelerator products, including server rack and system configurations.
- Ensure seamless integration and functionality of server systems with advanced cooling solutions and environmental management systems.
- Validate and maintain reliability test scripts for automated and manual testing processes.
- Reliability Assessment and Testing:
- Conduct comprehensive reliability assessments of accelerator systems, focusing on mechanical, thermal, and electrical stress factors.
- Design and implement environmental stress tests to simulate data center conditions, including operational stress, thermal cycling, and signal and power integrity.
- Evaluate material interactions and their impact on product reliability, ensuring robustness in diverse operating environments.
- Analyze results to identify potential reliability risks and areas for design improvement.
- Functional Testing and Fault Isolation:
- Perform detailed functional testing to evaluate system performance under various operational conditions.
- Identify, isolate, and troubleshoot faults using advanced diagnostic tools and methodologies.
- Failure Analysis and Reporting:
- Perform root cause analysis for identified reliability failures and develop corrective actions for design and process enhancement.
- Collaborate with cross-functional teams to conduct root cause analysis of reliability testing failures.
- Collaboration and Documentation:
- Work closely with design, manufacturing, and quality teams to align reliability goals with overall product requirements.
- Generate comprehensive reports detailing reliability test results, analysis, and recommendations.
- Maintain meticulous records of testing methodologies and outcomes for future reference and continuous improvement initiatives.
- Mentorship:
- Effectively mentor junior engineers, providing guidance in both technical domains and professional skill development to foster growth and team success.
Required Skills and Qualifications
- Key Requirements:
- Possess knowledge of reliability engineering principles, product lifecycle, and standards in high-performance computing environments.
- Demonstrate proven experience in system-level setup and testing for accelerator products or similar technologies.
- Show proficiency in developing and executing reliability test scripts and protocols.
- Familiarity with reliability standards and best practices in high-performance computing environments.
- Familiarity with data center environmental management, server rack/system configurations, and integrated cooling solutions.
- Strong understanding of environmental stress factors, including thermal, mechanical, and electrical stresses, in server systems.
- Expertise in failure analysis techniques, including root cause analysis and fault isolation methodologies.
- Excellent written and verbal communication skills for clear reporting and collaboration.
- Strong analytical and problem-solving skills.
- Experience with reliability testing tools, simulation software, and statistical tools is an added advantage.
- Knowledge of project and risk management is an added advantage.
- Self-starter who can independently drive tasks to completion.
- Ability to structure and execute complex analysis, draw insights, and communicate summary conclusions/recommendations to senior management and customers/partners.
- Ability to network, build relationships, and collaborate to drive effective decision-making across multiple functions and levels.
Academic Credentials
- Education:
- Bachelor's or Master's degree in Electrical/Electronics Engineering (EE) or a related field.
-
Systems Optimization Expert
3 days ago
Singapore beBee Engineer Full time $80,000 - $120,000. Role Description: We are seeking an Engineer to join our team, responsible for monitoring and maintaining critical information-communication systems. About the Role: As a key member of our team, you will work alongside fellow engineers to ensure the reliability and functionality of our communication systems. Your expertise in advanced technologies...
-
Reliability Engineer/ Senior Engineer
4 days ago
Singapore Systems on Silicon Manufacturing Co. Pte. Ltd. Full time. Position Detail - Reliability Engineer/Senior Engineer - Posting Date: 28 Apr 2025 | Closing Date: 27 Jul 2025. SSMC (Systems on Silicon Manufacturing Company Pte. Ltd.) is a joint venture between NXP and TSMC. We offer flexible and cost-effective semiconductor fabrication solutions by maintaining a fully equipped SMIF cleanroom environment, 100% equipment...
-
Reliable Systems Engineer
2 weeks ago
Singapore beBeeReliability Full time $100,000 - $140,000. Job Overview: The position of System Reliability Engineer Specialist is available in a dynamic global team focused on advanced reliability testing for cutting-edge products. Key Responsibilities: System-Level Setup and Testing: Develop and optimize system-level setups for accelerator products, including server rack and system configurations. Ensure seamless...
-
Senior Reliability Engineer
7 days ago
Singapore Air Liquide Full time. Air Liquide Singapore Private Limited (ALSg), the largest and leading industrial gas company in South-east Asia, employs 800 people recognized for their high level of expertise to provide gases, equipment, services and packaged solutions to support the requirements of the Singapore manufacturing industry. The Senior Reliability Engineer is responsible to...
-
Reliable Systems Engineer
1 week ago
Singapore beBeeReliability Full time $80,000 - $120,000. System Reliability Expertise: This role entails delivering technical support across the lifecycle of clients' systems and equipment, focusing on reliability, availability, and maintainability (RAM). The successful candidate will propose and implement integrated logistics support (ILS) solutions to optimize the reliability, maintainability, and logistics...
-
Senior Site Reliability Engineer
1 week ago
Singapore EC1 Partners Full time $120,000 - $150,000 per year. Senior Site Reliability Engineer – Linux Focus (Singapore). Buy Side | Permanent | Singapore. EC1 Partners is working with a leading buy-side firm that is scaling its technology platform to support growing global demand. The firm is known for its high-performance systems, collaborative culture, and strong emphasis on engineering excellence. As part of their...
-
Senior Principal Reliability Engineer
2 days ago
Singapore NXP Semiconductors Full time. Locations: Singapore. Time type: Full time. Posted on: Posted Today. Job requisition id: R-. We are looking for a Reliability Engineer in preparation for the formation of the joint venture of NXP and VIS, known as VSMC. Job Description: This posting is for a Senior...
-
Senior Reliability Engineer
3 days ago
Singapore beBeeReliability Full time $150,000 - $200,000. Job Title: Senior Reliability Engineer. We are seeking an experienced Senior Reliability Engineer to lead and manage a team of engineers in achieving corporate goals in reliability. The successful candidate will be responsible for providing reliability analysis and lab support for all company technologies and customer issues, developing corporate-wide standards...
-
System Reliability Specialist
1 week ago
Singapore beBeeInfrastructure Full time $90,000 - $120,000. Reliable Systems Engineer: We're looking for a skilled engineer to join our team and help us build and maintain reliable systems. Treat infrastructure and operations as software engineering problems. Design and architect new solutions to improve system agility. Optimize existing solutions to reduce downtime and increase efficiency. Key Responsibilities: Manage AWS,...