Senior Hpc Engineer

2 weeks ago


Singapore Nanyang Technological University Full time

The High-Performance Computing Centre (HPCC) was established in 2010 to support the needs of large-scale and compute and data intensive computation at the University.

This role will support NSCC-NTU Active Business Continuity Platform (ABCP) operations which has about 20PB of data, and support HPCC Compute Resources that has compute power of more than 750 TFlops and 2.7PB of data. There are more than 160 active projects with HPCC and more 800 staff and students involved with research that requires HPC computation.

**Responsibilities**

1. Support the Active Business Continuity Platform (ABCP)
- Monitor and Analysis of ABCP System operation performance
- Operate Data Management Framework (DMF) Storage policies and efficiency Utilization
- Conduct regular Backup of data and Lightweight Directory Access Protocol (LDAP) Information from primary site to HPCC Plan, Carry Backup Continuity Platform Annual Operational Exercise successfully

2. System Administration of large-scale HPC cluster and Petabyte Storage
- Monitoring and analysis of Cluster Performance.
- Managing of High Performance, Ultra-Low Latency Network (Ethernet and InfiniBand).
- Deployment and Managing new HPC technology such as Container Technology and Field Programmable Gate Arrays (FPGAs)
- Optimizing and performance tuning of Parallel File System to optimize performance and capacity
- Installation and Compilation of Open-Source and Commercial Software Stacks for High Performance Computing and Deep Learning
- Optimization of Software Stack for large scale parallelization

3. Support for HPC and Deep-Learning Users
- Conducting training for users and updating of User Guide
- Provide Technical support to users' issues when running jobs at HPC.

**Requirements**:

- Recognized Bachelor Degree in Computer Engineering, Sciences or its equivalent. Diploma with relevant working experiences with Open-Source Software will be considered.
- At least 3 years of Good Working Experience in Linux/UNIX and Open-Source Software Stack is preferred.
- Experienced in BASH, Python or Perl Programming is expected
- Understanding and working experiences with container-based infrastructure is preferred
- Experience in scientific/engineering packages such as in Life Science, Computation Fluid Dynamics (CFD), Visualization, modelling and other areas in Science and Engineering is preferred.
- Working experiences of Scheduler (PBS-Professional) is advantageous
- Project management experience with demonstrated ability to lead planning meetings, insuring design and solution reviews take place.
- Good self-motivated team player
- Strong analytical and problem solving skills
- Good planning, organizational and time management skills
- Good interpersonal and communication skills is able to interact with faculty members and researchers

Hiring Institution: NTU

In line with Singapore’s nationwide Vaccination-Differentiated Safe Management Measures (VDS), employees must be fully vaccinated to return to the workplace, unless certified to be medically ineligible. For Information on VDS, please click here.


  • Hpc Build Engineer

    3 days ago


    Singapore JAN AI PTE. LTD. Full time

    This role is responsible for the design, assembly and configuration of high-performance computing (HPC) systems to meet the specific requirements for computational workloads of researchers and scientists. It involves selecting and integrating the appropriate hardware and software components as well as thoroughly testing and optimising the HPC systems. **Key...


  • Singapore beBeeSoftwareDevelopment Full time

    Digital innovation is at the forefront of technological advancements, and high-performance computing is a crucial aspect of this journey. We are seeking an exceptional professional to lead our efforts in developing and maintaining cutting-edge HPC systems.Job Description:We require an individual with extensive experience in managing parallel file systems,...


  • Singapore beBeeAI Full time

    Job Title: Senior AI/HPC EngineerAre you a creative and autonomous professional who loves a challenge? Do you have the skills to deploy, manage, and maintain complex AI/HPC infrastructure in Linux-based environments?About the Role:We are seeking an experienced engineer to join our team as a Senior AI/HPC Engineer. This is a dynamic customer-facing role that...


  • Singapore beBeeHighPerformance Full time $180,000 - $250,000

    Job OverviewWe are seeking a Senior Engineer to lead the operations of our High-Performance Computing (HPC) infrastructure.This role involves ensuring the reliable operation of central GPU Clusters used for AI training and HPC Clusters, advising users on workload execution and optimisation strategies, providing user support for resources they need, and...


  • Singapore Pure Storage Full time

    We are looking for a passionate, inspirational, hands-on System Engineer for Pure's fast-growing AI and HPC Systems Engineering team. This group is composed of highly motivated technical sales resources whose goal is to develop and lead Pure's AI and HPC business, including providing guidance, enablement, and support of sales opportunities and partnerships...


  • Singapore beBeeHardware Full time $120,000 - $180,000

    Job DescriptionWe are seeking an experienced HPC infrastructure professional to join our team as a Hardware Manager. The ideal candidate will have a strong understanding of high-performance computing and experience in deriving hardware specifications based on requirements.This role involves collaborating with cross-functional teams to develop and implement...


  • Singapore KLA Full time

    Join to apply for the HPC AI Infrastructure Hardware Manager role at KLA Continue with Google Continue with Google Join to apply for the HPC AI Infrastructure Hardware Manager role at KLA Get AI-powered advice on this job and more exclusive features. Sign in to access AI-powered advices Continue with Google Continue with Google Continue with Google Continue...


  • Singapore NOVAGLOBAL PTE LTD Full time

    Be involved in complex architectural design and development of AI-HPC infrastructure. - Ensures completeness and compatibility of the technical infrastructure to support system performance. - Requires leading-edge skills in the latest areas of new technology including AI/DL/ML, HPC & Kubernetes - Ability to diagnose and fix the most complex server and...


  • Singapore KLA-Belgium Full time

    Company Overview KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA...

  • Technical Manager

    2 weeks ago


    Singapore OPENSOURCE PTE. LTD. Full time

    **Technical Service Delivery Manager - High Performance Computing (HPC)**: **Responsibilities**: - **Client Communication** Serve as the primary point of contact for clients in the HPC domain—understanding their needs and communicating technical solutions effectively. - **Service Delivery Oversight** Manage the delivery of technical services related to...