Senior HPC Engineer

7 days ago


Singapore Nanyang Technological University Full time

The High-Performance Computing Centre (HPCC) was established in 2010 to support the University's needs for large-scale, compute- and data-intensive computation.

This role will support the operations of the NSCC-NTU Active Business Continuity Platform (ABCP), which holds about 20 PB of data, and the HPCC compute resources, which provide more than 750 TFLOPS of compute power and 2.7 PB of data. HPCC has more than 160 active projects, and more than 800 staff and students are involved in research that requires HPC computation.

**Responsibilities**

1. Support the Active Business Continuity Platform (ABCP)
- Monitor and analyze ABCP system operation and performance
- Operate Data Management Framework (DMF) storage policies for efficient utilization
- Conduct regular backups of data and Lightweight Directory Access Protocol (LDAP) information from the primary site to HPCC
- Plan and carry out the Backup Continuity Platform annual operational exercise successfully

2. System Administration of large-scale HPC cluster and Petabyte Storage
- Monitoring and analysis of cluster performance
- Management of the high-performance, ultra-low-latency network (Ethernet and InfiniBand)
- Deployment and management of new HPC technologies such as container technology and field-programmable gate arrays (FPGAs)
- Tuning of the parallel file system to optimize performance and capacity
- Installation and compilation of open-source and commercial software stacks for high-performance computing and deep learning
- Optimization of the software stack for large-scale parallelization

3. Support for HPC and Deep-Learning Users
- Conducting training for users and updating the User Guide
- Providing technical support for issues users encounter when running jobs on the HPC systems

**Requirements**:

- Recognized Bachelor's degree in Computer Engineering, Sciences or equivalent. Diploma holders with relevant working experience with open-source software will be considered.
- At least 3 years of solid working experience with Linux/UNIX and open-source software stacks is preferred.
- Experience in Bash, Python or Perl programming is expected.
- Understanding of and working experience with container-based infrastructure is preferred.
- Experience with scientific/engineering packages in areas such as life sciences, computational fluid dynamics (CFD), visualization, modelling and other areas of science and engineering is preferred.
- Working experience with a scheduler (PBS Professional) is advantageous.
- Project management experience, with demonstrated ability to lead planning meetings and ensure that design and solution reviews take place.
- Self-motivated team player
- Strong analytical and problem solving skills
- Good planning, organizational and time management skills
- Good interpersonal and communication skills, with the ability to interact with faculty members and researchers

Hiring Institution: NTU

In line with Singapore’s nationwide Vaccination-Differentiated Safe Management Measures (VDS), employees must be fully vaccinated to return to the workplace, unless certified to be medically ineligible.

