Senior Expert in High-Performance Computing Infrastructure

5 days ago


Singapore beBeeComputing Full time
Job Summary:

We are seeking a High-Performance Computing Engineer to join our team. As a key member of our organization, you will be responsible for ensuring the reliable operations of central GPU Clusters used for AI training and High-Performance Computing (HPC) Clusters. You will also advise users on workload execution and optimization strategies, provide support for resources they need, and support the maintenance and troubleshooting of AI and HPC infrastructure to ensure system stability.

Key Responsibilities:
  • Ensure the reliable operations of the central GPU Clusters used for AI training and HPC Clusters.
  • Advise users on workload execution and optimization strategies.
  • Provide users with the necessary resources and support for their needs.
  • Support the maintenance and troubleshooting of AI and HPC infrastructure to ensure system stability.

Requirements:
  • Degree in Computer Engineering/Computer Science/Electrical & Electronic Engineering.
  • Proficient in UNI/Linux operating systems and command-line interfaces.
  • Familiar with monitoring tools.
  • Good knowledge and experience in HPC performance optimization and troubleshooting.
  • Proven working knowledge of HPC systems and software.
  • Strong programming skills in Python and Bash scripting.
  • Familiarity with HPC schedulers, container orchestration, and GPU-based systems.
  • Experience with HPC scheduling and workload management tools.
  • Experience in managing parallel file systems and cluster management software.

Skills and Qualifications:
  • Parallel Computing
  • Distributed Systems
  • Cluster Management

Benefits:
  • Competitive remuneration packages.
  • Scholarship opportunities.
  • Regular career dialogues and development frameworks.
  • A supportive work environment.


  • Singapore beBeeHpc Full time $90,000 - $120,000

    Job Opportunity: High-Performance Computing Expert">As a key member of our team, you will be responsible for designing and implementing high-performance computing infrastructure solutions that meet the needs of our clients.With opportunities for career progression through certifications and training, this role is ideal for those looking to develop their...


  • Singapore beBeeHpc Full time $90,000 - $120,000

    Job Title: High-Performance Computing Infrastructure SpecialistA leading organization is seeking an experienced professional to fill the role of High-Performance Computing Infrastructure Specialist. This position involves designing, implementing, and maintaining high-performance computing infrastructure for various applications.Key Responsibilities:Design...


  • Singapore beBeeSoftwareDevelopment Full time $90,000 - $120,000

    About UsWe are a leading organization in the field of defense research and development, committed to developing technological solutions to enhance national security. In this role, you will have the opportunity to make a real impact and shape the future of our organization across various domains. Our team is passionate about delivering digital capabilities...


  • Singapore DSO National Laboratories Full time

    JOB DESCRIPTION DSO National Laboratories (DSO) is Singapore's largest defence research and development (R&D) organisation, with the critical mission to develop technological solutions to sharpen the cutting edge of Singapore's national security. At DSO, you will develop more than just a career. This is where you will make a real impact and shape the future...


  • Singapore beBeeHighperformance Full time $90,000 - $120,000

    High-Performance Computing ProfessionalWe are seeking a skilled High-Performance Computing (HPC) professional to join our team. In this role, you will be responsible for the reliable operation of central GPU clusters used for AI training and HPC.Ensure the smooth functioning of GPU and HPC clusters;Provide users with expert advice on workload execution and...


  • Singapore DSO National Laboratories Full time

    JOB DESCRIPTION DSO National Laboratories (DSO) is Singapore's largest defence research and development (R&D) organisation, with the critical mission to develop technological solutions to sharpen the cutting edge of Singapore's national security. At DSO, you will develop more than just a career. This is where you will make a real impact and shape the future...


  • Singapore beBeeHpcInfrastructure Full time

    Job Description "> AWS and PB-based infrastructure is at the heart of this role, with opportunities for career progression through certifications and broadening your technology scope. "> You will gain exposure to AI infrastructure solutions and deploy, implement, and support AI/HPC (GPU) infrastructure, including servers, storage, networking, and...

  • Senior Sre

    1 week ago


    Singapore Oxford Knight Full time

    Senior SRE (High Performance Computing) | Singapore or Hong Kong **Salary**: up to 250-275k SGD base **Summary** High-frequency prop trading firm with offices worldwide looking for skilled Senior Site Reliability Engineer developer to join their High Performance Computing team, developing and supporting their large-scale compute and storage...


  • Singapore beBeeInfrastructure Full time $80,000 - $150,000

    Cloud Infrastructure Engineer Job DescriptionAs a Cloud Infrastructure Engineer, you will be responsible for designing, deploying and maintaining scalable cloud infrastructure for high-performance computing applications. This includes managing container orchestration platforms, automating provisioning of compute resources, and monitoring infrastructure...


  • Singapore beBeeDataCentreEngineer Full time $1,200,000 - $1,400,000

    Experience the thrill of working with cutting-edge AI technology in a fast-paced, dynamic environment. We are seeking a skilled Data Centre Engineer to join our team, supporting the daily operations and maintenance of our high-performance computing infrastructure.This role requires strong technical expertise, excellent problem-solving skills, and the ability...