HPC System Architect

1 week ago


Singapore beBeeExpert Full time $90,000 - $120,000

We are seeking a seasoned expert to lead the administration and operation of our Linux-based High-Performance Computing (HPC) environment.

This role involves providing hands-on support for HPC system software, troubleshooting issues across hardware, software, OS, and networking layers, and collaborating with engineers to support AI/deep learning applications.

Key Responsibilities:
  • Manage and operate HPC Linux clusters, storage systems, and high-speed networks.
  • Support HPC system software including cluster management, parallel file systems, and job schedulers.
  • Troubleshoot and resolve complex technical issues.
  • Collaborate with engineers on AI/deep learning application support.
  • Advise researchers on application development, debugging, optimization, and parallelization.
  • Plan and execute HPC application tuning and parallelization.
  • Provide expert-level guidance on job management and code execution.
  • Engage with end-users to support numerical simulation applications.
  • Support Distributed Data Parallel (DDP) training across multi-GPU setups.
  • Assist users with AI/ML frameworks such as TensorFlow, PyTorch, and Hugging Face Transformers.
Requirements:
  • Bachelor's or Master's degree in Computer Science, Engineering, Physics, or a related field.
  • At least 5 years of experience with large-scale HPC systems, including cluster operations and user support.
  • In-depth knowledge of parallel programming and code optimization using languages like Fortran, C, C++, and libraries like MPI, OpenMP.
  • Proficient in Linux OS administration and scripting for system automation.
  • Strong understanding of HPC system architecture, performance tuning, and resource management.
  • Experience with numerical simulations, such as weather forecasting, climate modeling, and CFD applications.
  • Familiarity with AI/DL tools and scalable training across multi-GPU environments.
  • Experience with HPC job scheduling systems like SLURM, LSF, or PBS.
About You:
  • Highly motivated and results-driven with strong problem-solving skills.
  • Excellent collaboration and interpersonal skills.
  • Strong verbal and written communication skills.
  • Confident in presenting technical topics to diverse audiences.
  • Proactive and resourceful with a strong customer service orientation.


  • Singapore OPENSOURCE PTE. LTD. Full time

    **Job Title**:HPC & Linux System Administrator **Location**:Singapore **Experience**:10+ Years **Professional Summary**: A highly experienced and driven HPC & Linux System Administrator with over a decade of expertise in managing hybrid HPC infrastructures and enterprise Linux environments. Skilled in integrating, operating, and optimizing high-performance...


  • Singapore Pure Storage Full time

    AI/HPC Consulting Field Solutions Architect Join to apply for the AI/HPC Consulting Field Solutions Architect role at Pure Storage AI/HPC Consulting Field Solutions Architect 1 day ago Be among the first 25 applicants Join to apply for the AI/HPC Consulting Field Solutions Architect role at Pure Storage Get AI-powered advice on this job and more exclusive...


  • Singapore Pure Storage Full time

    AI/HPC Consulting Field Solutions Architect Join to apply for the AI/HPC Consulting Field Solutions Architect role at Pure Storage AI/HPC Consulting Field Solutions Architect 1 day ago Be among the first 25 applicants Join to apply for the AI/HPC Consulting Field Solutions Architect role at Pure Storage Get AI-powered advice on this job and more...


  • Singapore Pure Storage Full time

    AI/HPC Consulting Field Solutions Architect Join to apply for the AI/HPC Consulting Field Solutions Architect role at Pure Storage AI/HPC Consulting Field Solutions Architect 1 day ago Be among the first 25 applicants Join to apply for the AI/HPC Consulting Field Solutions Architect role at Pure Storage Get AI-powered advice on this job and more exclusive...

  • System Engineer

    4 days ago


    Singapore NodeFlair Full time

    **Job Summary**: **Salary** S$8,000 - S$9,000 / Monthly **Job Type** **Seniority** Mid **Years of Experience** At least 5 years **Tech Stacks** C++ Linux C Fujitsu is seeking a High-Performance Computational (HPC) Engineer. This position will participate in the support of our Linux based high-performance computing, storage, and networking environment...


  • Singapore beBeeAIHPC ARCHITECT Full time $90,000 - $120,000

    Unlock Your Potential as a Field Solutions Architect","Join us in driving innovation in the data storage industry with your expertise in AI and HPC. As a key member of our Field Solutions Architecture Team, you will play a vital role in aligning customer needs with our cutting-edge solutions for AI/ML workloads, GPU-accelerated computing, distributed...


  • Central Singapore Smartedge Solutions Full time

    **Location** - Singapore, Central Singapore**Job Type** - Full Time**Date Posted** - 5 hours agoAdditional Details **Job ID** - 130623**Job Views** - 1HPC PreSales Senior Solution Architect, Singapore for Tier 1 client. JOB RESPONSIBILITIES: Duties and Responsibilities: - Work closely with customers and prospects to understand business and technical...


  • Singapore beBeeHighPerformance Full time $120,000 - $140,000

    Job OverviewWe are seeking a highly experienced and driven High-Performance Computing (HPC) professional to support our Linux-based HPC environment.This is an exceptional opportunity for individuals with a passion for HPC system administration, user support, and collaboration with software engineers to support AI/deep learning applications and desktop...


  • Singapore Pure Storage Full time

    AI/HPC Consulting Field Solutions Architect Join to apply for the AI/HPC Consulting Field Solutions Architect role at Pure Storage AI/HPC Consulting Field Solutions Architect 1 day ago Be among the first 25 applicants Join to apply for the AI/HPC Consulting Field Solutions Architect role at Pure Storage Get AI-powered advice on this job and more...

  • System Engineer

    2 weeks ago


    Singapore Jobline Resources Pte Ltd Full time $90,000 - $120,000 per year

    Responsibilities• Administration and operation of several HPC Linux clusters, storage, networking and associated system and application software. • Understand and work with parallel file systems, HPC cluster management software, and HPC job scheduler software. • Troubleshooting hardware, software, operating systems and networking as necessary. • Work...