HPC AI Infrastructure Hardware Manager

6 days ago


Singapore KLA Corporation Full time

**Company Overview**

KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents systems and solutions for the manufacturing of wafers and reticles, integrated circuits, packaging, printed circuit boards and flat panel displays. The innovative ideas and devices that are advancing humanity all begin with inspiration, research and development. KLA focuses more than average on innovation and we invest 15% of sales back into R&D. Our expert teams of physicists, engineers, data scientists and problem-solvers work together with the world's leading technology providers to accelerate the delivery of tomorrow's electronic devices. Life here is exciting and our teams thrive on tackling really hard problems. There is never a dull moment with us.

**Group/Division**

**Job Description/Preferred Qualifications**

**Principal Responsibilities**:

- Drive team growth and development, providing mentorship and support to team members.
- Ensure the successful execution of projects, meeting deadlines and delivering high-quality results.
- Work with various OEMs to understand their Product offerings and Roadmaps to create optimal HPC Solution Offerings.
- Collaborate with other sub-system teams on developing HPC Cluster Roadmaps that meet Product Requirements.
- Collaborate within customer-focused teams to design, develop, test, and deploy Embedded HPC infrastructure in alignment with business needs.
- Foster strong relationships with Product and Program Management, Software Engineering, Manufacturing and Service teams to ensure the HPC Platforms effectively meet their requirements.

**Qualifications/Skills**:

- 3+ years' experience in managing and mentoring teams.
- Knowledge of the Linux hardware ecosystem centered around CPU, GPU and PCIe architecture.
- Deep understanding of Linux operating systems and networking, with practical experience in tuning HPC workloads.
- Experience with configuration management and automation tools, such as Chef, Ansible, Salt and Packer.
- Experience with building monitoring and alerting on logs and metrics with excellent troubleshooting and analytical skills.
- Experience with and a strong understanding of containers (Docker/Singularity). Container orchestration with Kubernetes is a plus.
- Maintain a grounded approach, making decisions based on data and strategic goals rather than emotions, and clearly articulate those decisions.
- International travel a couple of times a year will be required.

**Minimum Qualifications**

- Engineering degree (preferably CS or CE).
- Experience working with HPC technologies.

We offer a competitive, family friendly total rewards package. We design our programs to reflect our commitment to an inclusive environment, while ensuring we provide benefits that meet the diverse needs of our employees.

KLA is proud to be an equal opportunity employer.




  • Singapore KLA-Belgium Full time

    Company Overview KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA...


  • Singapore beBeeinfrastructure Full time $80,000 - $120,000

    **System Engineer Role Overview** Main Responsibilities: Implement and support AI/HPC infrastructure solutions. Develop project documentation, including design statements, as-built documents, performance tests, system integration tests, and user acceptance tests. Lead projects or collaborate with project managers to manage project deliverables and...


  • Singapore beBeeInfrastructure Full time

    Job Overview We are seeking an experienced professional to lead the deployment and implementation of AI/HPC infrastructure solutions. This includes servers, virtualization, storage, networking, and AI/ML/HPC software stack. The ideal candidate will have a strong background in Linux server OS installation, configuration, hardening, and networking,...


  • Singapore beBeeInfrastructure Full time $80,000 - $120,000

    System Architect Job Summary: We are seeking a skilled System Architect to design and implement high-performance computing and artificial intelligence infrastructure solutions. Key Responsibilities: Design, implementation, and support of AI/HPC infrastructure solutions including servers, virtualization, storage, networking, and AI/ML/HPC software stack. Project...


  • Singapore beBeeAIengineer Full time $80,000 - $120,000

    Unlock the Power of AI and HPC with Our Team. We're seeking a highly motivated System Engineer to join our fast-growing team, where you'll play a pivotal role in developing and leading our AI and HPC business. Your primary goal will be to understand our clients' and partners' overall objectives and demonstrate how our technology aligns with their goals. To...

  • HPC System Architect

    2 weeks ago


    Singapore beBeeExpert Full time $90,000 - $120,000

    We are seeking a seasoned expert to lead the administration and operation of our Linux-based High-Performance Computing (HPC) environment. This role involves providing hands-on support for HPC system software, troubleshooting issues across hardware, software, OS, and networking layers, and collaborating with engineers to support AI/deep learning...


  • Singapore beBeeHighPerformance Full time $120,000 - $140,000

    Job Overview: We are seeking a highly experienced and driven High-Performance Computing (HPC) professional to support our Linux-based HPC environment. This is an exceptional opportunity for individuals with a passion for HPC system administration, user support, and collaboration with software engineers to support AI/deep learning applications and desktop...


  • Singapore beBeeInfrastructure Full time $90,000 - $120,000

    AI Infrastructure Specialist: NVIDIA is seeking a highly skilled AI Infrastructure Specialist to join its team. As an expert in high-performance computing, you will be responsible for designing and implementing large-scale AI/HPC projects. This role involves collaborating with customers, partners, and internal teams to analyze, define, and implement complex AI...


  • Singapore Nebius Full time

    Overview: Senior Cloud Support Engineer (HPC, AI/ML) role at Nebius. Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build...