HPC AI Infrastructure Hardware Manager

1 day ago


Singapore KLA Full time

Company Overview
KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents systems and solutions for the manufacturing of wafers and reticles, integrated circuits, packaging, printed circuit boards and flat panel displays. The innovative ideas and devices that are advancing humanity all begin with inspiration, research and development. KLA invests more heavily than average in innovation, putting 15% of sales back into R&D. Our expert teams of physicists, engineers, data scientists and problem-solvers work together with the world’s leading technology providers to accelerate the delivery of tomorrow’s electronic devices. Life here is exciting and our teams thrive on tackling really hard problems. There is never a dull moment with us.
Group/Division
With over 40 years of semiconductor process control experience, chipmakers around the globe rely on KLA to ensure that their fabs ramp next-generation devices to volume production quickly and cost-effectively. Enabling the movement towards advanced chip design, KLA's Global Products Group (GPG), which is responsible for creating all of KLA’s metrology and inspection products, is looking for the best and the brightest research scientists, software engineers, application development engineers, and senior product technology process engineers. The LS-SWIFT Division of KLA’s Global Products Group provides patterned wafer inspection systems for high-volume semiconductor manufacturing. Its mission is to deliver market-leading cost of ownership in defect detection for a broad range of applications in the production of semiconductors. Customers from the foundry, logic, memory, automotive, MEMS, advanced packaging and other markets rely upon high-sample wafer inspection information generated by LS-SWIFT products. LS (Laser Scanning) systems enable cost-effective patterned wafer defect detection for the industry’s most sophisticated process technologies deployed in leading-edge foundry, logic, DRAM, and NAND fabs. SWIFT (Simultaneous Wafer Inspection at Fast Throughput) systems deliver all-wafer-surface (frontside, backside, and edge) macro inspection that is critical for automotive IC, MEMS, and advanced packaging processes as well as foundry/logic and memory fabs. LS-SWIFT operates from a global footprint that includes the US, Singapore, India and Germany, and serves a worldwide customer base across Asia, Europe and North America.
Job Description/Preferred Qualifications
The ideal candidate will have a strong understanding of HPC infrastructure, experience in deriving hardware specs based on requirements, and proficiency in product lifecycle management. They will engage with teams to understand their requirements, drive development for our HPC platforms, and collaborate with other teams for integration. The candidate should also have expertise in hardware system design, Linux systems administration, container orchestration, networking, security, diagnostics tooling and performance tuning. Experience integrating, testing, and optimizing HPC with storage and data platforms is also essential.
Principal Responsibilities:
Drive team growth and development, providing mentorship and support to team members.
Ensure the successful execution of projects, meeting deadlines and delivering high-quality results.
Work with various OEMs to understand their product offerings and roadmaps in order to create optimal HPC solution offerings.
Collaborate with other sub-system teams on developing HPC Cluster Roadmaps that meet Product Requirements.
Collaborate within customer-focused teams to design, develop, test, and deploy embedded HPC infrastructure in alignment with business needs.
Foster strong relationships with Product and Program Management, Software Engineering, Manufacturing and Service teams to ensure the HPC platforms effectively meet their requirements.
Qualifications/Skills:
3+ years’ experience in managing and mentoring teams.
Knowledge of the Linux hardware ecosystem centered around CPU, GPU and PCIe architecture.
Deep understanding of Linux operating systems and networking, with practical experience in tuning HPC workloads.
Experience with configuration management and automation tools, such as Chef, Ansible, Salt, and Packer.
Experience with building monitoring and alerting on logs and metrics, with excellent troubleshooting and analytical skills.
Experience with and a strong understanding of containers (Docker/Singularity). Container orchestration with Kubernetes is a plus.
Maintain a grounded approach, making decisions based on data and strategic goals rather than emotions, and clearly articulate those decisions.
International travel a couple of times a year will be required.
Minimum Qualifications
Engineering degree (preferably CS or CE).
Experience working with HPC Technologies.
We offer a competitive, family friendly total rewards package. We design our programs to reflect our commitment to an inclusive environment, while ensuring we provide benefits that meet the diverse needs of our employees.
KLA is proud to be an equal opportunity employer.
Be aware of potentially fraudulent job postings or suspicious recruiting activity by persons that are currently posing as KLA employees. KLA never asks for any financial compensation to be considered for an interview, to become an employee, or for equipment. Further, KLA does not work with any recruiters or third parties who charge such fees either directly or on behalf of KLA. Please ensure that you have searched KLA’s Careers website for legitimate job postings. KLA follows a recruiting process that involves multiple interviews in person or on video conferencing with our hiring managers. If you are concerned that a communication, an interview, an offer of employment, or that an employee is not legitimate, please send an email to to confirm the person you are communicating with is an employee. We take your privacy very seriously and confidentially handle your information.
Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Engineering and Information Technology
Industries: Semiconductor Manufacturing
Principal Manufacturing Engineer, Hardware Engineering
Commodity Management Manager (Individual Contributor)
Senior Officer, Computer Operator, Group Infrastructure & Platform Services
Linux Engineering Manager - Optimisation for Latest Hardware
Software Engineering Manager - Desktop and Embedded Linux Optimisation
Software Engineering Manager - Ubuntu Linux Kernel
Solutions Architect - Professional Services - Asia
Signalling Hardware Design Lead Engineer
Solution Architect - Cyber Security Hardware - Singapore
Senior Mechanical Design Engineer, Digital Microscope, LCM
Senior Research Engineer I (Satellite System Hardware)
Machine Learning Architect (Software/Hardware Co-Design)
Research Associate (High Temperature-Resistant & Miniaturized Payload Hardware Design)



  • Singapore KLA-Belgium Full time

    Company Overview KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA...


  • Singapore beBeeinfrastructure Full time $80,000 - $120,000

    System Engineer Role Overview. Main Responsibilities: Implement and support AI/HPC infrastructure solutions. Develop project documentation, including design statements, as-built documents, performance tests, system integration tests, and user acceptance tests. Lead projects or collaborate with project managers to manage project deliverables and...


  • Singapore beBeeInfrastructure Full time $100,000 - $140,000

    Job Overview We are seeking an experienced professional to lead the deployment and implementation of AI/HPC infrastructure solutions. This includes servers, virtualization, storage, networking, and AI/ML/HPC software stack. The ideal candidate will have a strong background in Linux server OS installation, configuration, hardening, and networking,...


  • Singapore beBeeInfrastructure Full time

    Job Overview We are seeking an experienced professional to lead the deployment and implementation of AI/HPC infrastructure solutions. This includes servers, virtualization, storage, networking, and AI/ML/HPC software stack. The ideal candidate will have a strong background in Linux server OS installation, configuration, hardening, and networking,...


  • Singapore beBeeHardware Full time $150,000 - $200,000

    Job Description: We are seeking an experienced Hardware Manager to join our team. The ideal candidate will have a strong understanding of HPC infrastructure, experience in deriving hardware specs based on requirements, and proficiency in product lifecycle management. This role involves working with teams to understand their requirements, driving development...


  • Singapore beBeeHpc Full time $90,000 - $120,000

    We are seeking a highly experienced and driven HPC Professional to support our large-scale computing environment which includes clusters, storage systems, and high-speed networking used by researchers, staff, and students. Lead the administration and operation of HPC clusters, storage systems, and high-speed networks. Provide hands-on support for HPC system...


  • Singapore beBeeHpcEngineer Full time $90,000 - $120,000

    Role Overview: Job Title: Professional Services Engineer. Reports to: Infrastructure Specialist Team Lead. NVIDIA seeks an experienced and skilled Professional Services Engineer to join its dynamic team. The successful candidate will be responsible for assisting customers with the installation, onboarding, and optimization of NVIDIA's AI/HPC...


  • Singapore beBeeInfrastructure Full time $80,000 - $120,000

    Job Title: AI Infrastructure Support Specialist. About the Role: We are seeking a skilled and experienced individual to join our team as an AI Infrastructure Support Specialist. In this role, you will be responsible for supporting the daily operations and maintenance of our AI-accelerated high-performance computing (HPC) infrastructure. Key...


  • Singapore A*STAR Agency for Science, Technology and Research Full time

    Provide expert advice to onboard new users to NSCC's systems. Engage with new researchers, communities and disciplines with data-intensive computing. Translate user requirements into optimal computational work plans. Assist in the design of NSCC's HPC/AI systems, including benchmarking NSCC workloads on various platforms and recommending the most suitable...


  • Singapore beBeeInfrastructure Full time $80,000 - $120,000

    System Architect Job Summary. We are seeking a skilled System Architect to design and implement high-performance computing and artificial intelligence infrastructure solutions. Key Responsibilities: Design, implementation, and support of AI/HPC infrastructure solutions including servers, virtualization, storage, networking, and AI/ML/HPC software stack. Project...