AI Infrastructure Support Specialist

7 days ago


Singapore beBeeInfrastructure Full time $80,000 - $120,000

Job Title: AI Infrastructure Support Specialist

About the Role:

We are seeking a skilled and experienced individual to join our team as an AI Infrastructure Support Specialist. In this role, you will be responsible for supporting the daily operations and maintenance of our AI-accelerated high-performance computing (HPC) infrastructure.

Key Responsibilities:

  • Deploy, configure, and maintain various high-end GPU servers, storage servers, networking equipment, and software components in highly secure environments.
  • Perform hardware diagnostics, systems functionality, and firmware updates as required.
  • Collaborate with engineering teams to assist in tailored customer environment deployments, including bare-metal systems, HPC Clusters, Kubernetes, and Slurm.
  • Serve as the first line of engineering support for onsite operational issues, including troubleshooting hardware, network, and software problems.
  • Troubleshoot incidents, escalate critical issues, and provide feedback to appropriate teams for improvements.
  • Participate in an on-call rotation to ensure 24/7 availability and responsiveness to critical issues.
  • Provide technical support to the GOC Support Specialist team in troubleshooting HPC-related problems.
  • Document incident details, resolutions, and lessons learned to enhance future problem-solving.
  • Maintain clear, accurate, and up-to-date documentation to promote effective knowledge sharing across the team.
  • Communicate effectively with GOC, HPC Engineers, internal teams, stakeholders, and end-users to ensure alignment on issue resolution.

Required Skills and Qualifications:

  • Bachelor's degree in computer engineering, computer science, or a related technical field.
  • 5+ years of experience in field service technical areas.
  • Strong understanding of server hardware technology, Linux environments, and troubleshooting hardware problems, with adherence to physical and system-level security standards.
  • Experience with scripting languages, such as Bash and Python.
  • Familiarity with using workload manager and cluster software, such as Slurm, Kubernetes, Nvidia BCM, Prometheus, Grafana, ELK, etc.
  • Excellent problem-solving and analytical skills.
  • Ability to work independently and as part of a team.
  • Strong communication skills, both written and verbal.

Benefits:

  • Full-time employment basis.
  • Diverse and inclusive workplace culture.
  • Opportunities for professional growth and development.


  • Singapore beBeeInfrastructurespecialist Full time $80,000 - $120,000

    Job Title: AI Infrastructure SpecialistWe are seeking a skilled AI infrastructure specialist to join our team. As an AI infrastructure specialist, you will be responsible for deploying, testing, and maintaining AI systems on-site.The ideal candidate will have a strong background in computer science or a related field, with experience in operating and...


  • Singapore beBeeinfrastructure Full time $90,000 - $130,000

    Job Title: AI Infrastructure SpecialistAbout the Role:This is an exciting opportunity for a highly skilled AI Engineer to join our team as an AI Infrastructure Specialist. As a key member of our infrastructure team, you will be responsible for designing, building, and maintaining scalable and efficient AI/ML platforms. Your expertise in cloud computing,...


  • Singapore beBeeArtificialintelligence Full time $96,000 - $144,000

    Job Title: AI Infrastructure VisionaryWe are seeking an experienced AI Infrastructure Visionary to lead our strategy, design, and delivery of advanced AI-driven solutions for infrastructure projects. As a key member of our team, you will be responsible for designing robust, scalable AI/ML architectures, model pipelines, data ingestion, and deployment...


  • Singapore beBeeNetwork Full time $90,000 - $120,000

    As a senior network architect, you will be responsible for designing and deploying scalable, secure, and high-performance network infrastructure for artificial intelligence environments.The ideal candidate will have expertise in AI networks and take technical ownership of the design, deployment, and optimization of our network infrastructure.This role offers...


  • Singapore beBeeItspecialist Full time $100,000 - $120,000

    Job OverviewThe role of an IT & AI Operations Specialist is a key position that oversees daily IT operations, ensuring the stability and security of our infrastructure. This specialist leverages AI-powered platforms to enhance user support, automate routine tasks, and analyze systems for proactive IT management.This position requires technical proficiency in...


  • Singapore beBeeItspecialist Full time

    Job Overview The role of an IT & AI Operations Specialist is a key position that oversees daily IT operations, ensuring the stability and security of our infrastructure. This specialist leverages AI-powered platforms to enhance user support, automate routine tasks, and analyze systems for proactive IT management. This position requires technical...


  • Singapore beBeeDeveloper Full time $90,000 - $120,000

    Job DescriptionWe are seeking a skilled and innovative AI Engineer to join our team. The successful candidate will be responsible for managing AI platforms, building robust data ingestion and transformation pipelines, collaborating with data scientists, and integrating AI solutions with existing platforms and business systems.Key Responsibilities:Manage AI...


  • Singapore beBeeArtificialIntelligence Full time $180,000 - $250,000

    AI Architect Job DescriptionA visionary AI Architect will lead the strategy, design, and delivery of advanced AI-driven solutions to transform how infrastructure is designed and delivered.Key Responsibilities:Design robust, scalable AI/ML architectures, including model pipelines, data ingestion, and deployment frameworks tailored to infrastructure...


  • Singapore beBeeArtificialIntelligence Full time

    AI Architect Job Description A visionary AI Architect will lead the strategy, design, and delivery of advanced AI-driven solutions to transform how infrastructure is designed and delivered. Key Responsibilities: Design robust, scalable AI/ML architectures, including model pipelines, data ingestion, and deployment frameworks tailored to infrastructure...


  • Singapore beBeeInfrastructure Full time $120,000 - $150,000

    Job Title: AI Infrastructure Deployment SpecialistWe are seeking a skilled professional to implement and manage CI/CD pipelines for AI models and GPU-accelerated applications.Implement and manage continuous integration and deployment (CI/CD) pipelines for AI models and GPU-accelerated applications.Automate infrastructure and application deployments on...