
Data Centre Engineer, Field Operations
2 weeks ago
Overview
Firmus Technologies is seeking a skilled Data Centre Engineer to join our Operations team, supporting the daily operations and maintenance of our AI-accelerated high-performance computing (HPC) infrastructure. This role will work closely with Field Service Engineers, HPC and Network Engineering teams, and assist the Global Operations Centre (GOC). This is a unique opportunity to contribute directly to the stability and growth of cutting-edge AI infrastructure.
Responsibilities- Support in the deployment, configuration, and maintenance of various high-end GPU servers, storage servers, networking equipment and software components in highly secure environments.
- Perform hardware diagnostics, systems functionality and firmware updates as required.
- Collaborate with engineering teams to assist in tailored customer environments deployment (eg: bare-metal systems, HPC Clusters, Kubernetes, Slurm etc).
- Serve as first line of engineering support for onsite operational issues, including troubleshooting hardware, network and software problems.
- Troubleshoot incidents, escalate critical issues and provide feedback to appropriate teams for improvements.
- Participate in an on-call rotation to ensure 24/7 availability and responsiveness to critical issues.
- Provide technical support to the GOC Support Specialist team in troubleshooting HPC-related problems.
- Document incident details, resolutions, and lessons learned to enhance future problem-solving.
- Maintain clear, accurate, and up-to-date documentation to promote effective knowledge sharing across the team.
- Communicate effectively with GOC, HPC Engineers, internal teams, stakeholders, and end-users to ensure alignment on issue resolution.
- Take part in team meetings and knowledge-sharing sessions to foster collaboration and continuous learning.
- Bachelor’s degree in computer engineering, computer science, or a related technical field.
- 5+ years of experience in field service technical areas.
- Strong understanding of server hardware technology, Linux environments and troubleshooting hardware problems, with adherence to physical and system-level security standards.
- Experience with scripting languages (eg: Bash, Python)
- Familiarity with using workload manager and cluster softwares (eg: Slurm, Kubernetes, Nvidia BCM) and Observability tools (eg: Prometheus, Grafana, ELK, etc)
- Excellent problem-solving and analytical skills.
- Ability to work independently and as part of a team.
- Strong communication skills, both written and verbal.
Full Time
At Firmus, we are committed to building a diverse and inclusive workplace. We encourage applications from candidates of all backgrounds who are passionate about creating a more sustainable future through innovative engineering solutions.
Join us in our mission to revolutionize the AI industry through sustainable practices and cutting-edge engineering. Apply now to be part of shaping the future of sustainable AI infrastructure.
#J-18808-Ljbffr-
Data Centre Engineer, Field Operations
3 days ago
Singapur, Singapore SMC Cloud Full timeOverview Data Centre Engineer, Field Operations role at SMC Cloud — SMC Cloud is seeking a skilled Data Centre Engineer to join the Operations team, supporting the daily operations and maintenance of AI-accelerated high-performance computing (HPC) infrastructure. This role will work closely with Field Service Engineers, HPC and Network Engineering teams,...
-
Data Centre Engineer
3 weeks ago
Singapur, Singapore Internal Security Department Full timeJoin to apply for the Data Centre Engineer role at Internal Security Department 2 weeks ago Be among the first 25 applicants Join to apply for the Data Centre Engineer role at Internal Security Department What The Role Is ISD confronts and addresses threats to Singapore’s internal security and stability. For over 70 years, ISD and its predecessor...
-
Data Centre Engineer
3 weeks ago
Singapur, Singapore Carlo Hefti AG Full timeWe are looking for an experienced Data Centre Engineer who can help our team keep Jane Street’s data safe and accessible around the clock, which will involve a mix of hands-on work in our data and colocation centres, process thinking and project management. Your work will mostly involve designing, building, scaling and performing daily upkeep in our...
-
Singapur, Singapore NTT DATA, Inc. Full timeOverview Cross Technology Service Delivery Field Support Engineer (L2) -1 at NTT T DATA, Inc. Join to apply for the Cross Technology Service Delivery Field Support Engineer (L2)-1 role at NTT DATA, Inc. Make an impact with NTT DATA — Join a company that is pushing the boundaries of what is possible. We are renowned for our technical excellence and...
-
Singapur, Singapore NTT DATA, Inc. Full timeCross Technology Service Delivery Field Support Engineer (L2) Join to apply for the Cross Technology Service Delivery Field Support Engineer (L2) role at NTT DATA, Inc. Cross Technology Service Delivery Field Support Engineer (L2) 1 week ago Be among the first 25 applicants Join to apply for the Cross Technology Service Delivery Field Support Engineer (L2)...
-
Data Centre Engineer
1 week ago
Singapur, Singapore TANGSPAC CONSULTING PTE LTD Full timeResponsibilities Possess good knowledge of data centre operation tasks and duties. Perform day-to- day data centre / computer operations duties (key management, escorting vendors, facilities infrastructure checks, degaussing, routine checks, desktop & laptop management) Strong ability to support activities in data centre and computer rooms Ensure data...
-
Data Centre Engineer
3 weeks ago
Singapur, Singapore TALENTSIS PTE. LTD. Full timeOverview Our client, a security solutions provider has been established for more than 25 years, with a strong reputation in the industry. They specialize in delivering comprehensive turn-key security systems, from design and integration to maintenance and support and serve clients in government, trade and commercial sectors. They are now looking for a Senior...
-
Security Engineer, Data Centre #catalystWSP
1 week ago
Singapur, Singapore Singtel Group Full timeSecurity Engineer, Data Centre #catalystWSP Singtel/Nxera x Singapore Institute of Technology have embarked on a Work-Study Programme (WSP) launched in Feb 2023. The WSP is called “The Catalyst Programme” which is a structured on-the-job (OJT) development WSP that allows Polytechnic Diploma holders to secure a full-time position with Singtel/Nxera while...
-
Singapur, Singapore StarHub Full timeData Centre Service Delivery and Assurance Engineer Join to apply for the Data Centre Service Delivery and Assurance Engineer role at StarHub Data Centre Service Delivery and Assurance Engineer Join to apply for the Data Centre Service Delivery and Assurance Engineer role at StarHub Responsible for 24 x 7 data centre service delivery operations covering...
-
Security Engineer, Data Centre #catalystWSP
3 weeks ago
Singapur, Singapore Singtel Group Full timeSelect how often (in days) to receive an alert: Security Engineer, Data Centre #catalystWSP Singtel/Nxera x Singapore Institute of Technology have embarked on a Work-Study Programme (WSP) launched in Feb 2023. The WSP is called “The Catalyst Programme” which is a structured on-the-job (OJT) development WSP that allows Polytechnic Diploma holders to...