Site Reliability Engineer

2 weeks ago


Singapore RUNSUN CLOUD PTE. LTD. Full time

**Responsibilities**:

- Maintain and support AI infrastructure.
- Monitor system performance and reliability.
- Troubleshoot and resolve customer issues.
- Collaborate with cross-functional teams to improve system capabilities.
- Ensure system security and compliance.
- Participate in an on-call rotation to handle emergency customer issues.

**Requirements**:

- 1-3 years of experience in a similar role.
- Strong Linux administration skills.
- Basic knowledge of networking and storage solutions.
- Proficient in using NVIDIA AI-related toolsets.
- Skilled in installing and debugging AI-related drivers and runtime environments.
- Familiarity with model training, tuning, and inference, with the ability to identify and troubleshoot issues that arise during operation.
- Excellent problem-solving skills.
- Ability to work in a team environment.

**Preferred Qualifications**:

- Experience with cloud platforms.
- Familiarity with container technologies like Docker.

**Language Requirements**:

- Fluency in English is mandatory; Mandarin is a plus.



  • Singapore IDEMIA Full time

    Purpose: This role plays a critical part in ensuring reliability, scalability, and performance of our systems and services. You will work closely with development and...


  • Singapore beBeeSiteReliability Full time $90,000 - $120,000

    Unlock Your Full Potential in Site Reliability Engineering. About the Role: This is an exciting opportunity to work with a global banking institution, leveraging your skills in production management and site reliability engineering to drive business growth. Develop and implement proactive, predictive models for shift production management using SRE...


  • Singapore DHATCH CONSULTANCY PTE. LTD. Full time

    Site Reliability Engineer. **Preferred Qualifications**: 3+ years of experience in site reliability engineering, DevOps, or software engineering roles. Proven skills in monitoring and alerting tools (Grafana, New Relic); CI/CD pipelines (Git, Jenkins, GitHub Actions, etc.); container orchestration (Docker, Kubernetes); infrastructure-as-code...


  • Singapore HCLTech Full time

    This role combines software and systems engineering to build, run, and maintain high-performance, distributed, fault-tolerant, and resilient financial systems. Site Reliability Engineers focus on ensuring a joyful customer journey. As a Site Reliability Engineer you will be filling a...


  • North-East Singapore PERSOLKELLY Full time

    The Site Reliability Engineer is responsible for ensuring the reliability, scalability, and efficiency of our systems and infrastructure. This role involves monitoring, troubleshooting, and resolving issues to maintain optimal performance. The engineer will also collaborate with cross-functional teams to automate processes and improve system reliability....


  • Singapore Manpower Singapore Full time

    Site Reliability Engineer Assistant (DevOps). Responsible for the operation and maintenance of online game marketing services, to ensure the continuous...