Cloud Native Computing Platform SRE Engineer

2 weeks ago


Singapur, Singapore Tencent Full time

Technology Engineering Group (TEG) is responsible for supporting the company and its business groups on technology and operational platforms, as well as the construction and operation of R&D management and data centers. TEG provides users with a full range of customer services. As the operator of the largest networking, devices, and data center in Asia, TEG also leads the Tencent Technology Committee in strengthening infrastructure R&D through internal and distributed open source collaboration, constructing new platforms and supporting business innovation.

What The Role Entails
  • Responsible for daily operations, hardware/software troubleshooting, and optimization of GPU/CPU computing infrastructure to enhance resource efficiency and service reliability.
  • Manage and operate Kubernetes clusters and ML platforms, including monitoring/alerting, version upgrades, disaster recovery optimization, and security drills to ensure system high availability and maintainability.
  • Drive automation of operational workflows covering resource management, change control, self-healing solutions, and user tools.
Who We Look For
  • Proficient in GPU/ML principles and cloud platforms (eg. AWS); Hands-on experience in GPU hardware/drivers, CUDA, NCCL, and Mellanox network operations/optimization; Data center experience preferred.
  • Familiar with cloud native container technologies and disaster recovery solutions; Practical Docker/Kubernetes operations experience required.
  • Skilled in Linux/Shell environments; Proficient in ≥1 language (Go/Python/Java); Adept at leveraging automation/AI-driven methods to further enhance service stability and efficiency.
  • Strong accountability and self-motivation; Excellent learning/communication skills with demonstrated logical analysis, abstraction capabilities, and teamwork spirit.
Equal Employment Opportunity at Tencent

As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.

Seniority level
  • Mid-Senior level
Employment type
  • Full-time
Job function
  • Information Technology
Industries
  • Software Development
#J-18808-Ljbffr
  • Cloud Engineer

    2 weeks ago


    Singapur, Singapore PERSOLKELLY SINGAPORE PTE. LTD. Full time

    Overview Cloud Engineer (SRE – Level 2) – Why we need you. Operating secure, compliant, and highly available cloud environments is critical for our client. We need someone who can maintain, optimize, and scale multi-service cloud infrastructure while ensuring uptime, audit readiness, and regulatory compliance. We’re looking for a Cloud Engineer (SRE...

  • Cloud SRE Engineer

    4 days ago


    Singapur, Singapore OCBC Full time

    Join to apply for the Cloud SRE Engineer - Linux role at OCBC Who We AreAs Singapore’s longest established bank, we have been dedicated to enabling individuals and businesses to achieve their aspirations since 1932. How? By taking the time to truly understand people. From there, we provide support, services, solutions, and career paths that meet their...

  • Cloud SRE Engineer

    4 weeks ago


    Singapur, Singapore OCBC Full time

    Join to apply for the Cloud SRE Engineer - Linux role at OCBC 2 days ago Be among the first 25 applicants Join to apply for the Cloud SRE Engineer - Linux role at OCBC Who We AreAs Singapore’s longest established bank, we have been dedicated to enabling individuals and businesses to achieve their aspirations since 1932. How? By taking the time to truly...


  • Singapur, Singapore Barings Full time

    Overview Cloud Platform Site Reliability Engineer – Barings. We are seeking a highly motivated and skilled professional to design, implement, and maintain Cloud infrastructure solutions for enterprise-level organizations. The role combines cloud engineering and operations with a focus on reliability, performance, monitoring, security, and cloud platform...

  • Cloud Engineer

    1 week ago


    Singapur, Singapore APBA TG HUMAN RESOURCE PTE. LTD. Full time

    Job Summary We are seeking a skilled and experienced Cloud Engineer (SRE – Level 2) to support a secure and scalable cloud infrastructure for a Singapore Government-appointed agency operating on commercial cloud platforms. This Subject Matter Expert (SME) role requires real-world experience in managing multi-service cloud environments using AWS, strong...


  • Singapur, Singapore Charterhouse Partnership | Asia Full time

    Overview Job Summary: We are seeking a visionary and technically hands-on Head of Cloud and Platform to lead the design, development, and operations of our cloud infrastructure and platform services. The ideal candidate will have deep experience in cloud-native development , Terraform for infrastructure-as-code , and CI/CD pipeline automation , with a...


  • Singapur, Singapore PERSOLKELLY SINGAPORE PTE. LTD. Full time

    Overview Site Reliability Engineer (SRE) — An excellent opportunity in a cutting-edge, fast-growing cloud environment. Job Purpose Job Purpose: Deliver reliable, secure, and scalable cloud services by managing and optimizing AWS infrastructure. Responsibilities Manage and support AWS services, ensuring uptime, performance, and security compliance....


  • Singapur, Singapore ByteDance Full time

    Site Reliability Engineer, Traffic Platform - Traffic SRE - 2025 Start Join to apply for the Site Reliability Engineer, Traffic Platform - Traffic SRE - 2025 Start role at ByteDance Site Reliability Engineer, Traffic Platform - Traffic SRE - 2025 Start 2 days ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer, Traffic...


  • Singapur, Singapore Nutanix Full time

    Join to apply for the Cloud Native Sales Specialist role at Nutanix 2 days ago Be among the first 25 applicants Join to apply for the Cloud Native Sales Specialist role at Nutanix The OpportunityAre you a strategic thinker with a track record in enterprise sales, strong connections with CXOs, and a passion for cloud-native solutions? If so, you’ll want to...


  • Singapur, Singapore Dropsuite Full time

    Senior Manager – Site Reliability Engineering (SRE) Join to apply for the Senior Manager – Site Reliability Engineering (SRE) role at Dropsuite Senior Manager – Site Reliability Engineering (SRE) 1 day ago Be among the first 25 applicants Join to apply for the Senior Manager – Site Reliability Engineering (SRE) role at Dropsuite Get AI-powered...