Site Reliability Engineer-Experienced(A06044)

2 weeks ago


Singapore Xiaomi Full time
Site Reliability Engineer-Experienced(A06044)

Get AI-powered advice on this job and more exclusive features.

Direct message the job poster from Xiaomi Technology

1. Ensure the stability, reliability, and efficient operation of the Xiaomi's global business, maintaining high availability of services at all times.

2. Responsible for core operational tasks such as resource provisioning and management, incident response, capacity management, monitoring, and reliability improvements.

3. Review technical architecture design, assess soundness of the design, and proactively identify and resolve reliability risks.

4. Conduct in-depth analysis of systemic deficiencies, identify bottlenecks and develop optimization strategies; plan and execute projects to improve system reliability and ensure cost-effectiveness and highly availability of the systems.

5. Participate in 24/7 on-call rotation, promptly respond to and resolve production incidents to ensure service availability.

6. Analyze and improve processes to build stable, highly available systems; drive continuous automation improvements, and minimize manual intervention.

  • Job Requirements

1. Proficiency in one of the following programming languages: Python, Go, or shell scripting, with demonstrated ability to independently develop modules or platforms.

2. Familiar with cloud computing; experience in managing multi-cloud or hybrid cloud platforms (e.g., Alibaba Cloud, Azure, AWS) is preferred.

3. Strong foundation in computer science, with hands-on experience in Linux, networking, load balancing, and designing high-availability and disaster recovery architectures.

4. A good team player with a strong sense of responsibility, self-driven and highly motivated.

5. Minimum 3 years of working experience in operations and maintenance of large-scale web services is preferred; hands-on experience in managing or operating large-scale web services or projects is a plus.

6. Fluent in Mandarin (spoken) is a plus.

Seniority level
  • Seniority levelMid-Senior level
Employment type
  • Employment typeFull-time
Job function
  • Job functionBusiness Development
  • IndustriesSoftware Development

Referrals increase your chances of interviewing at Xiaomi Technology by 2x

Site Reliability Engineer Intern - 2025 StartProduction Engineer / Site Reliability EngineerSite Reliability Engineer (EMEA, Japan, Singapore, Australia)Software Engineer Intern, Dev Infra - 2025 StartSite Reliability Engineer-(Fresh-Grad)(A98145)Backend Software Engineer, TikTok Eng Privacy and Security(Location) Intern - 2025 StartSoftware Development Engineer in Test Intern , TikTok - 2025 StartPlatform Engineer, Operations & TechnologySite Reliability Engineer (SRE) (GovTech)

We're unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

#J-18808-Ljbffr

  • Singapore Xiaomi Full time

    Site Reliability Engineer-Experienced(A06044)Get AI-powered advice on this job and more exclusive features. Direct message the job poster from Xiaomi Technology Ensure the stability, reliability, and efficient operation of the Xiaomi's global business, maintaining high availability of services at all times. Responsible for core operational tasks such as...


  • Singapore ABAXX SINGAPORE PTE. LTD. Full time

    Site Reliability Engineer - Networking We are seeking competent candidate joining our Infrastructure Team for the mission building and operating MAS regulated marketplace and clearing house. This role is ideal for someone with a strong foundation in AWS services, infrastructure as code, and cloud security, who is passionate about building scalable, secure,...


  • Singapore Adyen Full time

    **This is Adyen** Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition. For our teams, we create an environment with opportunities for our people to succeed, backed by the...


  • Singapore NodeFlair Full time

    **Job Summary**: **Salary** S$11,500 - S$16,500 / Monthly **Job Type** **Seniority** Senior **Years of Experience** At least 7 years **Tech Stacks** Microsoft Puppet Java Ansible Python **This is Adyen** Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the...


  • Singapore beBeeReliability Full time $90,000 - $120,000

    **Reliability Professional Wanted**We are seeking an experienced reliability professional to join our team as a Reliability Engineer. The ideal candidate will have a strong background in mechanical engineering and experience in the oil and gas industry.Main Responsibilities:To provide on-site support and manage the reliability of mechanical seals for...


  • Singapore Hyphen Connect Full time

    Site Reliability Engineer (Crypto Trading) Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect Site Reliability Engineer (Crypto Trading) 2 days ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect We are hiring for one of our ecosystem projects in...


  • Singapore NodeFlair Full time

    **Job Summary**: **Salary** S$5,500 - S$9,500 / Monthly **Job Type** **Seniority** Mid **Years of Experience** At least 3 years **Tech Stacks** AWS Shell Script Shell Java Linux Python Description: Looking for an experienced individual joining our Site Reliability Engineer team. The individual will support production monitoring and is expected to be...


  • Singapore Shopify Full time

    Site Reliability Engineer (EMEA, Japan, Singapore, Australia) Join to apply for the Site Reliability Engineer (EMEA, Japan, Singapore, Australia) role at Shopify . Overview We are not here to play zero-sum games. Shopify Engineering is focused on building the best product for our Merchants. You will enable entrepreneurship and create new value for the...


  • Singapore TRUEWATCH TECHNOLOGY INC PTE. LTD. Full time

    **Responsibility**: - Run production environment by monitoring availability and taking a holistic view of the system health. - Achieve site reliability automation, minimize system downtime, and reduce site reliability cost. - Manage risks and resolves issues that affect the release scope, schedule and quality. - Suggest architecture improvements, push for...


  • Singapore Hyphen Connect Full time

    Site Reliability Engineer (Crypto Trading) Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect Site Reliability Engineer (Crypto Trading) 2 days ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect We are hiring for one of our ecosystem...