Site Reliability Engineer

18 hours ago


Singapore NodeFlair Full time

**Job Summary**:
**Salary**
S$10,000 - S$20,000 / Monthly

**Job Type**
Full time

**Seniority**

Mid

**Years of Experience**
At least 5 years

**Tech Stacks**
Combine Linux Python

**About TikTok**

TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo.

**Why Join Us**

Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.

Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day.

To us, every challenge, no matter how difficult, is an opportunity: to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.

At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the communities we serve.

Join us.

**About the Team**

The e-Commerce industry has seen tremendous growth in recent years and has become a hotly contested space amongst leading Internet companies; its future growth should not be underestimated. With millions of loyal users globally, we believe TikTok is an ideal platform to deliver a brand new and better e-Commerce experience to our users. Our product engineering team is responsible for building an e-Commerce ecosystem that is innovative, secure and intuitive for our users. We are looking for passionate and talented people to join us as we drive the future of e-Commerce here at TikTok.

Our site reliability engineers (SREs) combine software and systems engineering disciplines to run high-performance, large-scale distributed infrastructure. This means you will be deeply involved in the development lifecycle of critical software services, collaborating closely with product engineers and applying both software and systems knowledge to ensure that TikTok e-Commerce's services are reliable, fault-tolerant, efficiently scalable and cost-effective. You will also apply your software engineering expertise to develop platforms and tools that optimise the operational and engineering efficiency of complex systems at scale, with a particular focus on improving the systems' observability, performance and maintainability.

**Responsibilities**
- Be responsible for the service levels of a mission-critical, revenue-generating e-Commerce platform as well as all supporting infrastructure and services. This role will focus on service reliability, highly scalable design, and release management in a cloud-native environment.
- Define service level indicators and data-driven objectives, and develop SRE standards, processes and methodologies, to uphold and improve the uptime, latency, and system health of a core global e-Commerce production platform.
- Collaborate across teams with engineering and product to ensure that key stability and maintainability activities, such as capacity planning and launch reviews, are carried out to enable transparent service delivery to customers.
- Design strategies for risk detection and mitigation, disaster recovery and simulation, release management, cost optimisation, engineering quality, etc.
- Build automation geared towards infrastructure-as-code, scalability and service resiliency.
- Implement best practices for incident management and post-mortems while taking part in on-call rotations.
- Research, design, and develop computer and network software or specialised utility programs.
- Update software, enhance existing software capabilities, and develop and direct software testing and validation procedures.

**Qualifications**
- Bachelor's or higher degree in Computer Science, Information Technology, Programming & System Analysis, Science (Computer Studies) or related discipline.
- Experience in designing, analyzing, and troubleshooting large-scale distributed systems.
- Experience developing platforms and tools using scripting languages such as Python or Bash.
- Systematic problem-solving approach, coupled with effective communication skills and a sense of drive.
- Experience with running production-grade web services at scale in a cloud-native environment.
- Experience with implementing observability solutions such as monitoring, logging and tracing in complex service meshes.

TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.



  • Singapore Beijing Foreign Enterprise Management Consultants Co.,Ltd. Full time

    On behalf of Huawei, a world-renowned information and communication technology company, we are seeking passionate and talented individuals to join our team as Site Reliability Engineer. Job...


  • Singapore ETEAM WORKFORCE PTE. LTD. Full time

    Position: Site Reliability Engineer (SRE) Work Mode - Onsite/Hybrid Timing - 9am to 6 pm Duration – 1 Year (Highly extendable) Salary: 6018 SGD Work Location: Robinson Road, Singapore About the Role We are looking for a seasoned Site Reliability Engineer (SRE) with 5+ years of experience to join our Platform Engineering team. This role is ideal for someone...


  • Singapore JJ Consulting Services Full time

    Our Client is a fast growing company in Singapore, who is seeking to recruit a Site Reliability Engineer. **Site Reliability Engineer** **Key Roles & Responsibilities** - Providing ancillary support of Enterprise-Grade Products and solutions at customer's sites - Ironing out deployment issues or challenges that our customers may face - Responsible for...


  • Singapore Qlik Full time

    **What makes us Qlik?** A Gartner® Magic Quadrant Leader for 14 years in a row, Qlik transforms complex data landscapes into actionable insights, driving strategic business outcomes. Serving over 40,000 global customers, our portfolio leverages pervasive data quality and advanced AI/ML capabilities that lead to better decisions, faster. We excel in...


  • Singapore eTeam Full time

    Site Reliability Engineer (SRE) We are looking for a seasoned Site Reliability Engineer (SRE) with 5–10 years of experience to join our Platform Engineering team. This role is ideal for someone who thrives in a fast‑paced environment, is passionate about reliability, and enjoys solving complex challenges. You will play a key role in building and...


  • Singapore Adyen Full time

    **This is Adyen** Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition. For our teams, we create an environment with opportunities for our people to succeed, backed by the...


  • Singapore Bohan Group Full time

    As a Site Reliability Engineer (SRE), you will drive operational excellence by combining deep technical knowledge with a strong focus on automation and tooling. **Your responsibilities will include**: - Designing and implementing core components of a robust SRE framework across both new and legacy systems. - Partnering with development and quantitative...


  • Singapore HCLTech Full time

    We are seeking a highly experienced Site Reliability Engineer (SRE) with 10 years of expertise in building, managing, and optimizing reliable, scalable, and secure systems. This role requires strong proficiency in end-to-end SRE practices across multi-cloud, hybrid cloud, and on-premises data center environments. The ideal candidate will drive automation,...

