Site Reliability Engineer

3 weeks ago


Singapore, Singapore TikTok Full time

Overview

About the team
TikTok Shop is a content e-commerce business utilising international short video products as carriers. Our aim is to become the preferred choice for users seeking to discover and purchase affordable, high-quality products. We provide users with tailored, vibrant, and efficient consumption experiences while enabling merchants to access robust and dependable platform services in various scenarios, such as live e-commerce and short video content e-commerce. Our vision is to make affordable and high-quality products easily accessible, enhancing the quality of life for all. We are looking for passionate and talented people to join our product and operations team, to build an e-commerce ecosystem that is innovative, secure and intuitive for our users and brands.

This role combines software and systems engineering disciplines to run high-performance, large-scale distributed infrastructure. You will be deeply involved in the development lifecycle of critical software services, collaborating closely with product engineers to combine software code and systems knowledge so that TikTok Shop's services are reliable, fault-tolerant, efficiently scalable and cost-effective. You will also apply your software engineering expertise to develop platforms and tools that optimise the operational and engineering efficiency of complex systems at scale, with a particular focus on improving the systems' observability, performance and maintainability.

Responsibilities
  • Focus on the TikTok Shop business and provide SRE solutions that cater to actual business scenarios, based on cross-team, cross-timezone, and cross-region collaboration mechanisms.
  • Participate in building disaster recovery capabilities for TikTok Shop, offering end-to-end disaster recovery solutions to ensure the ability to switch over during extreme failure scenarios.
  • Continuously enhance the core capabilities of TikTok Shop SRE in terms of stability, efficiency, cost, and security, and participate in the operation of key metrics (including incident recall rate, SLI, MTTD, MTTR, resource utilization, etc.).
  • Promote the design and implementation of operation and maintenance tools and platform solutions to improve the infrastructure capabilities of the TikTok Shop platform.
  • Participate in on-call duty, respond to performance and availability issues, resolve problems, and minimize downtime as much as possible.

Qualifications
Minimum Qualifications:
  • Bachelor's or higher degree in Computer Science, Information Technology, Programming & System Analysis, Science (Computer Studies) or a related discipline.
  • At least 5 years of experience in one or more programming languages (such as Java, C++, Go), or scripting experience with Shell/Python.
  • Familiarity with the e-commerce business and with common network and access layer faults, along with relevant hands-on experience.
  • Professional knowledge of the operation, deployment, high availability, and quality assurance of large-scale distributed systems, with a strong sense of responsibility and strong problem analysis and solving skills.

About TikTok
TikTok is the leading destination for short-form mobile video. At TikTok, our mission is to inspire creativity and bring joy. TikTok's global headquarters are in Los Angeles and Singapore, and we also have offices in New York City, London, Dublin, Paris, Berlin, Dubai, Jakarta, Seoul, and Tokyo.

Why Join Us
Inspiring creativity is at the core of TikTok's mission. Our innovative product is built to help people authentically express themselves, discover and connect – and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and bring joy, a mission we work towards every day. We strive to do great things with great people.
We lead with curiosity, humility, and a desire to make an impact in a rapidly growing tech company. Every challenge is an opportunity to learn and innovate as one team. We're resilient and embrace challenges as they come. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our company, and our users. When we create and grow together, the possibilities are limitless. Join us.

Diversity & Inclusion
TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Engineering and Information Technology
Industries: Technology, Information and Internet



  • Singapore, Singapore NetEase Games Full time

    Overview: As a leading internet technology company based in China, NetEase provides premium online services centered around content creation and operates a broad gaming ecosystem. Job Description: Site Reliability Engineering (SRE) refers to using software engineering methods to manage...


  • Singapore, Singapore APPLE SOUTH ASIA PTE. LTD. Full time

    Summary At Apple, new ideas have a way of becoming excellent products, services, and customer experiences very quickly. Bring passion and dedication to your job and there’s no telling what you could accomplish. The people here at Apple don’t just build products - they craft the kind of wonder that’s revolutionized entire industries. It’s the...


  • Singapore, Singapore PERSOL SINGAPORE PTE. LTD. Full time

    Overview Site Reliability Engineer (SRE) – An excellent Site Reliability Engineer (SRE) opportunity is available in a cutting-edge, fast-growing cloud environment. Job Purpose Deliver reliable, secure, and scalable cloud services by managing and optimizing AWS infrastructure. Job Responsibilities Manage and support AWS services, ensuring uptime,...


  • Singapore, Singapore PERSOL SINGAPORE PTE. LTD. Full time

    Cloud Site Reliability Engineer (AWS) An excellent Cloud Site Reliability Engineer opportunity has just arisen in a global brand supporting mission‑critical government systems. Job Purpose Ensure reliable, secure, and automated cloud operations supporting mission‑critical systems and compliance needs. Responsibilities Manage and support AWS cloud...


  • Singapore, Singapore Crystal Equation Corporation Full time

    Overview: We are seeking a skilled Site Reliability Engineer (SRE) to join our team. The SRE will be responsible for keeping all internal user-facing applications and other production systems running smoothly. This hybrid role involves a combination of both development and operations skills to build and manage systems that are both efficient and reliable. The...


  • Singapore, Singapore Thales Full time

    Location: Singapore, Singapore. Overview: Thales is a global technology leader trusted by governments, institutions, and enterprises to tackle their most demanding challenges. From quantum applications and artificial intelligence to cybersecurity and 6G innovation, our solutions empower critical...


  • Singapore, Singapore E-Solutions Full time

    Job Title: Site Reliability Engineer (SRE) Experience: 8+ years (including 3+ years in Java) About the Role: We’re looking for a skilled Site Reliability Engineer with strong Java and cloud-native development experience to design, build, and maintain reliable, scalable systems on Kubernetes and AWS. You’ll work closely with development and platform teams...


  • Singapore, Singapore Razer Inc. Full time

    3 weeks ago. Joining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work, offering you the opportunity to make an impact globally while working across a team located across 5 continents. Razer is...


  • Singapore, Singapore Manpower Singapore Full time

    Site Reliability Engineer - Global Support. Responsibilities: Deploy and manage overseas games infrastructure, including game monitor system and login services. Monitor and dashboard game observability to ensure reliability, scalability, and security. Analyze game...


  • Singapore, Singapore EXASOFT PTE. LTD. Full time

    Job Summary: We are seeking a Senior Site Reliability Engineer (SRE) with 10–15 years of proven experience in building, managing, and maintaining highly available, scalable, and secure infrastructure across multi-cloud and hybrid cloud environments, including on-premises data centers. The ideal candidate will have deep knowledge of SRE principles,...