Site Reliability Engineer

7 days ago


Singapur, Singapore Tiktok Full time

Responsibilities About the team: TikTok Shop is a content e-commerce business utilising international short video products as carriers. Our aim is to become the preferred choice for users seeking to discover and purchase affordable, high-quality products. We provide users with tailored, vibrant, and efficient consumption experiences while enabling merchants to access robust and dependable platform services in various scenarios, such as live e-commerce and short video content e-commerce. Our vision is to make affordable and high-quality products easily accessible, enhancing the quality of life for all. We are looking for passionate and talented people to join our product and operations team, to build an e-commerce ecosystem that is innovative, secure and intuitive for our users and brands. This role combines software and systems engineering disciplines to run high-performance, large-scale distributed infrastructure. This means you will be deeply involved in the developmental lifecycle of critical software services, collaborating closely with product engineers to combine software code and systems knowledge to ensure that TikTok Shop's services are reliable, fault-tolerant, efficiently scalable and cost-effective. You will also be leveraging your software engineering expertise to develop software platforms and tools to optimise the operational and engineering efficiencies of complex systems at scale, with particular focus on improving the systems' observability, performance and maintainability. Focused on TikTok Shop business, provide SRE solutions that cater to actual business scenarios based on cross-team, cross-timezone, and cross-region collaboration mechanisms. Participate in building disaster recovery capabilities for TikTok Shop, offering end-to-end disaster recovery solutions to ensure the ability to switch over during extreme failure scenarios. Continuously enhance the core capabilities of TikTok Shop SRE in terms of stability, efficiency, cost, and security, and participate in the operation of key metrics (including incident recall rate, SLI, MTTD, MTTR, resource utilization, etc.). Promote the design and implementation of operation and maintenance tools and platform solutions to improve the infrastructure capabilities of the TikTok Shop platform. Participate in on-call duty, respond to performance and availability issues, resolve problems, and minimize downtime as much as possible. Qualifications Bachelor's or higher degree in Computer Science, Information Technology, Programming & System Analysis, Science (Computer Studies) or related discipline. Candidate should have at least 5 years of experience in one or more programming languages (such as Java, C++, Go) or scripting experience with Shell/Python. Familiarity with e business, common network and access layer faults and relevant construction experience. Professional knowledge in operation, deployment, high availability and quality assurance of large-scale distributed systems, with a strong sense of responsibility and strong problem analysis and solving skills. About TikTok TikTok is the leading destination for short-form mobile video. At TikTok, our mission is to inspire creativity and bring joy. TikTok's global headquarters are in Los Angeles and Singapore, and we also have in New York City, London, Dublin, Paris, Berlin, Dubai, Jakarta, Seoul and Tokyo. Why Join Us Inspiring creativity is at the core of TikTok's mission. Our innovative product is built to help people authentically express themselves, discover and connect – and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and bring joy – a mission we work towards every day. We strive to do great things with great people. We lead with curiosity, humility and a desire to make impact in a rapidly growing tech company. Every challenge is an opportunity to learn and innovate as one team. We’re resilient and embrace challenges as they come. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our company and our users. When we create and grow together, the possibilities are limitless. Join us. Diversity & Inclusion TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too. #J-18808-Ljbffr



  • Singapur, Singapore Qube Research & Technologies Full time

    Join to apply for the DevOps /Site Reliability Engineer role at Qube Research & Technologies Qube Research & Technologies (QRT) is a global quantitative and systematic investment manager, operating in all liquid asset classes across the world. We are a technology and data driven group implementing a scientific approach to investing. Combining data, research,...


  • Singapur, Singapore GroupBy Full time

    Overview Site Reliability Engineer GroupBy•Singapore About Rezolve Ai Rezolve Ai (NASDAQ: RZLV) is an industry leader in AI-powered solutions, specializing in enhancing customer engagement, operational efficiency, and revenue growth. The Brain Suite delivers advanced tools that harness artificial intelligence to optimize processes, improve decision-making,...


  • Singapur, Singapore Fastmarkets Full time

    Company Overview Fastmarkets is an industry-leading price-reporting agency (PRA) and information provider for global commodities, offering price data, news, analytics and events for agriculture, forest products, metals and mining, and new‑generation energy markets. Founded in 1865, it employs over 600 people across the UK, US, China, India, Singapore,...


  • Singapur, Singapore Crystal Equation Corporation Full time

    We are seeking a skilled Site Reliability Engineer (SRE) to join our team. SRE will be responsible for keeping all internal user-facing applications and other production systems running smoothly. This hybrid role involves a combination of both development and operations skills to build and manage systems that are both efficient and reliable. The Enterprise...


  • Singapur, Singapore Fastmarkets Full time

    Fastmarkets is an industry-leading price-reporting agency (PRA) and information provider for global commodities, providing price data, news, analytics and events for the agriculture, forest products, metals and mining and new-generation energy markets. Fastmarkets' data is critical for customers seeking to understand and predict dynamic, sometimes opaque...


  • Singapur, Singapore Crystal Equation Corporation Full time

    We are seeking a skilled Site Reliability Engineer (SRE) to join our team. SRE will be responsible for keeping all internal user‑facing applications and other production systems running smoothly. This hybrid role involves a combination of both development and operations skills to build and manage systems that are both efficient and reliable. The Enterprise...


  • Singapur, Singapore Medium Full time

    About Rezolve Ai Rezolve Ai (NASDAQ: RZLV) is an industry leader in AI-powered solutions, specializing in enhancing customer engagement, operational efficiency, and revenue growth. The Brain Suite delivers advanced tools that harness artificial intelligence to optimize processes, improve decision-making, and enable seamless digital experiences As a leader in...


  • Singapur, Singapore Viasat Full time

    About us One team. Global challenges. Infinite opportunities. At Viasat, we’re on a mission to deliver connections with the capacity to change the world. For more than 35 years, Viasat has helped shape how consumers, businesses, governments and militaries around the globe communicate. We’re looking for people who think big, act fearlessly, and create an...


  • Singapur, Singapore Starry Recruitment Full time

    Site Reliability Engineer (SRE) – Singapore Responsibilities Support the operation and maintenance of overseas cloud-based services, ensuring platform stability, reliability, and performance; proactively identify and resolve system bottlenecks. Follow internal operational processes, taking ownership of incident management, service request management,...


  • Singapur, Singapore RAZER (ASIA-PACIFIC) PTE. LTD. Full time

    We are looking for Site Reliability Engineers (SRE) to join our AI Software team. In this role, you will ensure the reliability, performance, scalability, and operational excellence of AI products, model-serving infrastructure, and backend API systems. You’ll work closely with software engineers, AI teams and release teams to automate operations, enhance...