Site Reliability Engineer

1 week ago


Singapore TIKTOK PTE. LTD. Full time

Live Streaming Infrastructure is a world-leading live streaming platform that provides end to end solutions for TikTok and also our external partners. We are building the next generation live streaming platform including live streaming ingestion, processing and delivery, to provide the best live streaming experience for our billions of users around the world. By joining us, you will have the opportunity to not only tackle the challenges of large scale distributed system across multi regions which generates huge amount of data from billions of users, but also the unique challenges from live streaming to provide low latency and high quality videos for different types of network connectivity even when the live event can go viral anytime to generate huge amount of traffic.

**Responsibilities**:

- Build global infrastructure for multi-media processing, storage and transport, to serve billions of users all over the world.
- Build live streaming CDN, and deploy nodes globally.
- Build tools, automations, visualisations and monitors to facilitate the operation and optimisation of the global infrastructure.
- Engage in and improve the whole service lifecycle, from inception and design, through deployment, operation and refinement.
- Scale up systems sustainably through mechanisms like automation, and initiate changes that improve system reliability and processing speed.

**Qualifications**:

- Bachelor's degree in Computer Science or a related technical background involving software/system engineering, or equivalent working experience.
- Good programming experience with at least one of the following languages: C, C++, Java, Python, or Go.
- Strong in analytical skills and the ability to solve real world problems in a fast moving environment
- Experience in designing, analyzing and building automation and tools for large scale systems
- Experience in building solutions for AWS, Google, Azure, and other cloud services
- Familiar with Unix/Linux operating systems.
- Good understanding of every aspect of microservice architecture, and hands on experience in troubleshooting in large scale distributed systems.



  • Singapore DHATCH CONSULTANCY PTE. LTD. Full time

    Site Reliability Engineer: **Preferred Qualifications** - 3+ years of experience in site reliability engineering, DevOps, or software engineering roles. - Proven skills in: - Monitoring & alerting tools (Grafana, New Relic) - CI/CD pipelines (Git, Jenkins, GitHub Actions, etc.) - Container orchestration (Docker, Kubernetes) - Infrastructure-as-code...


  • North-East Singapore PERSOLKELLY Full time

    The Site Reliability Engineer is responsible for ensuring the reliability, scalability, and efficiency of our systems and infrastructure. This role involves monitoring, troubleshooting, and resolving issues to maintain optimal performance. The engineer will also collaborate with cross-functional teams to automate processes and improve system reliability....


  • Singapore RigNet Full time

    About us One team. Global challenges. Infinite opportunities. At Viasat, we're on a mission to deliver connections with the capacity to change the world. For more than 35 years, Viasat has helped shape how consumers, businesses, governments and militaries around the globe communicate. We're looking for people who think big, act fearlessly, and create an...


  • Singapore RigNet Full time

    About us One team. Global challenges. Infinite opportunities. At Viasat, we're on a mission to deliver connections with the capacity to change the world. For more than 35 years, Viasat has helped shape how consumers, businesses, governments and militaries around the globe communicate. We're looking for people who think big, act fearlessly, and create an...


  • Singapore Viasat Full time

    About us One team. Global challenges. Infinite opportunities. At Viasat, we're on a mission to deliver connections with the capacity to change the world. For more than 35 years, Viasat has helped shape how consumers, businesses, governments and militaries around the globe communicate. We're looking for people who think big, act fearlessly, and create an...


  • Singapore NTT Data Singapore Full time $120,000 - $200,000 per year

    As a Site Reliability Engineer you will be filling a mission-critical role ensuring that our systems are healthy, monitored, automated, fault tolerant and designed to scale. You will collaborate and work closely with engineering teams to continually improve our production services, facilitating fast delivery of new products, and reducing downtime. Key...


  • Singapore Rapsys Technologies Full time

    Drive the Site Reliability Engineering agenda forward at an Enterprise Level to improve availability, reliability, and performance of services. - Drive cross-team efforts in resiliency assessment exercises and reporting - Draft and/or contribute to internal SRE training materials - Support services before they go live through activities such as Chaos testing...


  • Singapore ABAXX SINGAPORE PTE. LTD. Full time

    Site Reliability Engineer - Networking We are seeking competent candidate joining our Infrastructure Team for the mission building and operating MAS regulated marketplace and clearing house. This role is ideal for someone with a strong foundation in AWS services, infrastructure as code, and cloud security, who is passionate about building scalable, secure,...


  • Singapore Crystal Equation Corporation Full time

    We are seeking a skilled Site Reliability Engineer (SRE) to join our team. SRE will be responsible for keeping all internal user-facing applications and other production systems running smoothly. This hybrid role involves a combination of both development and operations skills to build and manage systems that are both efficient and reliable. The Enterprise...


  • Singapore Imperva Full time

    **Site Reliability Engineer**:** About the role** Imperva’s Infrastructure and Cloud team is looking for a highly technical Site Reliability Engineer to drive innovation, scale, and create operational excellence for the Imperva globally distributed network. As an SRE in the ICO organization, you approach solving, supporting, and optimizing the...