Site Reliability Engineer

1 week ago


Singapore BYTEDANCE PTE. LTD. Full time

ByteDance will be prioritizing applicants who have a current right to work in Singapore, and do not require ByteDance's sponsorship of a visa.

**About ByteDance**

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

**Why Join Us**

At ByteDance, our people are humble, intelligent, compassionate and creative. We create to inspire - for you, for us, and for millions of users across all of our products. We lead with curiosity and aim for the highest, never shying away from taking calculated risks and embracing ambiguity as it comes. Here, the opportunities are limitless for those who dare to pursue bold ideas that exist just beyond the boundary of possibility. Join us and make impact happen with a career at ByteDance.

**About the Team**

Our infrastructure team is seeking experienced site reliability engineers to build globally distributed platform for provisioning and deploying edge services, such as traffic acceleration, CDN cache, gaming, etc. We use Kubernetes to manage on-prem/cloud nodes and build an eco-system around it, including tools for monitoring, alerting, logging, CI/CD, etc. and various services with automated deployment/scaling in order to maximize daily operation efficiencies. On top of the Kubernetes infrastructure, we build a PaaS platform to help deploy and manage global edge services.

**Responsibilities**
- Deploy and administrate Kubernetes clusters both on-prem and in cloud (AWS, GCP, etc.).
- Collaborate with software engineers to build enterprise-level platform (PaaS) with cutting-edge Cloud Native Computing Foundation (CNCF) technologies.
- Design, develop, automate, and continuously improve platform services and pipelines, such as monitoring, alerting, logging, tracing, CI/CD, etc.
- Improve Kubernetes system efficiency and debug issues related to networking, storage, scheduling, etc.
- Collaborate with open-source communities to advance Kubernetes and Cloud Native technologies.

**Qualifications**
- Master’s degree (or Bachelor's degree with 3+ years of experience) in Computer Engineering, Computer Science, or related fields.
- Experience in Kubernetes administration.
- Experience in Unix/Linux systems from kernel to shell and beyond.
- Experience with Kubernetes CNI deployment and troubleshooting, including (but not limited to) the following CNIs: Cilium, Kube-Router, Calico, Flannel.
- Experience in designing, analyzing, and building automation tools for large scale and complex systems.

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.



  • Singapore Beijing Foreign Enterprise Management Consultants Co.,Ltd. Full time

    Get AI-powered advice on this job and more exclusive features. Direct message the job poster from Beijing Foreign Enterprise Management Consultants Co.,Ltd. On behalf of Huawei, a world-renowned information and communication technology company, we are seeking passionate and talented individuals to join our team as Site Reliability Engineer. Job...


  • Singapore ETEAM WORKFORCE PTE. LTD. Full time

    Position: Site Reliability Engineer (SRE) Work Mode - Onsite/Hybrid Timing - 9am to 6 pm Duration – 1 Year (Highly extendable) Salary: 6018 SGD Work Location: Robinson Road, Singapore About the Role We are looking for a seasoned Site Reliability Engineer (SRE) with 5+ years of experience to join our Platform Engineering team. This role is ideal for someone...

  • Site Reliability Engineer

    59 minutes ago


    Singapore ETEAM WORKFORCE PTE. LTD. Full time

    Position: Site Reliability Engineer (SRE)Work Mode -Onsite/Hybrid Timing -9am to 6 pm Duration –1 Year (Highly extendable)Salary: 6018 SGD Work Location: Robinson Road, Singapore About the Role We are looking for a seasoned Site Reliability Engineer (SRE) with 5+ years of experience to join our Platform Engineering team. This role is ideal for someone who...


  • Singapore JJ Consulting Services Full time

    Our Client is a fast growing company in Singapore, who is seeking to recruit a Site Reliability Engineer. **Site Reliability Engineer** **Key Roles & Responsibilities** - Providing ancillary support of Enterprise-Grade Products and solutions at customer's sites - Ironing out deployment issues or challenges that our customers may face - Responsible for...


  • Singapore NodeFlair Full time

    **Job Summary**: **Salary** S$11,500 - S$16,500 / Monthly **Job Type** **Seniority** Senior **Years of Experience** At least 7 years **Tech Stacks** Microsoft Puppet Java Ansible Python **This is Adyen** Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the...


  • Singapore HCLTech Full time

    We are seeking a highly experienced Site Reliability Engineer (SRE) with 10 years of expertise in building, managing, and optimizing reliable, scalable, and secure systems. This role requires strong proficiency in end-to-end SRE practices across multi-cloud, hybrid cloud, and on-premises data center environments. The ideal candidate will drive automation,...

  • Site Reliability Engineer

    46 minutes ago


    Singapore Salt Full time

    Description SALT is hiring Site Reliability Engineer for a global technology client in Singapore for 12 months & renewable contract assignment. Responsibilities: - Reliability Engineering: Define and implement SLIs, SLOs, and error budgets to measure and improve service reliability. - Cloud Infrastructure: Design, deploy, and manage infrastructure on Google...


  • Singapore Salt Talent Search Pte Ltd Full time

    SALT is hiring Site Reliability Engineer for a global technology client in Singapore for 12 months & renewable contract assignment. Responsibilities Reliability Engineering: Define and implement SLIs, SLOs, and error budgets to measure and improve service reliability. Cloud Infrastructure: Design, deploy, and manage infrastructure on Google Cloud Platform...


  • Singapore Salt Full time

    SALT is hiring Site Reliability Engineer for a global technology client in Singapore for 12 months & renewable contract assignment. Responsibilities: Reliability Engineering: Define and implement SLIs, SLOs, and error budgets to measure and improve service reliability. Cloud Infrastructure: Design, deploy, and manage infrastructure on Google Cloud Platform...

  • Site Reliability Engineer

    45 minutes ago


    Singapore Salt Full time

    SALT is hiring Site Reliability Engineer for a global technology client in Singapore for 12 months & renewable contract assignment. Responsibilities: Reliability Engineering: Define and implement SLIs, SLOs, and error budgets to measure and improve service reliability. Cloud Infrastructure: Design, deploy, and manage infrastructure on Google Cloud Platform...