Current jobs related to Site Reliability Engineering - Singapore - ByteDance


  • Singapore Hyphen Connect Full time

    Site Reliability Engineer (Crypto Trading) Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect Site Reliability Engineer (Crypto Trading) 2 days ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect We are hiring for one of our ecosystem projects in...


  • Singapore TRUEWATCH TECHNOLOGY INC PTE. LTD. Full time

    **Responsibility**: - Run production environment by monitoring availability and taking a holistic view of the system health. - Achieve site reliability automation, minimize system downtime, and reduce site reliability cost. - Manage risks and resolves issues that affect the release scope, schedule and quality. - Suggest architecture improvements, push for...


  • Singapore Hyphen Connect Full time

    Site Reliability Engineer (Crypto Trading) Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect Site Reliability Engineer (Crypto Trading) 2 days ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect We are hiring for one of our ecosystem...


  • Singapore TEAMLEASE DIGITAL CONSULTING PTE. LTD. Full time

    As a Site Reliability Engineer, you will be filling a mission-critical role ensuring that our systems are healthy, monitored, automated, fault-tolerant and designed to scale. You will collaborate and work closely with engineering teams to continually improve our production services, facilitating fast delivery of new products, and reducing downtime. Key...


  • Singapore HCLTech Full time

    Get AI-powered advice on this job and more exclusive features. This role combines software and systems engineering to build run, and maintain high performant, distributed, fault tolerant and resilient financial systems. Site Reliability Engineers focus on ensuring a joyful customer journey. As a Site Reliability Engineer you will be filling a...


  • Singapore Vega Solutions Full time

    Join to apply for the Site Reliability Engineer role at Vega SolutionsJoin to apply for the Site Reliability Engineer role at Vega SolutionsGet AI-powered advice on this job and more exclusive features.Tokka Labs | Singapore | Full-TimeTokka Labs is a proprietary trading firm with a focus on close collaboration, rigorous research, and cutting-edge...


  • Singapore Tardis Group Full time

    Direct message the job poster from Tardis Group Recruiter at Tardis Group | Finding Top Talent in Tech & Quant About the Company A rapidly growing technology firm operating at the forefront of artificial intelligence and advanced software solutions. The company fosters a fast-paced, collaborative, and innovation-driven culture, uniting talent across...


  • Singapore HCLTech Full time

    Get AI-powered advice on this job and more exclusive features.This role combines software and systems engineering to build run, and maintain high performant, distributed, fault tolerant and resilient financial systems. Site Reliability Engineers focus on ensuring a joyful customer journey.As a Site Reliability Engineer you will be filling a mission-critical...


  • Singapore ByteDance Full time

    Site Reliability Engineer - Privacy & Security - Singapore Site Reliability Engineer - Privacy & Security - Singapore 4 days ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. Responsibilities Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen...

  • Reliability Engineer

    2 weeks ago


    Singapore ONE STOP ENGINEERING PTE. LTD. Full time

    Title**:Reliability Engineer Purpose Statement (2-3 Sentences): - Ensures reliability and maintainability of equipment, processes, utilities, facilities and controls with an objective to constantly improve site production and cost performance. - Develops engineering solutions to repetitive failures and all other problems that adversely affect plant...

Site Reliability Engineering

2 weeks ago


Singapore ByteDance Full time

About ByteDance
Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join Us
At ByteDance, our people are humble, intelligent, compassionate and creative. We create to inspire - for you, for us, and for millions of users across all of our products. We lead with curiosity and aim for the highest, never shying away from taking calculated risks and embracing ambiguity as it comes. Here, the opportunities are limitless for those who dare to pursue bold ideas that exist just beyond the boundary of possibility. Join us and make impact happen with a career at ByteDance.

About the Team
Our infrastructure team operates a large network of POPs around the world hosting edge services, such as traffic acceleration, CDN cache, gaming, etc. We are seeking experienced reliability/performance engineers to maintain stability and to optimize the performance of various edge services and products running on top of our Kubernetes-based platform (PaaS), and to create solutions for ever growing business needs on the edge.

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed infrastructures. Our SREs are tasked with ensuring the infrastructure services are reliable, fault-tolerant, efficiently scalable and cost-effective. You will have the opportunity to manage a variety of complex systems at scale, including systems that administer hyperscale datacenters and public cloud, global content distribution networks (CDNs) and load balancers that handle Tbps of traffic. You will also have the opportunity to collaborate with various teams to translate business needs into concrete action items, and/or improvements in system design or procedures.

**Responsibilities**:

- Build metrics, tools, automations, visualizations and monitors to facilitate the operation and optimization of edge services.
- Build insights through statistical analysis to help drive targeted deployments to expand the coverage of our global infrastructure.
- Analyze, design and implement solutions at the system level to remove bottlenecks and improve edge service performance.
- Work in a fast-paced environment. Participate in technical operations and rotations in response to performance and reliability issues.
- Master’s degree or Bachelor's degree with 2+ years of experience in Computer Engineering, Electrical Engineering, Computer Science or related major
- 2+ years experience working with Unix Linux systems from kernel to shell and beyond with experience working with system libraries, file systems, and client-server protocols.
- 2+ years experience in one or more programming languages such as Java, C++, Go, or scripting experience in Shell and Python.

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.