Site Reliability Engineering
4 days ago
**About ByteDance**
Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.
**Why Join Us**
At ByteDance, our people are humble, intelligent, compassionate and creative. We create to inspire - for you, for us, and for millions of users across all of our products. We lead with curiosity and aim for the highest, never shying away from taking calculated risks and embracing ambiguity as it comes. Here, the opportunities are limitless for those who dare to pursue bold ideas that exist just beyond the boundary of possibility. Join us and make impact happen with a career at ByteDance.
**About the Team**
Our infrastructure team operates a large network of POPs around the world hosting edge services, such as traffic acceleration, CDN cache, gaming, etc. We are seeking experienced reliability/performance engineers to maintain stability and to optimize the performance of various edge services and products running on top of our Kubernetes-based platform (PaaS), and to create solutions for ever growing business needs on the edge.
Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed infrastructures. Our SREs are tasked with ensuring the infrastructure services are reliable, fault-tolerant, efficiently scalable and cost-effective. You will have the opportunity to manage a variety of complex systems at scale, including systems that administer hyperscale datacenters and public cloud, global content distribution networks (CDNs) and load balancers that handle Tbps of traffic. You will also have the opportunity to collaborate with various teams to translate business needs into concrete action items, and/or improvements in system design or procedures.
**Responsibilities**
- Build metrics, tools, automations, visualizations and monitors to facilitate the operation and optimization of edge services.
- Build insights through statistical analysis to help drive targeted deployments to expand the coverage of our global infrastructure.
- Analyze, design and implement solutions at the system level to remove bottlenecks and improve edge service performance.
- Work in a fast-paced environment. Participate in technical operations and rotations in response to performance and reliability issues.
**Qualifications**
- Master’s degree or Bachelor's degree with 2+ years of experience in Computer Engineering, Electrical Engineering, Computer Science or related major
- 2+ years experience working with Unix Linux systems from kernel to shell and beyond with experience working with system libraries, file systems, and client-server protocols.
- 2+ years experience in one or more programming languages such as Java, C++, Go, or scripting experience in Shell and Python.
ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.
-
Site Reliability Engineer
2 weeks ago
Singapore PERSOLKELLY Full timeWe have partnered with a renowned global leader in information and communications technology (ICT) infrastructure and smart devices. They are providing full-stack, all-scenario solution for products and services carriers, enterprises, governments, and individual consumers worldwide. Our client is looking for enthusiastic Site Reliability Engineer to...
-
Site Reliability Engineer
3 weeks ago
Singapore PERSOLKELLY Full timeWe have partnered with a renowned global leader in information and communications technology (ICT) infrastructure and smart devices. They are providing full-stack, all-scenario solution for products and services carriers, enterprises, governments, and individual consumers worldwide. Our client is looking for enthusiastic Site Reliability Engineer to...
-
Site Reliability Engineer
6 days ago
Singapore ByteDance Full timeResponsibilities About the Team The Infrastructure Engineering team supports the company's fast growth by building and operating hyperscale datacenters. The team manages the end to end lifecycle of server fleet, providing cloud solutions and various infrastructure services ensuring that they are scalable and are reliable. Responsibilities Build, expand,...
-
Site Reliability Engineer
3 weeks ago
Singapore TRUEWATCH TECHNOLOGY INC PTE. LTD. Full timeSite Reliability Engineer**Roles and Responsibilities**The Site Reliability Engineer plays a crucial role in ensuring the availability, reliability, and performance of our production environment.Monitor system health and take a holistic view to ensure optimal operation. Implement site reliability automation to minimize downtime and reduce costs. Manage...
-
Site Reliability Engineer
6 days ago
Singapore Sea Limited Full timeEngineering and Technology - Infrastructure, Singapore - Entry Level Our DevOps Engineering team plays an important role in developing and maintaining the internal systems and tools for the Infrastructure team. As a Site Reliability Engineer, you are responsible for improving the availability and reliability of our Infrastructure services. - Responsible for...
-
Site Reliability Engineer
4 weeks ago
Singapore TRUEWATCH TECHNOLOGY INC PTE. LTD. Full timeRoles & ResponsibilitiesResponsibility:Run production environment by monitoring availability and taking a holistic view of the system health. Achieve site reliability automation, minimize system downtime, and reduce site reliability cost. Manage risks and resolves issues that affect the release scope, schedule and quality. Suggest architecture...
-
Site Reliability Engineer
4 weeks ago
Singapore TRUEWATCH TECHNOLOGY INC PTE. LTD. Full timeRoles & ResponsibilitiesResponsibility: Run production environment by monitoring availability and taking a holistic view of the system health. Achieve site reliability automation, minimize system downtime, and reduce site reliability cost. Manage risks and resolves issues that affect the release scope, schedule and quality. Suggest architecture...
-
Reliability Engineer
2 days ago
Singapore ONE STOP ENGINEERING PTE. LTD. Full timeTitle**:Reliability Engineer Purpose Statement (2-3 Sentences): - Ensures reliability and maintainability of equipment, processes, utilities, facilities and controls with an objective to constantly improve site production and cost performance. - Develops engineering solutions to repetitive failures and all other problems that adversely affect plant...
-
Senior Site Reliability Engineer
4 weeks ago
Singapore GK CONSULTING PTE. LTD. Full timeSenior Site Reliability EngineerWe are seeking an experienced Senior Site Reliability Engineer to ensure the reliability, availability, and performance of our cloud-based internet services. The ideal candidate will be responsible for owning the reliability, availability, and user experience for assigned cloud services.
-
Site Reliability Engineer
1 week ago
Singapore beBee Careers Full timeSite Reliability EngineerWe are seeking a talented Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our systems.Job Description:We are looking for an experienced engineer who can develop, support, administer, and consult on the SecDb runtime environment....