System Reliability Expert

2 weeks ago


Singapore BYTEDANCE PTE. LTD. Full time

About Bytedance
Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join Us
At ByteDance, our people are humble, intelligent, compassionate and creative. We create to inspire - for you, for us, and for millions of users across all of our products. We lead with curiosity and aim for the highest, never shying away from taking calculated risks and embracing ambiguity as it comes. Here, the opportunities are limitless for those who dare to pursue bold ideas that exist just beyond the boundary of possibility. Join us and make impact happen with a career at ByteDance.

About the Team
The Game Technology team is playing a significant role in the whole life cycle of the game. We are responsible for the development, testing, operation, SRE, and quality assurance of the user and operating system, and give strong support for game developing and publishing. Providing comprehensive and systematical solutions, support the stable operation and commercialization of the game.

1. Responsible for the design and implementation of the deployment architecture of the game's overseas game business, and ensuring the stable operation of online services.
2. Daily maintenance of game servers, opening and closing servers, online environment changes, data backup and monitoring and alarm processing, etc.
3. Identify and solve problems related to key service operations, assist in the analysis and optimization of service performance bottlenecks, and be responsible for rapid response and handling of faults in the online environment.
4. Cooperate with domestic teams to continuously improve the design and experience of game operation and maintenance tools, such as publishing changes, monitoring, alarms, logs, traceability, network optimization, etc.
5. Continue to maintain the key SLA indicators of the game, and do a good job in the operation and maintenance support of the game in terms of efficiency, cost, quality and security.

Qualifications:
1. Bachelor's or higher degree in Computer Science, Information Systems or related field.
2. Has cloud computing technology experience from Amazon Web Services, Google Cloud Platform and other suppliers, more than two years of experience in game industry operation and maintenance
3. Practical experience in at least one programming language: Bash, Go, Python.
4. Understand K8S containerized service management, cloud network optimization, ELK, Kafka and other technologies in one or more directions.



  • Singapore beBeeSystem Full time

    Job Title: Reliable Systems Expert We are seeking a skilled Reliable Systems Expert to join our team. About the Role: The Reliable Systems Expert will be responsible for managing the operational work of our services, ensuring they are running smoothly and efficiently. This involves designing and selecting basic runtime environments for game servers...

  • Reliability Expert

    4 days ago


    Singapore beBeeReliability Full time $90,000 - $120,000

    Job Title: Reliability ExpertWe are seeking an experienced reliability expert to lead our reliability studies and analysis. The ideal candidate will have a strong background in mechanical engineering and extensive experience in maintenance and troubleshooting.The successful candidate will be responsible for conducting root cause failure analysis (RCA) and...


  • Singapore beBeeReliability Full time $120,000 - $150,000

    Reliability and Maintainability EngineerWe are seeking a highly skilled Reliability and Maintainability (RAM) expert to join our team. The ideal candidate will have a strong background in RAM, with excellent leadership, communication and teamwork abilities.About the RoleDevelop and implement RAM methodologies to ensure the long-term sustainability of complex...

  • Reliability Expert

    6 days ago


    Singapore beBeeExpert Full time $100,000 - $120,000

    Reliability ExpertWe are seeking a skilled professional to join our team as a Reliability Expert. As a key member of our organization, you will be responsible for ensuring the smooth operation of our systems and applications.Key ResponsibilitiesProvide day-to-day support for Ecommerce Platform Application, FX clients and Stockbrokers using FX.Monitor systems...


  • Singapore beBeeReliability Full time

    Unlock your potential as a Reliability Architect, driving the design and development of scalable backend systems and cloud-based services. We're seeking an experienced engineer to build, maintain and enhance our infrastructure, leveraging expertise in software development, automation, tooling and integration.About UsOur organization thrives on diversity,...

  • Reliability Expert

    4 days ago


    Singapore beBeeReliability Full time $200,000 - $240,000

    About the Role:We are seeking a highly skilled individual to join our team as a Reliability Expert. This role involves ensuring the smooth operation of our systems and developing tools to maintain their resilience and performance.Key Responsibilities:Collaborate with engineers to design and implement scalable systems.Develop and enhance tools to monitor and...


  • Singapore beBeeReliability Full time

    Job Title: System Reliability EngineerWe are seeking an exceptional individual to join our team as a System Reliability Engineer. This role is responsible for designing and implementing cloud architectures, administering cloud-scale production environments, and ensuring service reliability and cost-efficiency.Key Responsibilities:Design and implement cloud...


  • Singapore beBeeSoftwareReliability Full time $100,000 - $200,000

    Job Title: Senior Software Reliability ExpertWe are seeking an experienced and skilled professional to join our team as a Senior Software Reliability Expert. This role involves designing and developing software systems that meet the highest standards of quality, reliability, and performance.Job Description:As a Senior Software Reliability Expert, you will be...


  • Singapore beBeeReliability Full time $80,000 - $120,000

    System Reliability SpecialistEnsuring the Uptime of Complex SystemsThis position is responsible for delivering technical support across various areas including reliability, availability, and maintainability (RAM) to cover the entire life cycle of clients' systems and equipment.The role involves proposing and implementing integrated logistics support (ILS)...


  • Singapore beBeeReliability Full time $80,000 - $120,000

    Job Description">We are seeking a skilled professional to excel as a Reliability Expert. This key role is responsible for ensuring the smooth operation of our systems and applications.Key Responsibilities">Provide day-to-day support for Ecommerce Platform Application, FX clients and Stockbrokers using FX.Monitor systems and perform maintenance tasks as...