Site Reliability Engineer

1 week ago


Singapore NUVERSE PTE. LTD. Full time

**About Bytedance**

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

**Why Join Us**

At ByteDance, our people are humble, intelligent, compassionate and creative. We create to inspire - for you, for us, and for millions of users across all of our products. We lead with curiosity and aim for the highest, never shying away from taking calculated risks and embracing ambiguity as it comes. Here, the opportunities are limitless for those who dare to pursue bold ideas that exist just beyond the boundary of possibility. Join us and make impact happen with a career at ByteDance.

**About the Team**

The Game Technology team is playing a significant role in the whole life cycle of the game. We are responsible for the development, testing, operation, SRE, and quality assurance of the user and operating system, and give strong support for game developing and publishing. Providing comprehensive and systematical solutions, support the stable operation and commercialisation of the game.

1.Responsible for the design and deployment of overseas game business architecture, ensuring the stable operation of online services.

2. Deploy, maintain, operate, and manage game-related servers on a daily basis, including environmental changes, data backups, and alert handling.

3.Identify and solve problems related to key service operations, assist in analysing and optimizing service performance bottlenecks, and be responsible for rapid response and fault handling for the live environment.

4.Collaborate closely with the platform tool development team to continuously optimize the design and user experience of game operations tools, such as release changes, monitoring, alarms, logs, performance tracking, network optimization, etc. 5.Continuously maintain key SLA indicators for games, supporting game operation work with a focus on efficiency, cost, quality, and security.

**Qualifications**

1.Bachelor's degree or higher in Computer Science, Information Engineering, or a related field.

2.Familiar with cloud computing technology, network protocols, and Linux system maintenance.

3.Strong knowledge of game operations and maintenance tools and technologies, such as CICD, monitoring tools, alarms, logs, performance optimization, and security.

4.Strong problem-solving skills and attention to detail.

5. Good communication and collaboration skills, able to work effectively in a team environment.

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.



  • Singapore IDEMIA Full time

    Join to apply for the Site Reliability Engineer role at IDEMIA Join to apply for the Site Reliability Engineer role at IDEMIA Get AI-powered advice on this job and more exclusive features. PurposeThis role plays a critical part in ensuring reliability, scalability, and performance of our systems and services. You will work closely with development and...


  • Singapore TRUEWATCH TECHNOLOGY INC PTE. LTD. Full time

    **Responsibility**: - Run production environment by monitoring availability and taking a holistic view of the system health. - Achieve site reliability automation, minimize system downtime, and reduce site reliability cost. - Manage risks and resolves issues that affect the release scope, schedule and quality. - Suggest architecture improvements, push for...


  • Singapore JJ Consulting Services Full time

    Our Client is a fast growing company in Singapore, who is seeking to recruit a Site Reliability Engineer. **Site Reliability Engineer** **Key Roles & Responsibilities** - Providing ancillary support of Enterprise-Grade Products and solutions at customer's sites - Ironing out deployment issues or challenges that our customers may face - Responsible for...


  • Singapore RigNet Full time

    About us One team. Global challenges. Infinite opportunities. At Viasat, we’re on a mission to deliver connections with the capacity to change the world. For more than 35 years, Viasat has helped shape how consumers, businesses, governments and militaries around the globe communicate. We’re looking for people who think big, act fearlessly, and create an...


  • Singapore ABAXX SINGAPORE PTE. LTD. Full time

    Site Reliability Engineer - Networking We are seeking competent candidate joining our Infrastructure Team for the mission building and operating MAS regulated marketplace and clearing house. This role is ideal for someone with a strong foundation in AWS services, infrastructure as code, and cloud security, who is passionate about building scalable, secure,...


  • Singapore Point72 Full time

    Join to apply for the Site Reliability Engineer role at Point72 About the role As part of Point72’s Technology Team, you will focus on developing and maintaining complex, distributed, real-time systems that support our Global Macro business. Your responsibilities will include optimizing operations through automation, building foundational SRE components,...


  • Singapore Point72 Full time

    Join to apply for the Site Reliability Engineer role at Point72About the role As part of Point72’s Technology Team, you will focus on developing and maintaining complex, distributed, real-time systems that support our Global Macro business. Your responsibilities will include optimizing operations through automation, building foundational SRE components,...


  • Singapore People Profilers Full time

    Job Description: **Responsibilities**: - Support services before they go live through activities such as system design consulting and launch reviews. - Develop and maintain tools, re-designing capacity planning infrastructure for greater scalability. - Troubleshooting, diagnosing and fixing software issues. - Suggesting architecture improvements, pushing...


  • Singapore DT One Full time

    Site Reliability Engineer role at DT One Keeping more people, more connected, more often DT One was founded with the aim to provide mobile carriers with the infrastructure and services they need to help migrant workers stay in touch with their family and friends back home. Today, we operate a leading global network for mobile top-up solutions, innovative...


  • Singapore Point72 Full time

    Join to apply for the Site Reliability Engineer role at Point72 About the role As part of Point72's Technology Team, you will focus on developing and maintaining complex, distributed, real-time systems that support our Global Macro business. Your responsibilities will include optimizing operations through automation, building foundational SRE components,...