Site Reliability Engineer, Traffic Platform
2 weeks ago
**Location**: Singapore **Team**: Technology **Employment Type**: Regular **Job Code**: A111172A **Responsibilities**: About the Team Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed infrastructures. Our SREs are tasked to ensure the traffic services are reliable, fault-tolerant, efficiently scalable and cost-effective. You will have the opportunity to manage a variety of complex systems at scale, including traffic systems that serve hyperscale datacenters and public cloud, global load balancer that handles Tbps of traffic. **Responsibilities**: - Build, expand and operate Bytedance’s global traffic platform, including large-scale systems in public and private clouds, edge data centers. - Build tools, automations, visualizations and monitors to facilitate the operation and optimization of the global traffic platform. - Work in a fast-paced environment. Participate in technical operations and rotations in response to performance and reliability issues. - Help improve the whole lifecycle of infrastructure services from inception and design throughout development, to deployment, user support and refinement. **Qualifications**: Minimum Qualifications - Bachelor or Master's degree in Computer Engineering, Electrical Engineering, Computer Science or related major. - Proven years experience working with Linux systems from kernel to shell and beyond with experience working with system libraries, file systems, and client-server protocols. - At least 3 years experience in one or more programming languages such as Go, Python and Shell script. - Familiar with Cloud and CI/CD framework/Tools, such as GIT, Docker, Kubernetes, etc. Preferred Qualifications - Experience in designing, analyzing and building automation and tools for large scale systems - Experience in building solutions with AWS, Google, Azures and other cloud services. - Experience in networking technologies such TCP/IP, HTTP, DNS, etc. in a carrier-grade environment. - Experience in developing and operating one or more of following systems: Kubernetes, Nginx, ipvs, ELK stack, etc. - Self-driven and capable of coping with ambiguity and moving projects from concept to delivery. - Strong in analytical skills and the ability to solve real world problems in a fast moving environment. Job Information About Us Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Lemon8, CapCut and Pico as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content. Why Join ByteDance Inspiring creativity is at the core of ByteDance's mission. Our innovative products are built to help people authentically express themselves, discover and connect - and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and enrich life - a mission we work towards every day. As ByteDancers, we strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our Company, and our users. When we create and grow together, the possibilities are limitless. Join us. Diversity & Inclusion ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.
-
Site Reliability Engineer, Traffic Platform
1 week ago
Singapore ByteDance Full timeResponsibilities About ByteDance Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create...
-
Site Reliability Engineers/Platform Engineers
2 weeks ago
Singapore Razer Full timeJoining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work, offering you the opportunity to make an impact globally while working across a global team located across 5 continents. Razer is also a great place to work, providing you the unique, gamer-centric #LifeAtRazer experience that will put...
-
Singapore Razer Inc. Full timeSite Reliability Engineers/Platform Engineers (Mid/Senior)Joining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work , offering you the opportunity to make an impact globally while working across a global team located across 5 continents. Razer is also a great place to work, providing you the...
-
Automation Engineer
1 week ago
Singapore Yunex Traffic Full timeOverview Automation Engineer role at Yunex Traffic. Location and Employment Location: Singapore, SG, Type of Employment: Full-time Career Level: Professional Job Family: Engineering Date posted: Sep 1, 2025What we do As a prominent company in the field of intelligent traffic systems, we develop digital solutions to help cities and transportation providers...
-
Site Reliability Engineer
2 weeks ago
Singapore Crystal Equation Corporation Full timeWe are seeking a skilled Site Reliability Engineer (SRE) to join our team. SRE will be responsible for keeping all internal user-facing applications and other production systems running smoothly. This hybrid role involves a combination of both development and operations skills to build and manage systems that are both efficient and reliable. The Enterprise...
-
Singapore NodeFlair Full time**Job Summary**: **Salary** S$7,000 - S$14,000 / Monthly **Job Type** **Seniority** Junior **Years of Experience** At least 1 year **Tech Stacks** HTTP TCP AWS Docker Go Shell Script CI ELK Shell Git Azure Istio Linux Kubernetes Python **About ByteDance** Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of...
-
Site Reliability Engineer
15 hours ago
Singapore Shopee Full timeDepartmentEngineering and Technology- LevelExperienced (Individual Contributor)- LocationSingaporeThe Engineering and Technology team is at the core of the Shopee platform development. The team is made up of a group of passionate engineers from all over the world, striving to build the best systems with the most suitable technologies. Our engineers do not...
-
Singapore Shopify Full timeCompany Description Shopify is the leading omni-channel commerce platform. Merchants use Shopify to design, set up, and manage their stores across multiple sales channels, including mobile, web, social media, marketplaces, brick-and-mortar locations, and pop-up shops. The platform also provides merchants with a powerful back-office and a single view of...
-
Automation Engineer
2 days ago
Singapore Yunex Traffic Full timeUniting what's next in traffic - Yunex TrafficAs a prominent company in the field of intelligent traffic systems, we are at the forefront of the emerging mobility landscape.We harness the power of digital advancements and innovative technologies to help cities, highway authorities, and transportation providers develop new mobility solutions. Our offerings...
-
Site Reliability Engineer
2 weeks ago
North-East Singapore PERSOLKELLY Full timeThe Site Reliability Engineer is responsible for ensuring the reliability, scalability, and efficiency of our systems and infrastructure. This role involves monitoring, troubleshooting, and resolving issues to maintain optimal performance. The engineer will also collaborate with cross-functional teams to automate processes and improve system reliability....