Site Reliability Engineer, Traffic Platform
1 week ago
Location:
Singapore
Team:
Technology
Employment Type:
Regular
Job Code:
A111172A
Responsibilities
About the Team
Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed infrastructure. Our SREs are tasked with ensuring that traffic services are reliable, fault-tolerant, efficiently scalable, and cost-effective. You will have the opportunity to manage a variety of complex systems at scale, including traffic systems that serve hyperscale data centers and public clouds, and a global load balancer that handles Tbps of traffic.
Responsibilities
- Build, expand, and operate ByteDance's global traffic platform, including large-scale systems in public and private clouds and edge data centers.
- Build tools, automations, visualizations and monitors to facilitate the operation and optimization of the global traffic platform.
- Work in a fast-paced environment. Participate in technical operations and rotations in response to performance and reliability issues.
- Help improve the whole lifecycle of infrastructure services, from inception and design through development, deployment, user support, and refinement.
Qualifications
Minimum Qualifications
- Bachelor's or Master's degree in Computer Engineering, Electrical Engineering, Computer Science, or a related major.
- Proven experience working with Linux systems, from kernel to shell and beyond, including system libraries, file systems, and client-server protocols.
- At least 3 years of experience in one or more programming languages, such as Go, Python, or shell scripting.
- Familiarity with cloud and CI/CD frameworks and tools, such as Git, Docker, and Kubernetes.
Preferred Qualifications
- Experience in designing, analyzing, and building automation and tools for large-scale systems.
- Experience in building solutions with AWS, Google Cloud, Azure, and other cloud services.
- Experience in networking technologies such as TCP/IP, HTTP, and DNS in a carrier-grade environment.
- Experience in developing and operating one or more of the following systems: Kubernetes, Nginx, IPVS, ELK stack, etc.
- Self-driven and capable of coping with ambiguity and moving projects from concept to delivery.
- Strong analytical skills and the ability to solve real-world problems in a fast-moving environment.
Job Information
About Us
Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Lemon8, CapCut and Pico as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.
Why Join ByteDance
Inspiring creativity is at the core of ByteDance's mission. Our innovative products are built to help people authentically express themselves, discover and connect – and our global, diverse teams make that possible. Together, we create value for our communities, inspire creativity and enrich life - a mission we work towards every day.
As ByteDancers, we strive to do great things with great people. We lead with curiosity, humility, and a desire to make impact in a rapidly growing tech company. By constantly iterating and fostering an "Always Day 1" mindset, we achieve meaningful breakthroughs for ourselves, our Company, and our users. When we create and grow together, the possibilities are limitless. Join us.
Diversity & Inclusion
ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.
-
Site Reliability Engineer, Traffic Platform
1 week ago
Singapore ByteDance Full time
Site Reliability Engineer, Traffic Platform - Traffic SRE Location: Team: Technology Employment Type: Regular Job Code: A Responsibilities The Traffic Infrastructure Global Engineering (TIGE)-Traffic Platform team at ByteDance builds and operates multi-cloud based large scale network services around the world that we use to accelerate and...
-
Site Reliability Engineer, Traffic Platform
1 week ago
Singapore ByteDance Full time
**Site Reliability Engineer, Traffic Platform - Traffic SRE** - Singapore Regular - R&D Job ID: A136692 **Responsibilities** **Qualifications** Minimum Qualifications - Experience in developing network systems in Rust, C, C++, and/or Go, developing skills in Linux environment. - Bachelor's degree in Computer Science, Electrical Engineering, Computer...
-
Cloud Platform Site Reliability Engineer
2 weeks ago
Singapore Barings Full time
Overview: Cloud Platform Site Reliability Engineer – Barings. We are seeking a highly motivated and skilled professional to design, implement, and maintain Cloud infrastructure solutions for enterprise-level organizations. The role combines cloud engineering and operations with a focus on reliability, performance, monitoring, security, and cloud platform...
-
Cloud Platform Site Reliability Engineer
5 days ago
Singapore Barings LLC Full time
Cloud Platform Site Reliability Engineer. Locations: Hong Kong: SG - SINGAPORE - 1 WALLICH ST. Time type: Full time. Posted on: Posted 30+ Days Ago. Job requisition id: JR\_ At Barings, we are as invested in our associates as we are in our clients. We recognize those who work diligently for us and reward them...
-
Site Reliability Engineer Graduate
1 week ago
Singapore ByteDance Full time
Site Reliability Engineer Graduate (Traffic Platform) - 2026 Start (BS/MS) Employment Type: Regular Job Code: A A Successful candidates must be able to commit to an onboarding date by the end of 2026. Please state your availability and graduation date clearly in your resume. Responsibilities Design and develop features of traffic software (DNS Server, L4...
-
Site Reliability Engineer
1 week ago
Singapore ByteDance Full time
**Site Reliability Engineer (Traffic) - Infrastructure Engineering** - Singapore Regular - R&D Job ID: A82206 **Responsibilities** **Qualifications** Minimum Qualifications - Bachelor’s degree in any of these faculties: Computer Science, Information Technology, Programming & Systems Analysis, Science (Computer Studies). - Minimum 5 years work experience. -...
-
Site Reliability Engineer
7 days ago
Singapore Second Talent Full time
Infrastructure Platform Development: Design, build, and enhance infrastructure operation platforms. Develop and maintain systems for infrastructure management, CI/CD pipelines, monitoring/alerting, and centralized logging. Drive platform standardization and automation initiatives. High Availability & Reliability: Ensure maximum uptime for production services...