
Production System Engineer, Infrastructure Engineering
3 weeks ago
The Infrastructure Engineering team supports the company's fast growth by building and operating hyperscale datacenters. The team manages the end-to-end lifecycle of the server fleet and provides cloud solutions and infrastructure services that are scalable and reliable.
ByteDance is expanding rapidly in the United States, Europe, and Asia, and the Infrastructure Engineering team is building data centers that house hundreds of thousands of servers. As the operator of production systems, you will manage the lifecycle of these servers from initial deployment (OS installation, service provisioning, inventory management) through on-call troubleshooting to decommissioning and recycling.
Responsibilities
- Operation: Contribute to the stability, efficiency, effectiveness, and scalability of data center and server operations, platform, and services on a worldwide scale.
- Lifecycle Enhancement: Participate in and improve the entire lifecycle of the server fleet—from design/introduction to launch reviews, deployment, operation, and retirement.
- Automation: Develop and deploy tools to enhance automation, reliability, scalability, and operability of servers in the datacenter.
- Monitoring: Develop tools to improve availability, latency, and overall health of datacenter infrastructure, servers, and networks.
- Disaster Recovery: Troubleshoot complex technical issues in a high-pressure environment, perform root-cause analysis, and implement preventive measures; conduct incident response and postmortems.
- Cross-team Collaboration: Work with infrastructure architects, project managers, data center operations engineers, platform developers, supply chain teams, and internal customers; design and implement solutions for Core IDCs and CDN/Edge.
- On-call: Participate in on-call support across regions and incident response teams for production issues.
Minimum Qualifications:
- Bachelor's degree in Computer Science, Electronic Engineering, a relevant technical field, or equivalent practical experience.
- At least 3 years of experience in areas such as server operations (Linux administration, scripting in Bash and Python) and server hardware troubleshooting, along with experience planning, delivering, and operating large-scale data centers in multiple countries.
- Proficiency in customizing and deploying tooling for monitoring, provisioning, fault management, and maintenance of new server hardware.
- Experience developing and maintaining monitoring software for more than 10,000 servers.
Preferred Qualifications:
- Data Center: Experience with OS installations, break-fix operations, and end-to-end infrastructure lifecycle projects, including new design-build or retrofit activities.
- GPU server operation and maintenance proficiency strongly preferred.
- Full Stack Software Development: Experience with RESTful APIs (Flask), JavaScript/Node.js, data stores (SQL databases and Redis), and Ansible for configuration management and deployment.
Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With products including TikTok, Lemon8, CapCut, Pico, Toutiao, Douyin, and Xigua, ByteDance connects people to content across global markets.
Why Join ByteDance
Inspiring creativity is at the core of ByteDance's mission. Our teams are global and diverse, working together to create value for communities and users. We encourage curiosity, humility, and impact, fostering an "Always Day 1" mindset to achieve meaningful breakthroughs for employees, the company, and users. Join us.
Diversity & Inclusion
ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. We celebrate diverse voices and aim to reflect the communities we reach.
Details
- Seniority level: Mid-Senior level
- Employment type: Full-time
- Job function: Information Technology
- Industries: Technology, Information and Internet