Production System Engineer, Infrastructure Engineering

1 week ago


Singapore ByteDance Full time

Responsibilities
About ByteDance
Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.
Why Join Us
Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we aim towards achieving every day. To us, every challenge, no matter how ambiguous, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At ByteDance, we create together and grow together. That's how we drive impact-for ourselves, our company, and the users we serve. Join us.
About the Team
The Infrastructure Engineering team supports the company's fast growth by building and operating hyperscale datacenters. The team manages the end to end lifecycle of server fleet, providing cloud solutions and various infrastructure services ensuring that they are scalable and are reliable.
Embark on an exciting expedition to explore the rapidly expanding ByteDance domain in the United States, Europe, and Asia. Here, the Infrastructure Engineering team is crafting monumental data citadels that encircle the planet, sheltering legions of hundreds of thousands of servers. As the maestro of our production systems, you will embark on a captivating odyssey, taming the life cycles of these servers. Your adventure will begin with the orchestration of their initial deployment, navigating the intricate terrain of OS installation, summoning services like a digital magician, and maintaining vigilant watch over our inventory. But, like any epic tale, there will be times of challenge when you become a troubleshooter extraordinaire, mending and restoring with unwavering dedication. Eventually, you'll guide them into the sunset, orchestrating their decommissioning and ensuring their rebirth through recycling, all while contributing to the pulsating rhythm of ByteDance's technological evolution.
**Key Responsibilities**:

- Responsibilities**:

- Operation: As a Production Systems Engineer, your mission is to contribute to enhancing the stability, efficiency, effectiveness, and scalability of our data center and cloud operations, platform, and service on a worldwide scale.
- Lifecycle Improvement: Engage in and improve the whole lifecycle of Infrastructure systems - from system design consulting through to launch reviews, deployment, operation, and refinement.
- Automation: Develop & deploy tools and solutions to improve the automation, reliability, scalability, and operability of services.
- Monitoring: Deliver tools and solutions to improve monitor availability, latency, and overall service, server and Cloud infrastructure and network health.
- Disaster Recovery: Troubleshoot and resolve complex technical issues in a high-pressure, time-sensitive environment. Conduct high-level root-cause analysis for service interruption and establish preventive measures. Practice sustainable incident response and postmortem.
- Cross-team Collaboration: Partner with stakeholders like infrastructure architects, project managers, data center operations engineers, platform developers, supply chain teams, and our internal customers to understand overarching business objectives. You will also have the opportunity to design and implement innovative solutions for our Core IDCs and CDN/Edge and Cloud Services.
- On-call: Participate in our on-call across regions and incident response teams to solve critical problems in production.

**Qualifications**:
Minimum Qualifications
- Education: Bachelor's degree in Computer Science, Electronic Engineering, relevant technical field, or equivalent practical experience.
- Experience: Minimal 3 years of experience in at least one of the areas below:

- Linux System Administration: Proficient in Linux system administration tasks. Have an in-depth understanding of Linux kernels, drivers, and modules. Be capable of writing scripts in Bash and Python to automate routine system operations, thereby enhancing efficiency and reducing manual effort. This includes skills such as system configuration, performance tuning, and security management within the Linux environment.
- Tooling Adaptation, Deployment, and Maintenance: Skilled in adapting operation and maintenance tools to meet specific requirements for new server hardware. Capable of handling the entire lifecycle of software tools, from deployment to ongoing maintenance. This involves tasks related to facilitating the monitoring of server performance, provisioning resources effectively, managing fault handling in a timely manner, and carrying out repairs to ensure the seamless operation of new server hardware.
- Communication: Experience



  • Singapore Outscal Technologies Full time

    About the job SummaryBy Outscal ByteDance seeks a Production System Engineer to enhance data center operations, improve system lifecycle, automate processes, and ensure high availability. Strong Linux, automation, and server hardware knowledge are essential. - Responsibilities - About ByteDance Founded in 2012, ByteDance's mission is to inspire creativity...


  • Singapore ByteDance Full time

    Overview Get AI-powered advice on this job and more exclusive features. The Infrastructure Engineering team supports the company's fast growth by building and operating hyperscale datacenters. The team manages the end-to-end lifecycle of the server fleet, providing cloud solutions and various infrastructure services to ensure they are scalable and reliable....


  • Singapore ByteDance Full time

    Overview Get AI-powered advice on this job and more exclusive features. The Infrastructure Engineering team supports the company's fast growth by building and operating hyperscale datacenters. The team manages the end-to-end lifecycle of the server fleet, providing cloud solutions and various infrastructure services to ensure they are scalable and...


  • Singapore Ascendion Full time

    Get AI-powered advice on this job and more exclusive features. Direct message the job poster from Ascendion Empowering Digital Transformation & Building Future-Ready Teams to Drive Innovation Role: We are seeking a system engineer to join the Day 2 Operations team, responsible for ensuring the stability, resilience, and performance of mission-critical...


  • Singapore Tek Infotree Sdn Bhd Full time

    **Position: System Engineer (IT infra)** Salary range: SGD$7700-9300 monthly Company background: IT Services and IT Consulting Office location: Singapore Working days: Mon to Fri Employment type: Contract for 1 year renewable basis & possible convert to permanent role **Responsibilities**: - Assist project or team leads in evaluating and recommending...


  • Central Singapore Sopra Steria I2S Full time

    **Company**: Sopra Steria is a listed European tech leader specializes in Consulting, Digital Service, and Software. We have 50,000 employees worldwide located in different regions (Europe, North America and Asia), whereby Singapore is the HQ for APAC. EvaGroup Asia Pacific is part of Sopra Steria I2S APAC, in charge of Infrastructure, Cloud and...


  • Singapore Assurity Trusted Solutions Pte Ltd Full time

    4 weeks ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. Assurity Trusted Solutions (ATS) is a wholly owned subsidiary of the Government Technology Agency (GovTech). As a Trusted Partner over the last decade, ATS offers a comprehensive suite of products and services ranging from infrastructure and...


  • Singapore Volt Full time

    Location: - Singapore- Job Type: - Permanent- Salary: - S$4500 - S$6500 per month- Reference: - BBBH10447_1658460842- Contact: - Jeanette Yeo- **Infrastructure Systems Engineer - 12 Months - Singapore** **What's in it for you**: - Tech Refresh, Technology 2.0 - Niche Domain, World-Renowned Gaming Applications - Agile Methodologies **Day-to-Day Task**: -...

  • Eiac Engineer

    1 week ago


    Singapore Keppel Infrastructure Full time

    Conduct engineering work process for instrumentation, control, automation and information system of projects - Perform design work for I&C, automation system, including system integration, logic control, P&ID, design calculations, functional description, design technical specification, estimation, IT system integration and related documents in compliance...


  • Singapore MOHAN MANAGEMENT CONSULTANTS PTE LTD Full time

    Roles & Responsibilities On behalf of our client we are looking for Infrastructure System Engineer to work one on one basis on various tasks. Role: Join the Infrastructure team and play a key role in keeping mission-critical systems secure, reliable, and scalable. You'll work on a mix of on-premises and cloud technologies, support automation...