Production System Engineer, Infrastructure Engineering

1 week ago


Singapur, Singapore ByteDance Full time

Overview Be among the first applicants to join ByteDance as a Production Systems Engineer in our Infrastructure Engineering team. The team supports the company's growth by building and operating hyperscale datacenters, managing the end-to-end lifecycle of the server fleet, and providing cloud solutions and infrastructure services that are scalable and reliable. Embark on an expedition to ByteDance’s global data centers and contribute to the orchestration, deployment, operation, and eventual retirement of production servers. Responsibilities Operation: Contribute to the stability, efficiency, effectiveness, and scalability of data center and server operations, platforms, and services on a worldwide scale. Lifecycle Enhancement: Participate in and enhance the entire lifecycle of the server fleet—from design and introduction through launch reviews, deployment, operation, and retirement. Automation: Develop and deploy tools to improve automation, reliability, scalability, and operability of servers in the datacenter. Monitoring: Develop and deploy tools to improve availability, latency, and overall health of datacenter infrastructure, servers, and networks. Disaster Recovery: Troubleshoot complex issues in high-pressure environments, perform root-cause analysis, and implement preventive measures and postmortems. Cross-team Collaboration: Work with infrastructure architects, project managers, data center operations engineers, platform developers, supply chain teams, and internal customers to align with business objectives; design and implement solutions for Core IDCs and CDN/Edge. On-call: Participate in on-call support across regions and incident response teams to address production issues. Qualifications Minimum Qualifications: Bachelor's degree in Computer Science, Electronic Engineering, or a related technical field, or equivalent practical experience. At least 3 years of experience in one or more of the following areas: Server Operations: Linux system administration, kernel/driver knowledge, Bash and Python scripting for automation, performance tuning, and security management. Server hardware understanding with experience in planning, delivery, and operation of large-scale data centers in multiple countries. Tooling Adaptation, Deployment, and Maintenance: Customizing operations/maintenance tools for new server hardware, monitoring, provisioning resources, fault management, and hardware upkeep. Experience developing and maintaining monitoring software for 10,000+ servers. Preferred Qualifications: Data Center experience across OS installation, break-fix operations, planning and operations of the infrastructure lifecycle, and design-build or retrofit activities for existing systems. Proficiency in operating and maintaining GPU servers. Full stack software development skills including RESTful APIs (Flask), JavaScript/Node.js, SQL, Redis, and familiarity with Ansible for configuration management and deployment. About Us Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With products including TikTok, Lemon8, CapCut and Pico, ByteDance also operates platforms for the China market such as Toutiao, Douyin, and Xigua. Why Join ByteDance ByteDance values creativity, curiosity, humility, and impact. We strive to create an inclusive environment that reflects the diverse communities we reach, and we are committed to celebrating diverse voices and experiences. Joining ByteDance means being part of a global, innovative team that aims to deliver meaningful breakthroughs for our customers and users. Seniority level Mid-Senior level Employment type Full-time Job function Information Technology Industries Technology, Information and Internet Referrals increase your chances of interviewing at ByteDance. #J-18808-Ljbffr



  • Singapur, Singapore Assurity Trusted Solutions Pte Ltd Full time

    4 weeks ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. Assurity Trusted Solutions (ATS) is a wholly owned subsidiary of the Government Technology Agency (GovTech). As a Trusted Partner over the last decade, ATS offers a comprehensive suite of products and services ranging from infrastructure and...


  • Singapur, Singapore Temus Full time

    Join to apply for the Infrastructure Engineer (Systems) role at Temus. The Infrastructure Engineer (Systems) plays a key role in architecting, deploying, and operating mission‑critical systems within a highly secured and regulated environment. As a subject‑matter expert in infrastructure technologies, you will design and deliver end‑to‑end secure...


  • Singapur, Singapore Assurity Trusted Solutions Full time

    Assurity Trusted Solutions (ATS) is a wholly owned subsidiary of the Government Technology Agency (GovTech). As a Trusted Partner over the last decade, ATS offers a comprehensive suite of products and services ranging from infrastructure and operational services, authentication services, governance and assurance services as well as managed processes. In a...


  • Singapur, Singapore Assurity Trusted Solutions Pte Ltd Full time

    Assurity Trusted Solutions (ATS) is a wholly owned subsidiary of the Government Technology Agency (GovTech). As a Trusted Partner over the last decade, ATS offers a comprehensive suite of products and services ranging from infrastructure and operational services, authentication services, governance and assurance services as well as managed processes. In a...


  • Singapur, Singapore IDEMIA Full time

    Senior System Engineer (Network & Infrastructure) – IDEMIA Purpose: This role develops the core technical platform, capabilities, and services that efficiently support business processes and data while optimizing cost in close collaboration with the solution or software architect. Key Missions System Implementation & Optimization Lead the implementation...


  • Singapur, Singapore GENESIS NETWORKS PTE LTD Full time

    Overview An Infrastructure Engineer (IE) is responsible for designing, building, deploying, and maintaining the IT infrastructure using the latest technology for our Customers as well as internal in house systems. An Infrastructure Engineer requires all the IT systems that support businesses of any size to function efficiently. Infrastructure here includes...


  • Singapur, Singapore PLT Engineering Full time

    Get to Know the Team The AI Platform team empowers Grab teams to leverage advanced AI seamlessly and effectively. We're building cutting-edge tools and infrastructure to democratize AI capabilities, accelerate innovation, and enhance Grab's products and services at scale. Get to Know the Role As a Principal Machine Learning Engineer focused on AI...


  • Singapur, Singapore DRW Full time

    Join to apply for the Infrastructure Engineer role at DRW Join to apply for the Infrastructure Engineer role at DRW DRW is a diversified trading firm with over 3 decades of experience bringing sophisticated technology and exceptional people together to operate in markets around the world. We value autonomy and the ability to quickly pivot to capture...


  • Singapur, Singapore Paradigm Full time

    Join to apply for the Infrastructure Engineer role at Paradigm About Paradex Paradex isn’t just another decentralized exchange—it’s a Super App. We’ve combined three powerful financial primitives: Exchange, Asset Management, and Borrow/Lend Markets, all seamlessly composable and accessible through one unified account that uses your entire portfolio...

  • Systems Engineer

    7 days ago


    Singapur, Singapore St Engineering Full time

    Overview Select how often (in days) to receive an alert: Prepare Requirement Tree in accordance with Tender/Contract requirements Prepare System Engineering Management Plan (SEMP), Work Breakdown Structure (WBS), project schedule, System Requirement Compliance Table and System Family Tree Prepare System Design Review (SDR), Preliminary Design Review (PDR),...