Production System Engineer, Infrastructure Engineering

2 weeks ago


Singapore ByteDance Full time

Overview Be among the first applicants to join ByteDance as a Production Systems Engineer in our Infrastructure Engineering team. The team supports the company's growth by building and operating hyperscale datacenters, managing the end-to-end lifecycle of the server fleet, and providing cloud solutions and infrastructure services that are scalable and reliable. Embark on an expedition to ByteDance's global data centers and contribute to the orchestration, deployment, operation, and eventual retirement of production servers. Responsibilities Operation: Contribute to the stability, efficiency, effectiveness, and scalability of data center and server operations, platforms, and services on a worldwide scale. Lifecycle Enhancement: Participate in and enhance the entire lifecycle of the server fleet—from design and introduction through launch reviews, deployment, operation, and retirement. Automation: Develop and deploy tools to improve automation, reliability, scalability, and operability of servers in the datacenter. Monitoring: Develop and deploy tools to improve availability, latency, and overall health of datacenter infrastructure, servers, and networks. Disaster Recovery: Troubleshoot complex issues in high-pressure environments, perform root-cause analysis, and implement preventive measures and postmortems. Cross-team Collaboration: Work with infrastructure architects, project managers, data center operations engineers, platform developers, supply chain teams, and internal customers to align with business objectives; design and implement solutions for Core IDCs and CDN/Edge. On-call: Participate in on-call support across regions and incident response teams to address production issues. Qualifications Minimum Qualifications: Bachelor's degree in Computer Science, Electronic Engineering, or a related technical field, or equivalent practical experience. At least 3 years of experience in one or more of the following areas: Server Operations: Linux system administration, kernel/driver knowledge, Bash and Python scripting for automation, performance tuning, and security management. Server hardware understanding with experience in planning, delivery, and operation of large-scale data centers in multiple countries. Tooling Adaptation, Deployment, and Maintenance: Customizing operations/maintenance tools for new server hardware, monitoring, provisioning resources, fault management, and hardware upkeep. Experience developing and maintaining monitoring software for 10,000+ servers. Preferred Qualifications: Data Center experience across OS installation, break-fix operations, planning and operations of the infrastructure lifecycle, and design-build or retrofit activities for existing systems. Proficiency in operating and maintaining GPU servers. Full stack software development skills including RESTful APIs (Flask), JavaScript/Node.js, SQL, Redis, and familiarity with Ansible for configuration management and deployment. About Us Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With products including TikTok, Lemon8, CapCut and Pico, ByteDance also operates platforms for the China market such as Toutiao, Douyin, and Xigua. Why Join ByteDance ByteDance values creativity, curiosity, humility, and impact. We strive to create an inclusive environment that reflects the diverse communities we reach, and we are committed to celebrating diverse voices and experiences. Joining ByteDance means being part of a global, innovative team that aims to deliver meaningful breakthroughs for our customers and users. Seniority level Mid-Senior level Employment type Full-time Job function Information Technology Industries Technology, Information and Internet Referrals increase your chances of interviewing at ByteDance. #J-18808-Ljbffr


  • System Engineer

    12 hours ago


    Singapore Enterprise Infrastructure & Services Full time $80,000 - $120,000 per year

    We are looking for a service-oriented professional to join our 24x7 Enterprise Infrastructure & Services team.  In this role, you will ensure that our enterprise IT services are reliable, responsive, and user-friendly, supporting the day-to-day needs of PSA's workforce.  You will be responsible for supporting Windows Servers, Azure, Active Directory,...


  • Singapore ATT System Full time

    **Role and Responsibilities** - Planning, implementing, managing, monitoring, and upgrading technical and organization measures for clients’ ICT infrastructure. - Troubleshooting ICT infrastructure problems. - Integration and implementation of infrastructure products such as network, storage, and computing devices. - Follow project implementation lifecycle...


  • Singapore Temus Full time

    Join to apply for the Infrastructure Engineer (Systems) role at Temus. The Infrastructure Engineer (Systems) plays a key role in architecting, deploying, and operating mission‐critical systems within a highly secured and regulated environment. As a subject‐matter expert in infrastructure technologies, you will design and deliver end‐to‐end secure...


  • Singapore Atlas Full time

    Atlas is building the operating system for restaurants. Atlas is the easiest way to start, run, and grow any restaurant online and offline. Our products power hundreds of restaurants and process hundreds of millions in GMV each year. We are the team that previously built Grain, a venture-backed online restaurant, to millions in revenue. Our team and...


  • Singapore STAFFKING PTE. LTD. Full time

    Location: Central area **Position Overview**: This role is ideal for individuals who are passionate about designing and implementing technology solutions while providing technical support for smooth operations. As an Infrastructure Engineer, you will contribute to infrastructure planning, project execution, and end-user support to ensure seamless technology...


  • Singapore OCTA CONSULTANTS PTE. LTD. Full time

    **Job Description & Requirements**: - Ensure the smooth operation and security of IT systems - Knowledge and experience in Unix scripting, VMware Virtualization, Cloud and Middleware. - Effective provisioning, installation/configuration, operation, and maintenance of systems hardware and software and related infrastructure - Participates in technical...


  • Singapore SEDHA CONSULTING PTE. LTD. Full time

    **IT INFRASTRUCTURE ENGINEER (System Administration)**: The activities required to be performed shall include the following: a) Effective provisioning, installation/configuration, operation, and maintenance of systems hardware and software and related infrastructure; b) Participates in technical research and development to enable continuing innovation within...


  • Singapore Assurity Trusted Solutions Full time

    Assurity Trusted Solutions (ATS) is a wholly owned subsidiary of the Government Technology Agency (GovTech). As a Trusted Partner over the last decade, ATS offers a comprehensive suite of products and services ranging from infrastructure and operational services, authentication services, governance and assurance services as well as managed processes. In a...


  • Singapore OCTA METIER PTE. LTD. Full time

    **Job Description & Requirements: - Knowledge and experience in Unix scripting, VMware Virtualization, Cloud and Middleware. - Effective provisioning, installation/configuration, operation, and maintenance of systems hardware and software and related infrastructure - Participates in technical research and development to enable continuing innovation within...


  • Singapore Atlas Full time

    Senior Software Engineer, Product Infrastructure Atlas is building the operating system for restaurants — the easiest way to start, run, and grow any restaurant, both online and offline. The team at Atlas previously built Grain, a venture‐backed online restaurant that grew to millions in revenue. Atlas helps restaurants power online storefronts, POS...