System Reliability Specialist

4 days ago


Singapore TIKTOK PTE. LTD. Full time
Company Overview
TikTok PTE. LTD. is a leading short-form mobile video platform that inspires creativity and brings joy to users worldwide.

Job Description
We are seeking an experienced System Reliability Specialist to join our Recommendation Architecture Team. As a key member of the team, you will be responsible for building and optimizing the architecture for our recommendation system to provide a stable and best experience for TikTok users.

The successful candidate will have the opportunity to sharpen their expertise in coding, performance analysis, large-scale system operation, and hardware/capacity decision-making. You will work closely with software engineers to design and implement DevOps solutions to improve the efficiency of the R&D process.

Responsibilities
- Ensure the reliability and operation optimization for large-scale clusters of TikTok Recommendation System.
- Continuous integration and delivery of core services, optimizing the efficiency and automation of operation, and improving service stability and R&D efficiency.
- Collaborate with software engineer to design and implement DevOps solutions to Improve the efficiency of the entire R&D process.
- Work with computer hardware engineers to integrate hardware and software systems and develop specifications and performance requirements.

Qualifications
- Bachelor's degree or above in computer science, software engineering, or a related field
- Operation experience of large-scale systems, familiar with system operation skills on Linux and network.
- Good programming experience with at least one of the following languages: Shell/Python/Perl/Go/C++. Expertise in analyzing, and troubleshooting large-scale distributed systems.
- At least 3 years of relevant experience.

About Us
TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. We celebrate our diverse voices and create an environment that reflects the many communities we reach. We aim to inspire creativity and bring joy to our users.

  • Singapore TSTAR RECRUIT PTE. LTD. Full time

    TSTAR RECRUIT PTE. LTD. is seeking a talented System Reliability Specialist to join our team.Job ResponsibilitiesThe ideal candidate will have a strong understanding of x86 PC architecture and experience in the PC industry. They will be responsible for analyzing and resolving system performance issues, collaborating with engineering teams to identify root...


  • Singapore NATIONAL UNIVERSITY HEALTH SYSTEM PTE. LTD. Full time

    **Job Description:**National University Health System Pte. Ltd. is seeking a highly skilled individual to join our team as a Reliability Matters Risk Specialist.The incumbent will play a pivotal role in supporting and enhancing risk management, governance, and data protection initiatives. This includes identifying gaps and providing practical recommendations...

  • Systems Specialist

    2 weeks ago


    Singapore MARTEQ ENERGY PTE. LTD. Full time

    Systems Specialist Duties - Develop, configure, install, and maintain computer systems - Identify and diagnose network and system problems and provide solutions - Manage data backups and disaster recovery operations - Monitor system performance and security - Research and recommend system enhancements to improve performance and reliability - Manage system...

  • Systems Specialist

    2 weeks ago


    Singapore JAIDAN PTE. LTD. Full time

    Systems Specialist Duties - Develop, configure, install, and maintain computer systems - Identify and diagnose network and system problems and provide solutions - Manage data backups and disaster recovery operations - Monitor system performance and security - Research and recommend system enhancements to improve performance and reliability - Manage system...


  • Singapore Johnson Controls Full time

    About the RoleWe are looking for a Product Reliability Specialist to join our team at Johnson Controls International plc. In this role, you will be responsible for providing technical support for our BE products and driving product reliability through the Continuous Improvement Process.Main ResponsibilitiesProvide technical support for BE products to Field...


  • Singapore ADVANCE MARINE & LOGISTICS PTE. LTD. Full time

    Job OverviewAs an Equipment Reliability Specialist at ADVANCE MARINE & LOGISTICS PTE. LTD., you will play a critical role in ensuring the reliability and efficiency of our refrigerated containers. Your main responsibilities include performing troubleshooting and repair works for our refrigerated container fleet, as well as coordinating with our Control...


  • Singapore People Profilers Full time

    Job Position:We are hiring a skilled Asset Reliability Specialist to join our team at People Profilers Pte Ltd. As a key member of our asset reliability team, you will be responsible for developing and implementing strategies to improve equipment reliability and reduce downtime.Main Responsibilities:Analyze equipment failure data to identify trends and...


  • Singapore BARCLAYS EXECUTION SERVICES LIMITED Singapore Branch Full time

    We are seeking a talented individual to fill the role of Site Reliability Specialist - Compute Infrastructure. In this position, you will be responsible for ensuring the reliability, availability, and scalability of the bank's distributed super-computer.About the Role:As a Site Reliability Specialist, you will work closely with the Engineering and...


  • Singapore BYTEDANCE PTE. LTD. Full time

    About UsAt BYTEDANCE PTE. LTD., we're passionate about inspiring creativity and enriching life through our suite of innovative products, including TikTok and others.The RoleWe're seeking a highly skilled Site Reliability Engineer to join our Privacy and Security (PnS) team. As a key member of our SRE team, you'll be responsible for planning, building, and...


  • Singapore DBS Bank Limited Full time

    OverviewDigital transformation is driving significant growth and innovation for DBS Bank Limited. As a leading financial institution, we are committed to delivering exceptional customer experiences through technology. The role of Cloud Platform Reliability Specialist plays a vital part in ensuring the reliability and efficiency of our cloud-based...


  • Singapore METACOMP PTE. LTD. Full time

    Job OverviewMETACOMP PTE. LTD. is a leading technology company that provides innovative solutions to businesses worldwide. We are seeking a highly skilled Reliability Systems Engineer to join our team and ensure the robustness and reliability of our API services.The successful candidate will have a strong background in software engineering, with expertise in...


  • Singapore ST Engineering Full time

    Company OverviewWe are a leading global engineering company that provides innovative solutions to meet the evolving needs of our customers. As a System Reliability Engineer, you will play a key role in ensuring the effectiveness of our products and services.Job DescriptionThe primary responsibility of this role is to perform Reliability, Maintainability, and...


  • Singapore Advanced Micro Devices Full time

    About the RoleAs a Reliability Engineering Specialist at Advanced Micro Devices, you will be responsible for leading advanced thermal/mechanical reliability teams. Your primary focus will be on architecting, designing, and developing comprehensive quality and reliability assessments for AMD's cutting-edge processor and accelerator products. You will work...


  • Singapore Takeda Pharmaceutical Full time

    **Job Title: Reliability Specialist I** **Location: Woodlands, Singapore** **About the role**: **How you will contribute**: - Accountable and Responsible for Change Control as Change owner, reliability OOT quality deviation, Manage calibration strategy, assessment, and PM Optimization - Accountable and Responsible for Quality deviation which related to...


  • Singapore Advanced Micro Devices Full time

    WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences – the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our...


  • Singapore BARCLAYS EXECUTION SERVICES LIMITED Singapore Branch Full time

    Job DescriptionAs a Compute Grid Site Reliability Engineer - AVP at Barclays Execution Services Limited Singapore Branch, you will play a pivotal role in building and maintaining the bank's distributed super-computer which runs its compute-intensive workloads. This system harnesses CPU capacity sourced from on-prem and public cloud platforms.**Key...


  • Singapore METACOMP PTE. LTD. Full time

    We are looking for a Software Reliability Specialist to join our team at Metacomp PTE. LTD. and contribute to the success of our API services by ensuring their robustness and reliability.Job DescriptionFunctionality Testing: Design and execute tests to validate that APIs operate as expected and meet business needs.Performance Optimization: Conduct...


  • Singapore TIKTOK PTE. LTD. Full time

    At TikTok PTE. LTD., we believe that creativity and joy can be found in every aspect of our lives. As a System Stability Assurance Lead, you will play a critical role in ensuring that our users have a seamless experience on our platform.About the RoleThis position requires a strong understanding of high availability systems, software reliability, and system...


  • Singapore LIFE TECHNOLOGIES HOLDINGS PTE. LTD. Full time

    About UsLIFE TECHNOLOGIES HOLDINGS PTE. LTD. is a world leader in serving science, with a mission to enable our customers to make the world healthier, cleaner, and safer.Job DescriptionWe are looking for a Reliable Algorithm Specialist to join our team. The successful candidate will have experience in developing testing frameworks and pipelines to automate...


  • Singapore Goldman Sachs Bank AG Full time

    About the Position:We are seeking an experienced Systems Engineering Team Lead to join our team and help us lead the development and maintenance of our Managed File Transfer (MFT) platform, SFX.The successful candidate will have a deep understanding of operating systems, network fundamentals, and experience in leading cross-functional teams to manage and...