System Machine Reliability Engineer

4 months ago


Singapore, Singapore · Shopee · Full time

System Machine Reliability Engineer - Engineering Infra
Department: Engineering and Technology
Level: Experienced (Individual Contributor)
Location: Singapore

The Engineering and Technology team is at the core of the Shopee platform development. The team is made up of a group of passionate engineers from all over the world, striving to build the best systems with the most suitable technologies. Our engineers do not merely solve the problems at hand; we build foundations for a long-lasting future. We don't limit ourselves to what we can or can't do; we take matters into our own hands, even if it means drilling down to the bottom layer of the computing platform. Shopee's hyper-growing business scale has transformed most "innocent" problems into huge technical challenges, and there is no better place to experience this first-hand if you love technology as much as we do.

About the Team:

The mission of the Shopee Tech Ops MRE (Machine Reliability Engineering) team is to keep the Shopee network and hardware layer operating efficiently and sustainably 24x7, building and maintaining massive hardware clusters for SRE with attention to capacity, cost and hardware performance. The team provides sustainable hardware resources and stable network support services. MRE works with the data center team to design and optimise the network architecture; provides suitable hardware configurations through hardware testing and selection according to business requirements; customises a stable and efficient OS; improves traditional operations through engineering and service tooling; and builds a complete hardware monitoring system to make fault handling more efficient.

Job Description:

  • Responsible for the maintenance of the OS and servers.
  • Responsible for system services such as NTP/SMTP/Ansible/SaltStack.
  • Responsible for the maintenance of the CI/CD pipeline.
  • Provide efficient and effective OS/server solutions according to business needs.

Requirements:

  • Bachelor's or higher degree in Computer Science, Engineering, Information Systems or related fields.
  • Proficient in the Linux operating system.
  • Familiar with x86 hardware architecture, including CPU, GPU, SSD and PCIe.
  • Skilled in using a variety of system management tools, with experience in performance benchmarking; familiar with TCP/IP and basic networking concepts.
  • Experience managing large systems at an Internet company is preferred.

The skills below are optional but preferred:

  • RHCE/RHCA certification.
  • Experience with Ansible/SaltStack.
  • Experience with SMTP/POP3/IMAP/NTP.
  • Experience with CMDB development.
