Reliability Engineer for Machine Systems

3 weeks ago


Singapore, Singapore Shopee Full time

Position Title: Machine Reliability Engineer - Engineering Infra

Department: Engineering and Technology

Level: Entry Level

Location: Singapore

The Engineering and Technology division is integral to the development of the Shopee platform. Comprising a diverse group of dedicated engineers from around the globe, this team is committed to constructing the most effective systems utilizing the most appropriate technologies. Our engineers not only address immediate challenges but also lay the groundwork for a sustainable future. We embrace a proactive approach, delving deep into the foundational layers of our computing platforms. The rapid expansion of Shopee's business has turned many straightforward issues into significant technical challenges, providing an unparalleled opportunity for those passionate about technology.

About the Team:

The mission of the Shopee Tech Ops MRE (Machine Reliability Engineering) team is to guarantee the efficient and sustainable operation of the Shopee network and hardware on a 24/7 basis. This involves building and maintaining extensive hardware clusters for Site Reliability Engineering (SRE) and capacity management, focusing on cost efficiency and hardware performance. The team is responsible for delivering sustainable hardware resources and reliable network support services. MRE collaborates with the data center team to design and enhance network architecture, ensuring optimal hardware configurations through testing and selection based on business needs. The team also customizes stable and efficient operating systems, optimizes traditional operations through engineering solutions, and develops a comprehensive hardware monitoring system to enhance fault resolution efficiency.

Key Responsibilities:

  • Oversee the maintenance of servers and operating systems.
  • Manage system services including OOB/BMC/Firmware/Ansible.
  • Lead new generation server proof of concept and configuration selection.
  • Deliver effective OS/Server solutions tailored to business requirements.
  • Foster a dynamic and energetic team culture with a strong focus on learning, sharing, and personal growth.
  • Gain extensive exposure to facilitate rapid development of personal skills and career advancement.
  • Ensure the reliability of large-scale servers supporting Shopee/SeaMoney/Bank.

Qualifications:

  • Bachelor's degree or higher in Computer Science, Computer Engineering, Information Systems, or related fields.
  • Strong foundation in computer science principles: data structures and algorithms, operating systems, computer networking/security, virtualization, and containerization.
  • Proficient in software engineering and application architecture: backend/frontend development, design patterns, and middleware technologies including caching, databases, queues, and file storage.
  • Desirable personal attributes: quick learning capability, teamwork orientation, strong analytical and problem-solving skills, adaptability in a dynamic work environment, and a strong sense of ownership.

Preferred Skills:

  • Familiarity with DevOps concepts and tools.
  • Experience with Site Reliability Engineering concepts and tools.
  • Knowledge of automation tools such as Ansible, SaltStack, etc.
  • Experience with monitoring tools like Prometheus, Zabbix, Grafana, etc.
  • Familiarity with server management technologies such as out-of-band (OOB) management, BMC, and firmware tooling.

