Promotion Site Reliability Engineer

1 week ago


Singapore Shopee Full time

DepartmentEngineering and Technology- LevelExperienced (Individual Contributor)- LocationSingaporeThe Engineering and Technology team is at the core of the Shopee platform development. The team is made up of a group of passionate engineers from all over the world, striving to build the best systems with the most suitable technologies. Our engineers do not merely solve problems at hand; We build foundations for a long-lasting future. We don't limit ourselves on what we can or can't do; we take matters into our own hands even if it means drilling down to the bottom layer of the computing platform. Shopee's hyper-growing business scale has transformed most "innocent" problems into huge technical challenges, and there is no better place to experience it first-hand if you love technologies as much as we do.

**About the Team**:

- As a business SRE, you'll manage the technical operations of Shopee's core marketplace businesses, including product lines such as shopee voucher management, shopee discount/coins management, shopee selling listing online, shopee intelligence and data, and more. Our goal is to construct and sustain vast, robust, and highly efficient distributed systems, striving to maximize system availability and performance while minimizing costs. Consequently, you will not only contribute to the development of multiple full-stack platforms and solutions but also create your own. This role will frequently expose you to challenges in both technical operations and software engineering. Your involvement will require a deep dive into Shopee's development and business operations cycle to ensure scalability even in the face of rapid system evolution. Your responsibilities will span every aspect, from designing business development to optimizing data centers, networks, and operating systems.- Set up, deploy and configure marketplace services in the private cloud platform.
- Continuously improve the marketplace services in the private cloud, including but not limited to stress test automation, capacity management, service autoscaler, disaster recovery, chat operations, knowledge base management, SOP automation, dynamic service protection, etc.
- Administer and maintain the servers of marketplace services and all the dependent middlewares.
- Deep dive into Marketplace core product lines, and setup and run proof of concepts to optimize the services running in private cloud.
- Ensure reliability of Shopee Marketplace all year round, and through all campaigns.
- Fun and energetic team culture with strong emphasis on learning, sharing and growth.
- Wide exposure to enable rapid growth in personal skills and career.
- 50:50 time spent between technical operations and software engineering.

**Requirements**:

- Bachelor's degree or higher in Bachelor's degree or higher in Statistics, Mathematics, Computer Science, Information Technology, Programming & Systems Analysis, Engineering or other related disciplines.
- Minimum 3 years’ work experience as a site reliability engineer.
- Experience with site reliability engineering concepts and tools.
- Experience with monitoring tools like Prometheus, Zabbix, Grafana, etc.
- Experience with load balancing tools like LVS, Nginx, OpenResty, HAProxy, etc.
- Experience with container technology such as Docker, Kubernetes, etc.
- Experience with load testing, capacity management, and campaign preparation.
- Good computer science fundamentals: data structures and algorithms, operating systems, computer networking / security, virtualization, containerization, etc.
- Individual traits that we are looking for: fast learning ability and a good team player, strong analytical and problem-solving skills, ability to adapt and thrive in a dynamic work environment, passionate and possessing a strong sense of ownership.



  • Singapore IDEMIA Full time

    Join to apply for the Site Reliability Engineer role at IDEMIA Join to apply for the Site Reliability Engineer role at IDEMIA Get AI-powered advice on this job and more exclusive features. PurposeThis role plays a critical part in ensuring reliability, scalability, and performance of our systems and services. You will work closely with development and...


  • Singapore TRUEWATCH TECHNOLOGY INC PTE. LTD. Full time

    **Responsibility**: - Run production environment by monitoring availability and taking a holistic view of the system health. - Achieve site reliability automation, minimize system downtime, and reduce site reliability cost. - Manage risks and resolves issues that affect the release scope, schedule and quality. - Suggest architecture improvements, push for...


  • Singapore JJ Consulting Services Full time

    Our Client is a fast growing company in Singapore, who is seeking to recruit a Site Reliability Engineer. **Site Reliability Engineer** **Key Roles & Responsibilities** - Providing ancillary support of Enterprise-Grade Products and solutions at customer's sites - Ironing out deployment issues or challenges that our customers may face - Responsible for...


  • Singapore Tencent Full time

    Join to apply for the Senior Site Reliability Engineer role at Tencent 1 day ago Be among the first 25 applicants Join to apply for the Senior Site Reliability Engineer role at Tencent Business Unit Tencent Games was established in 2003. We are a leading global platform for game development, operations and publishing, and the largest online game community in...


  • Singapore Beijing Foreign Enterprise Management Consultants Co.,Ltd. Full time

    Direct message the job poster from Beijing Foreign Enterprise Management Consultants Co.,Ltd. On behalf of Huawei, a world-renowned information and communication technology company, we are seeking passionate and talented individuals to join our team as Site Reliability Engineer Overview On behalf of Huawei, a world-renowned information and communication...


  • Singapore RigNet Full time

    About us One team. Global challenges. Infinite opportunities. At Viasat, we’re on a mission to deliver connections with the capacity to change the world. For more than 35 years, Viasat has helped shape how consumers, businesses, governments and militaries around the globe communicate. We’re looking for people who think big, act fearlessly, and create an...


  • Singapore Binance Full time

    Binance is the global blockchain company behind the world’s largest digital asset exchange by trading volume and users, serving a greater mission to accelerate cryptocurrency adoption and increase the freedom of money. Are you looking to be a part of the most influential company in the blockchain industry and contribute to the crypto-currency revolution...


  • Singapore ABAXX SINGAPORE PTE. LTD. Full time

    Site Reliability Engineer - Networking We are seeking competent candidate joining our Infrastructure Team for the mission building and operating MAS regulated marketplace and clearing house. This role is ideal for someone with a strong foundation in AWS services, infrastructure as code, and cloud security, who is passionate about building scalable, secure,...


  • Singapore Point72 Full time

    Join to apply for the Site Reliability Engineer role at Point72 About the role As part of Point72’s Technology Team, you will focus on developing and maintaining complex, distributed, real-time systems that support our Global Macro business. Your responsibilities will include optimizing operations through automation, building foundational SRE components,...


  • Singapore People Profilers Full time

    Job Description: **Responsibilities**: - Support services before they go live through activities such as system design consulting and launch reviews. - Develop and maintain tools, re-designing capacity planning infrastructure for greater scalability. - Troubleshooting, diagnosing and fixing software issues. - Suggesting architecture improvements, pushing...