Site reliability engineer

4 days ago


Singapore Sea Full time

The Engineering and Technology team is at the core of the Shopee platform development. The team is made up of a group of passionate engineers from all over the world, striving to build the best systems with the most suitable technologies. Our engineers do not merely solve problems at hand; we build foundations for a long-lasting future. We don't limit ourselves on what we can or can't do; we take matters into our own hands even if it means drilling down to the bottom layer of the computing platform. Shopee's hyper-growing business scale has transformed most "innocent" problems into huge technical challenges, and there is no better place to experience it first-hand if you love technologies as much as we do.

About the Team

The mission of the SRE (Site Reliability Engineering) team is to ensure the efficient and sustainable operation of Shopee 24x7, and to build and maintain large-scale, highly available, high-performance distributed systems with a focus on system availability and performance. The team combines traditional software engineering with technical operations. The SRE team needs to dive deep into the Shopee development lines to ensure that the system remains highly scalable as it rapidly evolves. From the perspective of stability and performance, the team's scope covers business development design, basic platform components (middleware, container scheduling, caching, object storage, etc.), OS optimization, and data center and network optimization. We use engineering and service-oriented approaches to streamline the inefficient, complicated work of traditional operations and maintenance, and we are committed to building a sound monitoring system to improve the efficiency of incident handling.

Job Description

  • Deep dive into development lines, learn and understand the mechanism of every application component, and promote product scalability, stability, and performance.
  • Set up, manage, and maintain Shopee product, middleware, and big-data applications and services.
  • Perform regular and ad-hoc server-side deployments, improve performance, and troubleshoot.
  • Design and develop automated technical operation platforms.
  • Manage capacity and resources.
  • Take responsibility for full-chain stress tests to enhance performance and remove redundancy in applications.
  • Prepare routine operation documentation.

Qualifications

Education: Bachelor's degree or above in Computer Science, Engineering, Information Systems, or related fields.

Experience: More than 2 years of relevant experience (candidates with no working experience are welcome to apply).

Technical Skills:

  • Extensive hands-on knowledge of Linux operating systems (Ubuntu, CentOS, etc.).
  • Highly familiar with computer networks (TCP/IP, DNS, etc.), computer organization, and operating systems.
  • Hands-on experience with at least one programming language: Bash, Python, or Go.
  • Strong analytical and problem-solving skills with the ability to thrive in a dynamic work environment.
  • Passionate, with a strong sense of responsibility.
  • Fast learner and a good team player.
  • Agile and detail-oriented.

Optional but Preferred Skills:

  • Experience with automation tools like Ansible or SaltStack.
  • Experience with monitoring tools like Prometheus, Zabbix, Grafana, etc.
  • Experience with load balancing tools like LVS, Nginx, OpenResty, or HAProxy.
  • Experience with container technology such as Docker and Kubernetes.
  • Experience with high-availability system design and server deployment processes.
  • Experience with SRE.
  • Experience with an Ops PaaS platform or Ops automation platform (e.g., CMDB).



  • Singapore HW Search & Selection Ltd Full time

    Site Reliability Engineer A new opportunity has arisen for a Site Reliability Engineer for a prestigious investment management firm in Singapore. You will be responsible for providing production support for the trading infrastructure. Your main responsibilities will include: Linux trading infrastructure support Providing Level II support Utilizing Python to...


  • Singapore Qlik Full time

    What makes us Qlik? A Gartner Magic Quadrant Leader for 14 years in a row, Qlik transforms complex data landscapes into actionable insights, driving strategic business outcomes. Serving over 40,000 global customers, our portfolio leverages pervasive data quality and advanced AI/ML capabilities that lead to better decisions, faster. We excel in integration...


  • Singapore Aptitude Asia Limited Full time

    Our client, a top-tier hedge fund, is looking to hire a talented Site Reliability Engineer to join their growing SRE team in Singapore. Job Responsibilities: Ensure high reliability, availability, and performance of applications throughout their lifecycle. Automate repetitive tasks and systematically address recurring issues. Generate innovative ideas for...

  • Process engineer

    3 weeks ago


    Singapore The Chemical Engineer Full time

    Why Patients Need You Whether you are involved in the design and development of manufacturing processes for products or supporting maintenance and reliability, engineering is vital to making sure customers and patients have the medicines they need, when they need them. Working with our innovative engineering team, you'll help bring medicines to the...


  • Singapore ACCESS PEOPLE (SINGAPORE) PTE. LTD. Full time

    Roles & Responsibilities: A global energy trading firm is transitioning to a data-centric platform and is seeking a Site Reliability Engineer to support this multi-year program. The role will focus on enhancing the reliability, scalability, and stability of the company's evolving platform. The successful candidate will work on integrating a new event-based,...


  • Singapore Qlik Full time

    Director of Regional Site Reliability Engineering. Qlik is seeking an experienced leader to oversee the development and scaling of our regional Site Reliability Engineering (SRE) organization in APAC. This role will be instrumental in ensuring the availability, scalability, and reliability of our services. About Qlik: We are a global company that transforms...


  • Singapore APPLE SERVICES PTE. LTD. Full time

    Roles & Responsibilities. Summary: The Apple Services Engineering (ASE) team is one of the most exciting examples of Apple's long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, Fitness+ and Apple Books. And they do it on a massive scale, meeting Apple's high expectations with...


  • Singapore DEUTSCHE BANK AKTIENGESELLSCHAFT Full time

    About the Role: We are seeking an experienced Site Reliability Engineer to join our team at Deutsche Bank AKTIENGESELLSCHAFT. As a Site Reliability Engineer, you will play a critical role in ensuring the availability, performance, and security of our cloud-based infrastructure.


  • Singapore Luxoft Full time

    Project Description With award-winning mobile banking apps and trading systems, our technology platforms help Bank deliver best-in-class products to clients. Naturally, we make sure that the phones work, emails are delivered and PCs run - but we also develop innovative collaboration platforms and workspaces that help our people share their knowledge, their...


  • Singapore Tower Research Capital Full time

    Tower Research Capital Job Description. Job Title: Site Reliability Engineer. Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our team at Tower Research Capital. The successful candidate will be responsible for ensuring the continuous operation of our Linux-based trading infrastructure and addressing day-to-day operational needs. Key...


  • Singapore Ripple Labs Singapore Full time

    As a Senior Site Reliability Engineer at Ripple Labs Singapore, you will be responsible for ensuring the high availability and scalability of our systems. Your primary goal will be to design, implement, and maintain a robust and efficient infrastructure that can handle high traffic and complex distributed systems. Key Responsibilities: Design and implement...


  • Singapore This is an IT support group Full time

    Singapore, Singapore Relocation friendly DevOps BCM Industry 02/12/2024 Req. VR-109808 Project Description With award-winning mobile banking apps and trading systems, our technology platforms help Bank deliver best-in-class products to clients. Naturally, we make sure that the phones work, emails are delivered and PCs run - but we also develop innovative...


  • Singapore Ripple Labs Singapore Full time

    Job Summary: We are seeking a highly skilled Senior Site Reliability Engineer to join our team at Ripple Labs Singapore. As a key member of our infrastructure team, you will be responsible for ensuring the high availability and reliability of our systems. Key Responsibilities: Design, implement, and maintain high availability systems and infrastructure. Collaborate...


  • Singapore Helius Full time

    Job Title: Site Reliability Engineer. Job Summary: Helius is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, implementing, and operating highly scalable and reliable systems. Your main focus will be on ensuring the smooth operation of our services, resolving technical issues,...


  • Singapore Tata Consultancy Services Limited Full time

    Roles & Responsibilities: The Site Reliability Engineer (SRE) combines software development and system engineering to build and run distributed solutions in a secured multi-tier heterogeneous environment to safeguard, provide and continuously improve the software and systems behind the organization's cloud platform solutions. With a vigilant eye on...


  • Singapore Sea Full time

    About Sea: Sea is a cutting-edge technology company with a hyper-growing business scale, transforming complex problems into technical challenges. Our team of passionate engineers is dedicated to delivering world-class experiences for our users. The Games Site Reliability Engineer (SRE) team at Sea Labs Indonesia plays a crucial role in ensuring the stability...


  • Singapore Luxoft Full time

    Singapore, Singapore Relocation friendly DevOps BCM Industry 02/12/2024 Req. VR-109808 Project description With award-winning mobile banking apps and trading systems, our technology platforms help Bank deliver best-in-class products to clients. Naturally, we make sure that the phones work, emails are...


  • Singapore Infosight Software And Consulting Services Private Limited Full time

    We can consider EP & Singaporean / PR. Job Role: Site Reliability Engineer. Experience: 5 to 7 years. Work Location: Singapore. Budget: 7800-8000 SGD. Duration: 6 months, 12 months renewable contract. Job Description: Strong hands-on experience with using and designing VMware solutions such as NSX-T, vRealize Suite, vSphere/vCenter is mandatory. Strong working...


  • Singapore Snaphunt Full time

    The Opportunity: We're seeking an experienced Site Reliability Engineer to empower users with a rich feature set, high availability, and stellar performance at First Digital Finance Corp. As we expand customer deployments, the ideal candidate will deliver insights from massive-scale data in real-time, collaborating with a cross-functional team to develop...