Senior PaaS Site Reliability Engineer

6 months ago


Singapur, Singapore Tencent Full time

Responsibilities:

About the Company
Tencent is a leading global technology company focused on connecting people and developing innovative products and services that improve the quality of life of people around the world. Founded in 1998 and publicly traded on the Hong Kong Stock Exchange since 2004, Tencent offers a variety of products and services, including leading communication and social platforms (Weixin/WeChat), high-quality entertainment (from video games, music, TV and film, sport and literature), FinTech (WeChat Pay and QQ Wallet) and industry-leading cloud products and services.

Cloud & Smart Industries Group (CSIG) is responsible for promoting the company's cloud and industry Internet strategy. CSIG explores the interactions between users and industries to create innovative solutions for smart industries via technological advancements such as cloud, AI, and network security. While driving the digitalization of retail, medical, education, transportation and other industries, CSIG helps companies serve users in smarter ways, building a new ecosystem of intelligent industries that connect users and businesses.

Position Overview

Join Tencent Cloud as a Senior PaaS SRE and be at the forefront of optimizing and upgrading cloud product architecture and operations. You'll play a vital role in ensuring the stability and reliability of Tencent Cloud's PaaS products through monitoring, troubleshooting, and driving automation initiatives. With your deep understanding of cloud product architecture and technical principles, you'll push the boundaries to enhance operational efficiency and drive cost reduction while ensuring the highest standards of service delivery.

Responsibilities

Monitor and maintain Tencent Cloud's PaaS products to ensure stability and reliability, resolving technical issues and mitigating risks to ensure smooth operations in various technical scenarios. Drive optimization and upgrades of product architecture and operations by deeply understanding cloud product architecture and technical principles. Promote the implementation of automated operations, high-availability component construction, and high-availability drills to enhance operational efficiency. Utilize tools or platforms such as CI/CD, intelligence, and data to improve the overall efficiency of the operations team, reducing operational costs and avoiding inefficient repetitive tasks.

Requirements:

Bachelor's degree or above in Computer Science or related fields. Over 5 years of experience in development, architecture design, and system operations, preferably with experience in large-scale distributed system operations in the cloud. Experience with public cloud platforms such as AWS, Azure, GCP, or Tencent Cloud, with preference given to those with cloud certification and cloud operations experience. Profound understanding of distributed systems and be well-versed in commonly used open-source components in the internet industry, including CDN, Nginx, microservices, Kubernetes, Redis, Kafka, MySQL, HBase, and others. Proficiency in one or more programming languages such as Java, Go, Shell, and Python scripting. Strong expertise in Linux system, TCP/IP protocol, and troubleshooting skills. Familiarity with containerization technologies such as Docker and container orchestration tools like Kubernetes, with experience in container product operations and troubleshooting. Proficiency in MySQL or any other database, familiarity with NoSQL databases, and caching databases like Redis. Strong sense of responsibility, excellent communication and teamwork skills, proactive thinking, and self-drive, with keen risk awareness and risk identification abilities. Excellent documentation skills, with the ability to produce technical documents and operational solutions promptly and effectively, and the capability to summarize insights for internal and external dissemination. Excellent language proficiency in English and Chinese Mandarin in order to liaise with China-based stakeholders and manage various cross-country tools and documents in the said languages.

#LI-JY1

Diversity, Equity & Inclusion at Tencent

Diversity, equity and inclusion are important, interdependent components of our workplace. As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.



  • Singapur, Singapore Shopee Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our Engineering and Technology team in Singapore. As a key member of our team, you will be responsible for managing the technical operations of Shopee's core marketplace businesses, including product lines such as shopee voucher management, shopee discount/coins...


  • Singapur, Singapore Tencent Full time

    About the CompanyTencent is a leading global technology company focused on connecting people and developing innovative products and services that improve the quality of life of people around the world.Cloud & Smart Industries Group (CSIG)CSIG is responsible for promoting the company's cloud and industry Internet strategy. We explore the interactions between...


  • Singapur, Singapore Sea Full time

    Job Title: Site Reliability EngineerAt Sea, our Infrastructure team is responsible for providing end-to-end managed services and solutions for our entire Internet infrastructure. We excel in building architecture, providing solutions, and operating data centers, connectivity, cloud, networking, systems, storage, and security.As a Site Reliability Engineer,...


  • Singapur, Singapore Sea Full time

    Our Infrastructure team provides the end-to-end managed services and solutions for the Group's entire Internet infrastructure alongside running business applications. We excel in building the architecture, providing solutions and operations of data centre, connectivity, cloud, networking, system, storage and security. We are a proud provider of high-quality...


  • Singapur, Singapore Sea Full time

    About Sea LabsAt Sea Labs, we're at the forefront of innovation, driving the development of cutting-edge technologies that power our e-commerce, supply chain, games, payment, and finance platforms. Our team in Indonesia is a key part of this journey, working closely with global teams to deliver exceptional user experiences.We're seeking a skilled Site...


  • Singapur, Singapore Sea Full time

    Our Infrastructure team provides the end-to-end managed services and solutions for the Group's entire Internet infrastructure alongside running business applications. We excel in building the architecture, providing solutions and operations of data centre, connectivity, cloud, networking, system, storage and security. We are a proud provider of high-quality...


  • Singapur, Singapore Sea Full time

    About Sea LabsAt Sea Labs, we're at the forefront of the Sea platform's development, supporting diverse business lines across e-commerce, supply chain, games, payment, and finance. Our strong growth and unique positioning have led to the launch of Sea Labs Indonesia, where passionate engineers drive the best experience for our users in Indonesia and...


  • Singapur, Singapore Tencent Full time

    Job Summary:Tencent Games is seeking a skilled Site Reliability Engineer to maintain the stability and performance of our overseas cloud platforms. As a key member of our team, you will be responsible for monitoring and resource management, ensuring the smooth operation of our data platforms and services.Key Responsibilities:Design and implement automatic...


  • Singapur, Singapore NodeFlair Full time

    Senior Site Reliability EngineerWe are working with a leading pioneer in the Cryptocurrency space, utilizing one of the largest data platforms, and as part of their continued growth, NodeFlair has been engaged to search for a Senior Site Reliability Engineer to join their Singapore/Remote team.Key Responsibilities:Collaborate with the team on software...


  • Singapur, Singapore IHiS Full time

    Position OverviewThe Reliability Lead will support the reliability principal with senior management in strategy discussion for application & system improvement, and will also manage the reliability team. He/She will ensure that the existing site reliability engineering (SREs) initiatives, such as monitoring availability, uplifting capability and automoation...


  • Singapur, Singapore Sea Full time

    At Sea, our Infrastructure team provides end-to-end managed services and solutions for our entire Internet infrastructure, alongside running business applications. We excel in building architecture, providing solutions and operations of data centre, connectivity, cloud, networking, system, storage and security. Our team is proud to provide high-quality and...


  • Singapur, Singapore Shopee Full time

    Senior Site Reliability Engineer (Promotion) - Engineering Infra DepartmentEngineering and TechnologyLevelExperienced (Individual Contributor)LocationSingapore The Engineering and Technology team is at the core of the Shopee platform development. The team is made up of a group of passionate engineers from all over the world, striving to build the best...


  • Singapur, Singapore Unison Consulting Pte Ltd Full time

    Job Description and Requirements:We are seeking a highly skilled Oracle PaaS Developer to join our team at Unison Consulting Pte Ltd.The ideal candidate will have at least 5 years of experience in Oracle PaaS development, with a strong focus on VBCS, OIC, ODI, and ATP Database.Key responsibilities include:Designing and developing webpages and business...


  • Singapur, Singapore Sea Full time

    The Engineering and Technology team is at the core of the Shopee platform development. The team is made up of a group of passionate engineers from all over the world, striving to build the best systems with the most suitable technologies. Our engineers do not merely solve problems at hand; We build foundations for a long-lasting future. We don't limit...

  • Senior Manager

    4 weeks ago


    Singapur, Singapore StarHub Full time

    Job DescriptionThe Senior Manager, Site Reliability Engineering (SRE) Operations Analyst is responsible for leading the SRE operations team to ensure the availability, latency, and performance of our systems. This role requires a strong understanding of cloud solutions, CI/CD tools, logging and monitoring tools, and cloud native technologies.Key...


  • Singapur, Singapore IHiS Full time

    Job OverviewThe Reliability Lead will collaborate with senior management to discuss strategies for improving application and system reliability. This role will also manage the reliability team and ensure that existing site reliability engineering initiatives are on track.Key ResponsibilitiesStrive for automation in production systemsIdentify significant...


  • Singapur, Singapore GEMINI Full time

    Department : Platform Our Platform organization’s purpose is to enable Gemini to scale effectively and empower our engineering teams to focus on building innovative financial products and experiences for individuals around the world. Platform focuses around building a scalable and secure foundations platform, enabling Engineering to deploy, validate,...


  • Singapur, Singapore Tencent Full time

    About the RoleTencent Cloud International is seeking a passionate undergraduate or graduate student to join our PaaS SRE Internship. As an integral part of our operation and maintenance team, you will be responsible for ensuring the reliability and performance of our cloud infrastructure.Key ResponsibilitiesAssist in the maintenance and operation of PaaS...


  • Singapur, Singapore Ripple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team in Singapore. As a key member of our infrastructure team, you will be responsible for ensuring the high availability and scalability of our systems.Key ResponsibilitiesDesign, implement, and maintain high availability systems and infrastructureCollaborate with...


  • Singapur, Singapore Celanese Corporation Full time

    Responsibilities 职责: Job Description - Senior Reliability Engineer (Electrical) / Electrical - Subject Matter Expert Electrical Reliability and Maintenance: -Provide technical subject matter expertise to enhance the electrical reliability and ensuring all KPIs are met. -Improve reliability of electrical equipment by implementing repair...