Site Reliability Engineer

1 week ago


Singapore PATSNAP PTE. LTD. Full time
Roles & Responsibilities

About the Role

We are looking for a skilled and experienced DevOps Engineer / Site Reliability

Engineer (SRE) to ensure the high availability, stability, and performance of our

business platform. This role will be responsible for designing and implementing scalable

and maintainable DevOps architecture and automation systems to enhance

operational efficiency. As a senior member, you will lead efforts in optimizing our

operational standards, managing risk assessments, and fostering collaboration with

our China-based operations team. If you are passionate about high-performance

systems, security, and automation, we welcome you to join our team.

Key Responsibilities

  • Ensure high availability, stability, and performance of business platforms, developing optimization strategies and refining operational standards and procedures.
  • Lead the design and implementation of scalable, maintainable DevOps architecture and automation systems to streamline and enhance operational processes.
  • Oversee security risk assessments, and lead the creation and implementation of security strategies to maintain system security.
  • Evaluate and review the system architecture, process logic, performance, and stability, working closely with SRE and developer teams in China to address challenges effectively.
  • Act as the primary incident commander for production environment issues, leading team efforts in troubleshooting and resolution, and ensuring timely response and resolution.
  • Stay updated on the latest trends in technology advancements, organizing team learning sessions to foster continuous improvement.

Desired Qualifications

  • Bachelor's degree in Computer Science or a related field, with at least 4 years of experience in internet system operations or SRE roles.
  • In-depth understanding of internet technology architecture, including expertise in microservices, Kubernetes, Docker, monitoring and alerting systems, CI/CD, logging systems, distributed caching, and database systems.
  • Extensive experience in distributed systems and high-concurrency operations, with strong skills in fault diagnosis and system optimization.
  • Proficient in cloud platform operations (e.g., AWS, Azure), with knowledge of MySQL, PostgreSQL, Redis, and familiarity with big data technologies and hybrid cloud architectures preferred.
  • Skilled in at least one programming language such as Python, Go, or Java, with relevant development experience.
  • Strong organizational and coordination skills, with the ability to guide team members in solving complex issues.
  • Fluent in Mandarin to facilitate effective communication within a multilingual team environment.

Why Join Us

  • Work with innovative DevOps and cloud technologies to drive impactful solutions.
  • Be part of a collaborative, growth-oriented environment that emphasizes continuous learning.
  • Engage in diverse DevOps areas, including system automation, security, and performance tuning, for a comprehensive experience.
Tell employers what skills you have

Troubleshooting
Kubernetes
Azure
Big Data
Technology Architecture
High Availability
MySQL
Reliability
Logging
Multilingual
Distributed Systems
Python
Performance Tuning
Docker
Java
System Architecture

  • Singapore This is an IT support group Full time

    As a key member of our team, you will play a crucial role in ensuring the reliability and efficiency of our plant's utilities and facilities.About the RoleThe Site Reliability Engineer (SRE) ensures that all utilities and facilities within the plant are functioning optimally. This includes managing utilities such as water, electricity, HVAC, compressed air,...


  • Singapore OCBC Full time

    Job Description:We are seeking a Site Reliability Engineer Leader to join our team at OCBC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our infrastructure. This role requires strong expertise in automating releases, continuous integration/delivery systems, and relevant infrastructure...


  • Singapore COMBUILDER PTE LTD Full time

    Roles & ResponsibilitiesWe are seeking talented and driven professionals to join our Site Reliability Engineering (SRE) team. This role involves helping organizations enhance the availability, performance, and resilience of their applications and services through the deployment and administration of Observability Platforms.Key ResponsibilitiesDeploy and...


  • Singapore FUNFLY PTE. LTD. Full time

    Roles & ResponsibilitiesPosition OverviewAs a site reliability engineer, you will be responsible for ensuring the smooth operation of game services by maintaining, monitoring, and responding to faults daily. They will develop automation tools to enhance operational efficiency and manage game servers for optimal performance. The role includes collaborating...


  • Singapore GK CONSULTING PTE. LTD. Full time

    Roles & ResponsibilitiesWe're seeking an experienced Senior Site Reliability Engineer to ensure the reliability, availability, and performance of our cloud-based internet services.Key Responsibilities1. Own reliability, availability, and user experience for assigned cloud services2. Develop and implement service governance initiatives to increase reliability...


  • Singapore TRINITY CONSULTING SERVICES PTE. LTD. Full time

    Roles & Responsibilities· Must have minimum 5 years' experience.· Strong technical knowledge and experience in supporting enterprise-level applications.· Proficiency in troubleshooting application issues, performing log analysis, and using monitoring tools.· Experience with databases and SQL query language.· Familiarity with software development life...


  • Singapore FLOWDESK ASIA PTE. LTD. Full time

    Roles & ResponsibilitiesAbout the jobAre you passionate about maintaining robust and high-performing infrastructures? Do you thrive in managing complex network environments and ensuring system reliability?Join our infrastructure team and help us elevate operational excellence to new heights.As a Site Reliability Engineer at Flowdesk, you will be at the heart...


  • Singapore HELLO PLANET PTE. LTD. Full time

    Roles & ResponsibilitiesWe are a global dating app created to give everyone a chance at love. The sense of belonging and connectedness we get from relationships helps us survive and thrive, and we're working to make it a little easier for people to find that. We're inspired by the stories we hear from employees, friends, and family who have used our app to...


  • Singapore Oxford Knight Full time

    RequirementsOxford Knight seeks a highly motivated and experienced Senior Site Reliability Engineer with a strong background in Linux administration, cloud computing, and programming languages (preferably Python). The ideal candidate should have a degree in Computer Science or a related field and excellent communication skills.Key Skills and Qualifications5+...


  • Singapore TIKTOK PTE. LTD. Full time

    Roles & ResponsibilitiesTikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo.Why Join UsAt TikTok, our people are humble, intelligent, compassionate and creative. We create...


  • Singapore Gravitas Recruitment Group Full time

    Our client, a leading investor in financial markets, are looking for an autonomous, critical thinking, Site Reliability Engineer to join their team in Singapore. The ideal candidate must have a strong academic background, having graduated from a top university with a bachelor's degree in computer science. This degree should have been applied...


  • Singapore SCIENTE INTERNATIONAL PTE. LTD. Full time

    Roles & ResponsibilitiesJob Summary:We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. As an SRE, you will play a critical role in enhancing system reliability, performance, and scalability while ensuring the seamless functioning of our production environments. This is an opportunity to work in a fast-paced, dynamic environment...


  • Singapore TOSS-EX PTE. LTD. Full time

    Roles & ResponsibilitiesRoles & ResponsibilitiesJob PurposeThe Site Reliability Engineer (SRE) combines software development and system engineering to build and run distributed solutions in a secured multi-tier heterogeneous environment to safeguard, provide and continuously improve the software and systems behind the organization’s cloud platform...


  • Singapore TOSS-EX PTE. LTD. Full time

    Roles & ResponsibilitiesRoles & ResponsibilitiesJob PurposeThe Site Reliability Engineer (SRE) combines software development and system engineering to build and run distributed solutions in a secured multi-tier heterogeneous environment to safeguard, provide and continuously improve the software and systems behind the organization's cloud platform...


  • Singapore GXS Bank Full time

    About the Team: Our team treats infrastructure and operations as software engineering problems. We are responsible for building and progressing software platforms that enable the provisioning and management of all Digibank services in safe, reliable, and scalable ways. We consistently challenge the status quo and use new technologies to build platforms...


  • Singapore SOURCEO PTE. LTD. Full time

    Roles & ResponsibilitiesRequired Expertise and ExperienceAt least 3 years of experience in SRE, DevOps, or a related engineering role. Proficiency in Infrastructure as Code (IaC) using Terraform to manage complex infrastructure. Hands-on experience with log analytics and observability tools, including ELK (Elasticsearch, Logstash, Kibana) and the Grafana...


  • Singapore BYTEDANCE PTE. LTD. Full time

    Roles & ResponsibilitiesAbout Doubao (Seed)Founded in 2023, the ByteDance Doubao (Seed) Team, is dedicated to pioneering advanced AI foundation models. Our goal is to lead in cutting-edge research and drive technological and societal advancements.With a strong commitment to AI, our research areas span deep learning, reinforcement learning, Language, Vision,...

  • Associate VP

    3 days ago


    Singapore DBS Bank Limited Full time

    Business Function Group Technology enables and empowers the bank with an efficient, nimble and resilient infrastructure through a strategic focus on productivity, quality & control, technology, people capability and innovation. In Group Technology, we manage the majority of the Bank's operational processes and inspire to delight our business partners...

  • Reliability Engineer

    4 weeks ago


    Singapore UNITED MICROELECTRONICS CORPORATION (SINGAPORE BRANCH) Full time

    Roles & ResponsibilitiesJob Summary:We are seeking a motivated and detail-oriented Fab Reliability Engineer to join our dynamic team. The ideal candidate will play a crucial role in ensuring the reliability and performance of our manufacturing processes. This position involves process qualification, conformance, process change management, new process...

  • Associate VP

    4 days ago


    Singapore DBS Bank Limited Full time

    Business Function Group Technology enables and empowers the bank with an efficient, nimble and resilient infrastructure through a strategic focus on productivity, quality & control, technology, people capability and innovation. In Group Technology, we manage the majority of the Bank's operational processes and inspire to delight our business partners...