Site Reliability Engineer

1 week ago


Singapore GXS Bank Full time

Get to know the Role

We treat Infrastructure and operations as Software Engineering problems. Our mission is to build and progress software platforms which enables the provisioning and managing of all Digibank services in safe, reliable and scalable ways. We consistently challenge the status quo, use new technologies to build platforms and tooling for engineering teams. In this role you will make significant decisions with a huge impact on building modern banking technology. You would be part of a team, responsible for designing & architecting new solutions, finding creative ways to optimise existing solutions which will improve agility for managing hundreds of microservices infrastructures in a stable & reliable way.

If you are:

  • A strong believer of automating DevOps & SRE aspects like infrastructure provisioning, deployment, observability, incident lifecycle, uptime SLA etc.
  • Bold to challenge, open to get challenged, curious to learn & grow
This is the right place for you

The Day-to-Day Activities:
  • Working with Kubernetes clusters hosted in AWS
  • Using InfrastructureAsCode tooling like Terraform, and Ansible to manage AWS, Azure & Kubernetes resources
  • Engage with the development teams throughout the life cycle to help develop software for reliability and scale. Coaching team's SRE best practices
  • Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents
  • Perform analytics on previous incidents and usage patterns to better predict issues and take proactive actions
  • Build and drive adoption for greater self-healing and resiliency patterns
  • Design automated software and product upgrades, change management, and release management solutions
  • Design, code, test and deliver software to automate manual operational work. Own your tools and services end to end.
  • Performance and cost optimization for infrastructure
  • Be part of an on-call rotation for the team's tooling and 24x7 support coverage as needed
  • Succeed, fail, and learn together with other talented people. We believe in an environment that provides an opportunity for growth and see education as an outcome of failure that gets us closer to the next breakthrough
The Must-Haves:
  • Bachelor's degree in information systems, information technology, computer science, or similar.
  • 1-4+ years of professional experience.
  • Experience with administering Kubernetes cluster
  • Experience with managing Infrastructure as code using Terraform
  • Direct production operations experience in a cloud environment.
  • Experience contributing to technology and product strategy.
  • Experience leading capability-building initiatives across diverse areas such as infrastructure and operations automation, observability, incident management, architecting HA systems, and other core engineering.
  • Demonstrated experience in driving operational efficiency and transparency of a growing engineering organization.


  • Singapore Sea Limited Full time

    Engineering and Technology - Infrastructure, Singapore - Entry Level Our DevOps Engineering team plays an important role in developing and maintaining the internal systems and tools for the Infrastructure team. As a Site Reliability Engineer, you are responsible for improving the availability and reliability of our Infrastructure services. - Responsible for...


  • Singapore JJ Consulting Services Full time

    Our Client is a fast growing company in Singapore, who is seeking to recruit a Site Reliability Engineer. **Site Reliability Engineer** **Key Roles & Responsibilities** - Providing ancillary support of Enterprise-Grade Products and solutions at customer's sites - Ironing out deployment issues or challenges that our customers may face - Responsible for...


  • Singapore NodeFlair Full time

    **Job Summary**: **Salary** S$11,500 - S$16,500 / Monthly **Job Type** **Seniority** Senior **Years of Experience** At least 7 years **Tech Stacks** Microsoft Puppet Java Ansible Python **This is Adyen** Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the...


  • Singapore THALES SOLUTIONS ASIA PTE. LTD. Full time

    Roles & ResponsibilitiesDigital Competence Center (DCC)Thales IFE has decided to create a leading technology center in Singapore for its IFE Digital Engineering. It will leverage on unique digital skillset from Singapore and neighbouring countries on Cloud engineering. Thanks to a multi-year strategic plan, Thales is locating at WeWork@Suntec, a center that...


  • Singapore Retentia technology private limited Full time

    **3+ years of experience in Site Reliability Engineering, DevOps**, or a related field. - **Strong knowledge of cloud platforms (AWS, GCP, Azure) and containerization technologies (Docker, Kubernetes).** - Experience with automation and configuration management tools (e.g., T**erraform, Ansible, Chef, or Puppet).** - Proficiency in at least **one programming...


  • Singapore The Edge Asia Full time

    Our client is a US hedge fund and their Technology group is constantly improving the company’s IT infrastructure, positioning them at the forefront of a rapidly evolving technology landscape. They are a team of experts experimenting, discovering new ways to harness the power of open-source solutions, and embracing enterprise agile methodology. Their...


  • Singapore Oxford Knight Full time

    Senior Site Reliability Engineer Job OverviewOxford Knight is seeking a highly skilled Senior Site Reliability Engineer to join our team and support our Linux trading infrastructure.Key ResponsibilitiesDesign and implement software components and systems to improve trading services.Provide level II support, including emergency response and advanced...


  • Singapore Gravitas Recruitment Group Full time

    Job details - Location - Singapore - Salary - S$9000 - S$13000 per month - Job Type - Permanent - Ref - BBBH137137_1690786002 - Posted - about 1 hour ago Job summary **Our client, a trading firm, is looking for a Site Reliability Engineer to join their team. They are seeking team players who demonstrate a creative approach to problem-solving and take...


  • Singapore AKAMAI TECHNOLOGIES APJ PTE. LTD. Full time

    As a Senior Site Reliability Engineer, you will influence a wide array of teams. You will be responsible for the performance and reliability of Akamai’s delivery products by working with the Product, Engineering and Support teams to diagnose, mitigate and solve outages. You will have to solve some of the most complex problems in distributed systems at...


  • Singapore NextWave Partners Full time

    Location: - Singapore- Job Type: - Permanent- Discipline: - Software Engineering- Salary: - Negotiable- Contact: - Chelsea Phan**Senior Site Reliability Engineer** **Singapore** **About the role** We are working with a climate technology, who is currently working on a smart carbon measurement, accounting, and management Saas platform that allows...


  • Singapore IFUN GAMES Full time

    **Responsibilities** - Design, implement, and maintain tools and processes for monitoring, alerting, and incident response - Collaborate with developers to improve the design and operation of systems, with a focus on reliability, performance, and scalability - Participate in on-call rotations to respond to incidents and handle escalations - Analyze system...


  • Central Singapore Emprego SG Full time

    **Location** Singapore, Central Singapore **Job Type** Permanent **Salary** 9,000 - 15,000 Per **Date Posted** 5 hours ago Additional Details **Job ID** 16908 **Job Views** 1 Roles & Responsibilities **Objectives of this Role** - Run the production environment by monitoring availability and taking a holistic view of system health Improve...


  • Singapore Sea Limited Full time

    Engineering and Technology - Infrastructure, Singapore - Experienced (Individual Contributor) Our DevOps Engineering team plays an important role in developing and maintaining the internal systems and tools for the Infrastructure team. As a Senior Site Reliability Operation Engineer, you are responsible for improving the availability and reliability of our...


  • Singapore SINGAPORE POWER LIMITED Full time

    **What You'll Do**: - Evangelist for Site Reliability Engineer (SRE) practices in SP Digital (SPD) - Maintain the Reliability tools with regular patching and upgrades - Mange and evolve the full stack observability tools used in SPD - Enhance the customer experience by simplifying the onboarding process and documentation - Work with teams to improve the...


  • Singapore J P INFOTEC PTE. LTD. Full time

    **Site Reliability Engineer** **Responsibilities** - Support and/or own the deployment of global products including setting up production and internal environments - Provide 24/7 first line of Engineering support (via follow the sun teams in all regions) for any issues related to global product deployment, availability and internal operations support. -...


  • Singapore Experis Full time

    **Site Reliability Engineer**: - Location- Singapore- Job reference- BBBH133368_1699927914- Salary- S$6000 - S$7500 per month- Consultant name - Rajasekar Shirley Monisha Consultant contact no. - 6232 5244 - EA License No. - 02C3423 - Consultant Registration No. - R22106767 **Responsibilities**: - Responsible for deployment, change, issues triage and...


  • Singapore GXS BANK PTE. LTD. Full time

    **Job Description & Requirements**: Get to know the Role: - As a Site Reliability Engineer (SRE) you will help build a meaningful engineering discipline, combining software and systems to develop creative engineering solutions to operations problems. - Much of our support and software development focuses on optimizing existing systems, building...


  • Singapore Ambition Singapore Full time

    Ambition SingaporeAbout the CompanyAmbition Singapore is a top quantitative trading firm with a results-driven culture, seeking a Site Reliability Engineer to safeguard their innovative services and strategies.


  • Singapore DADACONSULTANTS PTE. LTD. Full time

    Roles & ResponsibilitiesSenior Site Reliability Engineer (SRE) | Big DataResponsibilities:Manage the full lifecycle of services, from design to deployment and maintenance.Develop and improve automation tools for scalability and reliability.Troubleshoot and resolve software and infrastructure issues, ensuring data security.Optimize system architecture and...


  • Singapore INFOSYS COMPAZ PTE. LTD. Full time

    Roles & ResponsibilitiesJob DescriptionWe are seeking talented and driven professionals to join our Site Reliability Engineering (SRE) team. This role involves helping organizations enhance the availability, performance, and resilience of their applications and services through the deployment and administration of Observability PlatformsKey Responsibilities...