Client System Reliability Engineer

23 hours ago


Singapore Thought Machine Full time

**General information**:

- Job Title: Client System Reliability Engineer
- Country: Singapore
- Division: Engineering
- Department: Infrastructure

**Description**:

Thought Machine’s mission is bold: to properly and permanently rid the world’s banks of legacy technology. To achieve this, we have developed the foundations of modern banking and built core and payments technology which runs natively in the cloud. What we are attempting is hard and means we need great people working together to build great technology.

We have grown rapidly in the past few years, growing our team to more than 500 individuals across offices in London, New York, Singapore, Sydney and Melbourne. We have raised more than $500m in funding and are now valued at $2.7bn. Our investors include Molten Ventures, Eurazeo, Intesa Sanpaolo, Temasek, Nyca Partners, JPMorgan Chase, Standard Chartered, and more.

We have created a culture enabling our team to produce the best work in the industry, ensuring we have fun along the way. We're regularly cited as having a fantastic workplace culture and have been recognised by Sifted magazine as having one of the highest Glassdoor ratings for a UK fintech company and the most generous employee share package in the industry. We've been named AltFi's B2B Fintech of the Year, placed in the FinTech50, and in the IDC list of top 100 Fintechs.

The Client System Reliability Engineer role in Infrastructure will be responsible for enabling and supporting our clients to deliver a best-in-class, cloud-native implementation of Thought Machine Vault products on client or Thought Machine hosted infrastructure, from presales to production at scale. This role supports clients in their cloud infrastructure preparation, deployment, optimisation and troubleshooting.

**Duties**

- Hands-on cloud infrastructure consulting, both on client site and remote
- Working with customers and external partners to design and prepare suitable cloud infrastructure to ensure Thought Machine Vault products can be tested and run successfully at scale. This includes planning for high availability, disaster recovery, backup, redundancy, capacity and security.
- Deploying and configuring Thought Machine Vault products on client, SaaS and internal cloud infrastructure
- Developing a deep understanding of, and advising clients on, the optimisation of cloud infrastructure for the overall implementation of Vault, including advising on systems outside of Vault to support holistic digital transformation, in collaboration with Thought Machine Client Architects
- Supporting and troubleshooting client, SaaS and internal cloud infrastructure both remotely and on site, including by promoting and deploying suitable monitoring, logging and alerting tools
- Working closely with internal product and engineering teams to ensure client feedback is incorporated into improvements to the product and platform
- Supporting the Thought Machine Commercial Team and Cloud Provider Partners in answering infrastructure queries and challenges in the presales cycle

**Requirements**:
**Essential**

- Ability to explain technical concepts to technical and non-technical stakeholders
- Hands-on experience in the following:

- Linux/Unix administration, e.g. Ubuntu, Debian
- Kafka
- PostgreSQL
- Kubernetes
- Istio
- Experience with automation/configuration management, e.g. Terraform, Puppet, Chef, Ansible
- Experience with AWS or GCP

**Desirable**
- Experience with at least two of the following, and an associated certification for at least one:
  - AWS (ideally certified Solutions Architect Professional)
  - GCP (ideally certified Professional Cloud Architect)
  - Azure (ideally certified Solutions Architect Expert)
- Experience with enterprise secrets management systems, e.g. HashiCorp Vault, AWS Secrets Manager
- Experience supporting high-profile, mission-critical production systems, ideally for a tier 1 financial institution
- Experience with hybrid cloud technologies including OpenShift, Google Anthos, AWS EKS Anywhere, AWS Outposts
- A strong background in Python or Go
- Experience with CockroachDB, AlloyDB, Aurora
- Experience with observability tools, e.g. Prometheus, Grafana

**Benefits**:

- Highly competitive salary
- Bonus incentive
- Healthcare
- 25 days holiday and public holidays
- Competitive maternity and paternity leave
- SGD 1,500 per year in flexible spend benefits
- All the latest tech you need
- A talented and experienced team as your colleagues
- An environment where we encourage learning and progress


