
Client System Reliability Engineer
3 days ago
**General information**:
- Job Title: Client System Reliability Engineer
- Country: Singapore
- Division: Engineering
- Department: Infrastructure

**Description**:
Thought Machine’s mission is bold: to properly and permanently rid the world’s banks of legacy technology. To achieve this, we have developed the foundations of modern banking and built core and payments technology which runs natively in the cloud. What we are attempting is hard and means we need great people working together to build great technology.

We have grown rapidly in the past few years, growing our team to more than 500 individuals across offices in London, New York, Singapore, Sydney and Melbourne. We have raised more than $500m in funding and are now valued at $2.7bn. Our investors include Molten Ventures, Eurazeo, Intesa Sanpaolo, Temasek, Nyca Partners, JPMorgan Chase, Standard Chartered, and more.

We have created a culture enabling our team to produce the best work in the industry, ensuring we have fun along the way. We're regularly cited as having a fantastic workplace culture and have been recognised by Sifted magazine as having one of the highest Glassdoor ratings for a UK fintech company and the most generous employee share package in the industry. We've been named AltFi's B2B Fintech of the Year, placed in the FinTech50, and included in the IDC list of top 100 Fintechs.

The Client System Reliability Engineer role in Infrastructure will be responsible for enabling and supporting our clients to deliver a best-in-class, cloud-native implementation of Thought Machine Vault products on client or Thought Machine hosted infrastructure, from presales to production at scale. This role supports clients in their cloud infrastructure preparation, deployment, optimisation and troubleshooting.

**Duties**:
- Hands-on cloud infrastructure consulting, both on client site and remote
- Working with customers and external partners to design and prepare suitable cloud infrastructure to ensure Thought Machine Vault products can be tested and run successfully at scale. Includes planning for high availability, disaster recovery, backup, redundancy, capacity and security.
- Deploying and configuring Thought Machine Vault products on client, SaaS and internal cloud infrastructure
- Developing a deep understanding of, and advising clients on, optimisation of cloud infrastructure for the overarching implementation of Vault, including advising on systems outside of Vault to empower holistic digital transformation in collaboration with Thought Machine Client Architects
- Supporting and troubleshooting client, SaaS and internal cloud infrastructure both remotely and on site, including by promoting and deploying suitable monitoring, logging and alerting tools
- Working closely with internal product and engineering teams to ensure client feedback is incorporated into improvements to the product and platform
- Supporting the Thought Machine Commercial Team and Cloud Provider Partners in answering infrastructure queries and challenges in the presales cycle
**Requirements**:
**Essential**
- Ability to explain technical concepts to technical and non-technical stakeholders
- Hands-on experience in the following:
  - Linux/Unix administration, e.g. Ubuntu, Debian
  - Kafka
  - PostgreSQL
  - Kubernetes
  - Istio
- Experience with automation/configuration management, e.g. Terraform, Puppet, Chef, Ansible
- Experience with AWS or GCP
**Desirable**
- Experience with at least two, and associated certifications for at least one, of the following:
  - AWS (ideally certified Solutions Architect Professional)
  - GCP (ideally certified Professional Cloud Architect)
  - Azure (ideally certified Solutions Architect Expert)
- Experience of enterprise secrets management systems, e.g. HashiCorp Vault, AWS Secrets Manager
- Experience supporting high-profile, mission-critical production systems, ideally for a tier 1 financial institution
- Experience with hybrid cloud technologies including OpenShift, Google Anthos, AWS EKS Anywhere, AWS Outposts
- A strong background in Python or Go
- Experience with CockroachDB, AlloyDB, Aurora
- Experience with observability tools, e.g. Prometheus, Grafana
**Benefits**:
- Highly competitive salary
- Bonus incentive
- Healthcare
- 25 days holiday and public holidays
- Competitive maternity and paternity leave
- SGD 1,500 per year in flexible spend benefits
- All the latest tech you need
- A talented and experienced team as your colleagues
- An environment where we encourage learning and progress