Site Reliability Engineer

5 days ago


Singapore Thought Machine Full time

**General information**:

- Job Title: Site Reliability Engineer
- City: Singapore
- Country: Singapore
- Division: Engineering
- Department: Infrastructure
- Working time: Full-time

**Description**:

Thought Machine’s mission is bold - to properly and permanently rid the world’s banks of legacy technology. To achieve this, we have developed the foundations of modern banking and built core and payments technology which runs natively in the cloud. What we are attempting is hard and means we need great people working together to build great technology.

We have grown rapidly in the past few years, expanding our team to more than 500 people across offices in London, New York, Singapore, Sydney and Melbourne. We have raised more than $500m in funding and are now valued at $2.7bn. Our investors include Molten Ventures, Eurazeo, Intesa Sanpaolo, Temasek, Nyca Partners, JPMorgan Chase, Standard Chartered, and more.

We have created a culture enabling our team to produce the best work in the industry, ensuring we have fun along the way. We're regularly cited as having a fantastic workplace culture and have been recognised by Sifted magazine as having one of the highest Glassdoor ratings for a UK fintech company and the most generous employee share package in the industry. We've been named AltFi's B2B Fintech of the Year, placed in the FinTech50, and in the IDC list of top 100 Fintechs.

Site Reliability Engineers at Thought Machine take responsibility for deploying our software into production. Alongside traditional DevOps responsibilities, your focus will be on writing and maintaining software that automates our deployment processes.

**DUTIES**
- Developing tools to ensure our services can scale and are highly available. We always try to manage our ops tasks with automation, by adopting open source tools or developing bespoke tools as required
- Being part of the 24x7 on-call rota, helping support and maintain production systems
- Providing day-to-day development support and monitoring of production server and network environments by developing and deploying logging and monitoring tools
- Supporting disaster recovery, backup, redundancy and capacity planning activities.
- Working with external users/clients on a variety of projects, ensuring their success in running our core product Vault
- This role will be based in Singapore

**Requirements**:
**Essential**
- Strong background in Linux/Unix administration, e.g. Ubuntu, Debian
- A strong background in at least one of Go, Python or Java
- A strong background in one of the following: database administration, Kafka, observability tools (such as Prometheus or Zipkin) or infrastructure automation.
- Experience with AWS or GCP is essential
- Experience with or knowledge of container orchestration tools, e.g. Kubernetes

**Desirable**
- Experience in supporting production systems
- Experience with automation/configuration management, e.g. Terraform, Puppet, Chef, Ansible
- Client engagement experience as an SRE for high-traffic, mission-critical systems
- Ability to explain technical concepts to technical and non-technical stakeholders.

**Benefits**:

- Highly competitive salary
- Bonus incentive
- Healthcare
- 25 days holiday and public holidays
- $1,500 SGD per year flexible spend benefit
- All the latest tech you need
- A talented and experienced team as your colleagues
- An environment where we encourage learning and progress


