Current jobs related to Site Reliability Engineer - Singapore - Thought Machine
-
Site Reliability Engineer
5 days ago
North-East Singapore PERSOLKELLY Full timeThe Site Reliability Engineer is responsible for ensuring the reliability, scalability, and efficiency of our systems and infrastructure. This role involves monitoring, troubleshooting, and resolving issues to maintain optimal performance. The engineer will also collaborate with cross-functional teams to automate processes and improve system reliability....
-
Site Reliability Engineer
3 hours ago
Singapore Bright Vision Technologies Full timeExciting Opportunity for Site Reliability Engineer - H1B Sponsorship for 2025 at Bright Vision Technologies Join the Bright Vision Technologies Team: Where Innovation Meets Opportunity As we approach the 2025 H1B filing season, we are excited to offer a unique opportunity for talented professionals like you to work with our direct clients in the US and...
-
Site Reliability Engineer
1 week ago
Singapore Rapsys Technologies Full timeDrive the Site Reliability Engineering agenda forward at an Enterprise Level to improve availability, reliability, and performance of services. - Drive cross-team efforts in resiliency assessment exercises and reporting - Draft and/or contribute to internal SRE training materials - Support services before they go live through activities such as Chaos testing...
-
Site Reliability Engineer
3 weeks ago
Singapore THALES SOLUTIONS ASIA PTE. LTD. Full timeRoles & ResponsibilitiesDigital Competence Center (DCC)Thales IFE has decided to create a leading technology center in Singapore for its IFE Digital Engineering. It will leverage on unique digital skillset from Singapore and neighbouring countries on Cloud engineering. Thanks to a multi-year strategic plan, Thales is locating at WeWork@Suntec, a center that...
-
Site Reliability Engineer
3 days ago
Singapore NLS Full timeMy client, a global hedge fund, is actively seeking a hands on a highly skilled and motivated SRE to join their team. As an SRE, you will play a critical role in driving the adoption of Site Reliability Engineering practices within their organization. The ideal candidate will have a strong technical background and a passion for driving operational efficiency...
-
Site Reliability Engineer
7 days ago
Singapore Imperva Full time**Site Reliability Engineer**:** About the role** Imperva’s Infrastructure and Cloud team is looking for a highly technical Site Reliability Engineer to drive innovation, scale, and create operational excellence for the Imperva globally distributed network. As an SRE in the ICO organization, you approach solving, supporting, and optimizing the...
-
Site Reliability Engineer
3 days ago
Singapore Retentia technology private limited Full time**3+ years of experience in Site Reliability Engineering, DevOps**, or a related field. - **Strong knowledge of cloud platforms (AWS, GCP, Azure) and containerization technologies (Docker, Kubernetes).** - Experience with automation and configuration management tools (e.g., T**erraform, Ansible, Chef, or Puppet).** - Proficiency in at least **one programming...
-
Site Reliability Engineer
1 week ago
Singapore M2R System Technology Pte. Ltd. Full time**Responsibilities**: - Run production environment by monitoring availability and taking a holistic view of the system health - Achieve site reliability automation, minimize system downtime, and reduce site reliability cost - Manage risks and resolves issues that affect the release scope, schedule and quality - Suggest architecture improvements, push for...
-
Site Reliability Engineer
1 day ago
Singapore The Edge Asia Full timeOur client is a US hedge fund and their Technology group is constantly improving the company’s IT infrastructure, positioning them at the forefront of a rapidly evolving technology landscape. They are a team of experts experimenting, discovering new ways to harness the power of open-source solutions, and embracing enterprise agile methodology. Their...
-
Site Reliability Engineer
2 days ago
Singapore IFUN GAMES Full time**Responsibilities** - Design, implement, and maintain tools and processes for monitoring, alerting, and incident response - Collaborate with developers to improve the design and operation of systems, with a focus on reliability, performance, and scalability - Participate in on-call rotations to respond to incidents and handle escalations - Analyze system...
Site Reliability Engineer
3 weeks ago
**General information**:
- Job Title- Site Reliability Engineer- City- Singapore- Country- Singapore- Division- Engineering- Department- Infrastructure- Working time- Full-time**Description**:
- Thought Machine’s mission is bold - to properly and permanently rid the world’s banks of legacy technology. To achieve this, we have developed the foundations of modern banking and built core and payments technology which runs natively in the cloud. What we are attempting is hard and means we need great people working together to build great technology.
We have grown rapidly in the past few years - growing our team to more than 500 individuals across offices in London, New York, Singapore, Sydney and Melbourne. We have raised more than $500m in funding and are now valued at $2.7bn. Our investors include Molten Ventures, Eurazeo, Intesa Sanpaolo, Temasek, Nyca Partners, JPMorgan Chase, Standard Chartered, and more.
We have created a culture enabling our team to produce the best work in the industry, ensuring we have fun along the way. We're regularly cited as having a fantastic workplace culture and have been recognised by Sifted magazine as having one of the highest Glassdoor ratings for a UK fintech company and the most generous employee share package in the industry. We've been named AltFi's B2B Fintech of the Year, placed in the FinTech50, and in the IDC list of top 100 Fintechs.
Site Reliability Engineers at Thought Machine take responsibility for deploying our software into production. As well as traditional DevOps roles, your focus will be on writing and maintaining software with the aim of automating the deployment processes.
**DUTIES**
- Developing tools to ensure our services can scale and are highly available. We always try to manage our ops tasks with automation, by adopting open source tools or developing bespoke tools as required
- Being part of the 24x7 on-call rota, helping support and maintain production systems
- Day to day development support and monitoring of production server and network environments by developing and deploying logging and monitoring tools.
- Supporting disaster recovery, backup, redundancy and capacity planning activities.
- Working with external users/clients on a variety of projects, ensuring their success in running our core product Vault
- This role will be based in Singapore**Requirements**:
**Essential**
- Strong background in Linux/Unix administration, e.g. Ubuntu, Debian
- A strong background in at least one of Go, Python or Java
- A strong background in one of the following: database administration, Kafka, observability tools (such as Prometheus or Zipkin) or infrastructure automation.
- Experience with AWS or GCP is essential
- Experience or knowledge of container orchestration tools, e.g. Kubernetes
**Desirable**
- Experience in supporting production systems
- Experience with automation/configuration management, e.g. Terraform, Puppet, Chef, Ansible
- Client engagement experience as an SRE for high traffic, mission critical systems.
- Ability to explain technical concepts to technical and non-technical stakeholders.
**Benefits**:
- Highly competitive salary
- Bonus incentive
- Healthcare
- 25 days holiday and public holidays
- $1,500 SGD per year flexible spend benefit
- All the latest tech you need
- A talented and experienced team as your colleagues
- An environment where we encourage learning and progress