Site Reliability Engineer
2 weeks ago
**General information**:
- Job Title- Site Reliability Engineer- City- Singapore- Country- Singapore- Division- Engineering**Description**:
- Thought Machine’s mission is bold - to properly and permanently rid the world’s banks of legacy technology. To achieve this, we have developed the foundations of modern banking through core and payments technology which run natively in the cloud. What we are attempting is hard and means we need great people working together to build great technology.We have grown rapidly in the past few years - growing our team to more than 550 individuals across offices in London, New York, Singapore and Sydney. We have raised more than $500m in funding and are now valued at $2.7bn. Our investors include Temasek, Standard Chartered Ventures, Molten Ventures, Eurazeo, Intesa Sanpaolo, Nyca Partners, JPMorgan Chase Strategic Investments, and more.- We have created a culture enabling our team to produce the best work in the industry, ensuring we have fun along the way. We're regularly cited as having a fantastic workplace culture and have been recognised by Sifted magazine as having one of the highest Glassdoor ratings for a UK fintech company and the most generous employee share package in the industry. We've been named in the IDC list of top 100 fintechs, and the Singapore HR Awards awarded us Gold and Silver for our workplace culture and employee experience.We are spinning up a new regional SaaS platform team responsible for providing a world-class SaaS offering, by continuously improving and maintaining our SaaS platform. The team will be geographically distributed across our two main hubs: UK, SG.Joining this team is an excellent opportunity to get exposure to how mission-critical systems are run in production. You will be part of a team that owns the system end-to-end and have a deeper understanding of exactly how our clients use the system (for example by extracting usage analytics).The team will own the platform end-to-end, making use of existing infrastructure, improving core Terraform modules, as well as developing operators, tooling and additional infrastructure where appropriate. They will also be responsible for L2 support (for client-initiated support requests) and L1 (for alerting-based incidents). Support will be provided during working hours, with a follow-the-sun model and handovers happening between the 3 regions.Definition and development of the SaaS roadmap is another critical responsibility of this team. Alongside the Product Management function, they will define technical requirements, features and implement them with the goal of offering an excellent SaaS experience to our clients.**Duties**
- Provision SaaS environments as new clients are onboarded.
- Be part of the on-call rota (during business hours), responsible for resolving alerts generated by proactive monitoring and working closely with CANs to provide L2 support for client-initiated support requests.
- Define and implement the feature roadmap to improve the SaaS platform, for example by implementing self-service functionality, exposing metrics to clients, improving automation and self-healing properties of the system.
- Improving the scalability, security and performance of the SaaS platform, by implementing automated compliance and controls, testing different Kafka and DB setups (e.g. Aurora vs RDS) and running load tests at every level of the stack.
- Implementing and regularly testing DR strategies to ensure the highest level of resilience and fault tolerance of the platform.
**Requirements**:
**Essential**
- Strong background in Linux/Unix administration, e.g. Ubuntu, Debian
- A strong background in at least one of Go, Python or Java
- A strong background in one of the following: database administration, Kafka, observability tools (such as Prometheus or Zipkin) or infrastructure automation.
- Experience with AWS or GCP is essential
- Experience or knowledge of container orchestration tools, e.g. Kubernetes
**Desirable**
- Experience in supporting production systems
- Experience with automation/configuration management, e.g. Terraform, Puppet, Chef, Ansible
**Benefits**:
- Highly competitive salary
- Bonus incentive
- Healthcare
- 25 days holiday and public holidays
- Competitive maternity and paternity leave
- $1,500 SGD per year flexible spend benefit
- All the latest tech you need
- A talented and experienced team as your colleagues
- An environment where we encourage learning and progress
-
Site Reliability Engineer
7 days ago
Singapore Sea Limited Full timeEngineering and Technology - Infrastructure, Singapore - Entry Level Our DevOps Engineering team plays an important role in developing and maintaining the internal systems and tools for the Infrastructure team. As a Site Reliability Engineer, you are responsible for improving the availability and reliability of our Infrastructure services. - Responsible for...
-
Site Reliability Engineer
1 hour ago
Singapore JJ Consulting Services Full timeOur Client is a fast growing company in Singapore, who is seeking to recruit a Site Reliability Engineer. **Site Reliability Engineer** **Key Roles & Responsibilities** - Providing ancillary support of Enterprise-Grade Products and solutions at customer's sites - Ironing out deployment issues or challenges that our customers may face - Responsible for...
-
Site Reliability Engineer
5 days ago
Singapore NodeFlair Full time**Job Summary**: **Salary** S$11,500 - S$16,500 / Monthly **Job Type** **Seniority** Senior **Years of Experience** At least 7 years **Tech Stacks** Microsoft Puppet Java Ansible Python **This is Adyen** Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the...
-
Site Reliability Engineer
5 hours ago
Singapore THALES SOLUTIONS ASIA PTE. LTD. Full timeRoles & ResponsibilitiesDigital Competence Center (DCC)Thales IFE has decided to create a leading technology center in Singapore for its IFE Digital Engineering. It will leverage on unique digital skillset from Singapore and neighbouring countries on Cloud engineering. Thanks to a multi-year strategic plan, Thales is locating at WeWork@Suntec, a center that...
-
Site Reliability Engineer
2 weeks ago
Singapore Retentia technology private limited Full time**3+ years of experience in Site Reliability Engineering, DevOps**, or a related field. - **Strong knowledge of cloud platforms (AWS, GCP, Azure) and containerization technologies (Docker, Kubernetes).** - Experience with automation and configuration management tools (e.g., T**erraform, Ansible, Chef, or Puppet).** - Proficiency in at least **one programming...
-
Site Reliability Engineer
1 week ago
Singapore The Edge Asia Full timeOur client is a US hedge fund and their Technology group is constantly improving the company’s IT infrastructure, positioning them at the forefront of a rapidly evolving technology landscape. They are a team of experts experimenting, discovering new ways to harness the power of open-source solutions, and embracing enterprise agile methodology. Their...
-
Site Reliability Engineering Leader
3 days ago
Singapore Oxford Knight Full timeSenior Site Reliability Engineer Job OverviewOxford Knight is seeking a highly skilled Senior Site Reliability Engineer to join our team and support our Linux trading infrastructure.Key ResponsibilitiesDesign and implement software components and systems to improve trading services.Provide level II support, including emergency response and advanced...
-
Site Reliability Engineer
7 days ago
Singapore Gravitas Recruitment Group Full timeJob details - Location - Singapore - Salary - S$9000 - S$13000 per month - Job Type - Permanent - Ref - BBBH137137_1690786002 - Posted - about 1 hour ago Job summary **Our client, a trading firm, is looking for a Site Reliability Engineer to join their team. They are seeking team players who demonstrate a creative approach to problem-solving and take...
-
Senior Site Reliability Engineer
1 day ago
Singapore AKAMAI TECHNOLOGIES APJ PTE. LTD. Full timeAs a Senior Site Reliability Engineer, you will influence a wide array of teams. You will be responsible for the performance and reliability of Akamai’s delivery products by working with the Product, Engineering and Support teams to diagnose, mitigate and solve outages. You will have to solve some of the most complex problems in distributed systems at...
-
Site Reliability Engineer
5 days ago
Singapore NextWave Partners Full timeLocation: - Singapore- Job Type: - Permanent- Discipline: - Software Engineering- Salary: - Negotiable- Contact: - Chelsea Phan**Senior Site Reliability Engineer** **Singapore** **About the role** We are working with a climate technology, who is currently working on a smart carbon measurement, accounting, and management Saas platform that allows...
-
Site Reliability Engineer
2 weeks ago
Singapore IFUN GAMES Full time**Responsibilities** - Design, implement, and maintain tools and processes for monitoring, alerting, and incident response - Collaborate with developers to improve the design and operation of systems, with a focus on reliability, performance, and scalability - Participate in on-call rotations to respond to incidents and handle escalations - Analyze system...
-
Site Reliability Engineer
1 week ago
Central Singapore Emprego SG Full time**Location** Singapore, Central Singapore **Job Type** Permanent **Salary** 9,000 - 15,000 Per **Date Posted** 5 hours ago Additional Details **Job ID** 16908 **Job Views** 1 Roles & Responsibilities **Objectives of this Role** - Run the production environment by monitoring availability and taking a holistic view of system health Improve...
-
Senior Site Reliability Engineer
7 days ago
Singapore Sea Limited Full timeEngineering and Technology - Infrastructure, Singapore - Experienced (Individual Contributor) Our DevOps Engineering team plays an important role in developing and maintaining the internal systems and tools for the Infrastructure team. As a Senior Site Reliability Operation Engineer, you are responsible for improving the availability and reliability of our...
-
Site Reliability Engineer
1 day ago
Singapore SINGAPORE POWER LIMITED Full time**What You'll Do**: - Evangelist for Site Reliability Engineer (SRE) practices in SP Digital (SPD) - Maintain the Reliability tools with regular patching and upgrades - Mange and evolve the full stack observability tools used in SPD - Enhance the customer experience by simplifying the onboarding process and documentation - Work with teams to improve the...
-
Site Reliability Engineer
2 weeks ago
Singapore J P INFOTEC PTE. LTD. Full time**Site Reliability Engineer** **Responsibilities** - Support and/or own the deployment of global products including setting up production and internal environments - Provide 24/7 first line of Engineering support (via follow the sun teams in all regions) for any issues related to global product deployment, availability and internal operations support. -...
-
Site Reliability Engineer
2 weeks ago
Singapore Experis Full time**Site Reliability Engineer**: - Location- Singapore- Job reference- BBBH133368_1699927914- Salary- S$6000 - S$7500 per month- Consultant name - Rajasekar Shirley Monisha Consultant contact no. - 6232 5244 - EA License No. - 02C3423 - Consultant Registration No. - R22106767 **Responsibilities**: - Responsible for deployment, change, issues triage and...
-
Site Reliability Engineer
4 days ago
Singapore GXS BANK PTE. LTD. Full time**Job Description & Requirements**: Get to know the Role: - As a Site Reliability Engineer (SRE) you will help build a meaningful engineering discipline, combining software and systems to develop creative engineering solutions to operations problems. - Much of our support and software development focuses on optimizing existing systems, building...
-
Site Reliability Engineer
6 days ago
Singapore Ambition Singapore Full timeAmbition SingaporeAbout the CompanyAmbition Singapore is a top quantitative trading firm with a results-driven culture, seeking a Site Reliability Engineer to safeguard their innovative services and strategies.
-
Site Reliability Engineer
6 days ago
Singapore DADACONSULTANTS PTE. LTD. Full timeRoles & ResponsibilitiesSenior Site Reliability Engineer (SRE) | Big DataResponsibilities:Manage the full lifecycle of services, from design to deployment and maintenance.Develop and improve automation tools for scalability and reliability.Troubleshoot and resolve software and infrastructure issues, ensuring data security.Optimize system architecture and...
-
Site Reliability Engineer
6 days ago
Singapore INFOSYS COMPAZ PTE. LTD. Full timeRoles & ResponsibilitiesJob DescriptionWe are seeking talented and driven professionals to join our Site Reliability Engineering (SRE) team. This role involves helping organizations enhance the availability, performance, and resilience of their applications and services through the deployment and administration of Observability PlatformsKey Responsibilities...