Site Reliability Engineer
3 days ago
**This is Adyen**
Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition.
For our teams, we create an environment with opportunities for our people to succeed, backed by the culture and support to ensure they are enabled to truly own their careers. We are motivated individuals who tackle unique technical challenges at scale and solve them as a team. Together, we deliver innovative and ethical solutions that help businesses achieve their ambitions faster.
**Site Reliability Engineer**
We provide our merchants with a single platform capable of meeting the rapidly evolving needs of today's fast-growing global businesses. To meet the high expectations of our merchants, Adyen has adopted and embedded principles from the Site Reliability Engineering discipline, creating an environment where data-driven decisions, intellectual curiosity, problem solving and openness are key drivers of success.
As Site Reliability Engineers we are responsible for the stability and reliability of our financial technology platform.
Our mission is to enable engineering teams to run their products reliably. We operate under the following three principles, which guide our day-to-day work:
- We embrace calculated risk
- We use SLOs to drive platform stability and innovation
- We eliminate toil through automation
This is a technical position: we need experienced engineers to help the rest of the engineering organization design, implement, maintain, scale and troubleshoot our platform.
**Who you are**
- You have strong familiarity with SRE practices and methodologies such as SLOs, error budgets, incident management and reducing toil;
- You have either a software engineering or infrastructure background, and you are the one who writes code to automate problems away;
- You enjoy troubleshooting and you always want to get to the root cause of the problem;
- You embrace calculated risk and failures;
- You see other engineering teams as your clients and you want to ensure they are confident in building, operating and maintaining reliable services;
- You have experience with building, operating and troubleshooting large-scale distributed systems spanning multiple data centers across the globe;
- You have a strong technical foundation ranging from networking and infrastructure to services and databases;
- You have a mindset for building sustainable, scalable and resilient long-term solutions;
- You are skilled in one or more programming or scripting languages such as Python, Java or bash;
- You have a good understanding of Infrastructure as Code and experience with configuration management and automation tools such as Puppet and Ansible.
**What you’ll do**
- Keep enabling other teams by designing and implementing solutions that improve the reliability and performance of our systems and services;
- Work on the automation and scalability of existing components of the platform;
- Be involved in key architectural decisions that determine the future of our platform;
- Troubleshoot and investigate complex technical issues, being involved from discovery to post-mortem;
- Together with the team, lead the way in continuously improving our incident management and on-call processes.
**Our Diversity, Equity and Inclusion commitments**
Our unique approach is a product of our diverse perspectives. This diversity of backgrounds and cultures is essential in helping us maintain our momentum. Our business and technical challenges are unique, and we need as many different voices as possible to join us in solving them - voices like yours. No matter who you are or where you’re from, we welcome you to be your true self at Adyen.
**What’s next?**
This role is based out of our Singapore office. We are an office-first company and value in-person collaboration; we do not offer remote-only roles.
-
Site Reliability Engineer
5 days ago
Singapore TRUEWATCH TECHNOLOGY INC PTE. LTD. Full time
**Responsibility**: - Run production environment by monitoring availability and taking a holistic view of the system health. - Achieve site reliability automation, minimize system downtime, and reduce site reliability cost. - Manage risks and resolve issues that affect the release scope, schedule and quality. - Suggest architecture improvements, push for...
-
Site Reliability Engineer
2 weeks ago
Singapore ETEAM WORKFORCE PTE. LTD. Full time
Position: Site Reliability Engineer (SRE). Work Mode: Onsite/Hybrid. Timing: 9am to 6pm. Duration: 1 Year (Highly extendable). Salary: 6018 SGD. Work Location: Robinson Road, Singapore.
About the Role: We are looking for a seasoned Site Reliability Engineer (SRE) with 5+ years of experience to join our Platform Engineering team. This role is ideal for someone...
-
Site Reliability Engineer
6 days ago
Singapore JJ Consulting Services Full time
Our client is a fast-growing company in Singapore that is seeking to recruit a Site Reliability Engineer. **Site Reliability Engineer** **Key Roles & Responsibilities** - Providing ancillary support of Enterprise-Grade Products and solutions at customer's sites - Ironing out deployment issues or challenges that our customers may face - Responsible for...
-
Site Reliability Engineer
3 days ago
Singapore Qlik Full time
**What makes us Qlik?** A Gartner® Magic Quadrant Leader for 14 years in a row, Qlik transforms complex data landscapes into actionable insights, driving strategic business outcomes. Serving over 40,000 global customers, our portfolio leverages pervasive data quality and advanced AI/ML capabilities that lead to better decisions, faster. We excel in...
-
Site Reliability Engineer
2 weeks ago
Singapore ABAXX SINGAPORE PTE. LTD. Full time
Site Reliability Engineer - Networking. We are seeking a competent candidate to join our Infrastructure Team, whose mission is building and operating a MAS-regulated marketplace and clearing house. This role is ideal for someone with a strong foundation in AWS services, infrastructure as code, and cloud security, who is passionate about building scalable, secure,...
-
Site Reliability Engineer
2 weeks ago
Singapore Crystal Equation Corporation Full time
We are seeking a skilled Site Reliability Engineer (SRE) to join our team. The SRE will be responsible for keeping all internal user-facing applications and other production systems running smoothly. This hybrid role involves a combination of both development and operations skills to build and manage systems that are both efficient and reliable. The Enterprise...
-
Site Reliability Engineer
1 week ago
Singapore Point72 Full time
About the role: As part of Point72’s Technology Team, you will focus on developing and maintaining complex, distributed, real-time systems that support our Global Macro business. Your responsibilities will include optimizing operations through automation, building foundational SRE components,...
-
Site Reliability Engineer
5 days ago
Singapore APPLE SOUTH ASIA PTE. LTD. Full time
Summary: At Apple, new ideas have a way of becoming excellent products, services, and customer experiences very quickly. Bring passion and dedication to your job and there’s no telling what you could accomplish. The people here at Apple don’t just build products - they craft the kind of wonder that’s revolutionized entire industries. It’s the...
-
Site Reliability Engineer
1 week ago
Singapore DT One Full time
About DT One: DT One was founded to provide mobile carriers with the infrastructure and services they need to help migrant workers stay in touch with their family and friends back home. Today we operate a leading global network for mobile top‑up solutions, innovative mobile rewards, and Phone‑to‑Phone solutions. Our global network delivers better...
-
Site Reliability Engineer
3 days ago
Singapore Second Talent Full time
Infrastructure Platform Development: Design, build, and enhance infrastructure operation platforms. Develop and maintain systems for infrastructure management, CI/CD pipelines, monitoring/alerting, and centralized logging. Drive platform standardization and automation initiatives. High Availability & Reliability: Ensure maximum uptime for production services...