Principal Site Reliability Engineer
2 weeks ago
We are living in exciting times. Technology is reshaping how we live and we want to redefine how financial services are offered, which is why Singtel and Grab are coming together. Singtel is Asia’s leading communications group connecting millions of consumers and enterprises to essential digital services while Grab is the leading technology company in Southeast Asia offering everyday services to consumers. Together, we have big dreams to unlock and financial inclusion for people in our region is just one. We want to build a digital bank with the right foundation - using data, technology and trust to solve problems and serve customers.
**Get to know the Role**:
As Principal Site Reliability Engineer, you will be one of the few key systems owners for the Digibank. You will work very closely with the Principal Software Engineers and other technical staff to ensure the architecture of our core systems meets world-class security, quick feature velocity, massive business scaling, and mission-critical stability requirements. With your ninja SRE & DevOps skills, you will help design the multi-cloud network and deployment architecture, build infrastructure as a service, and implement the observability platform, security and incident management.
**Some specific activities would include**:
- Implement/own secure and scalable ‘Infrastructure as a Service’ and network architecture of the bank, in a multi-cloud environment
- Act as an infrastructure expert for infrastructure, security and engineering teams in the planning, design and delivery of enterprise solutions.
- Troubleshoot connectivity, performance or failover issues with multi-cloud infrastructure, as needed.
- Lead the analysis of the current technology environment to detect critical deficiencies and recommend solutions for improvement.
- Lead the analysis of technology industry and market trends to determine their potential impact on the enterprise infrastructure architecture.
- Assist in designing the relevant financial regulatory activities associated with ensuring full compliance of the tech systems in the bank.
- Educate the wider engineering organization on design and operational best practices for distributed computing
- Help set SLAs for internal and external services and continually improve operational processes (weekly ops meetings, metrics, etc.)
- Develop or improve guidelines for using cloud services and on-premises data centers
- Represent overall company needs to cloud service providers and work with them to develop any unique features we need
- Build tools and automation to improve system observability, availability, reliability, performance/latency, monitoring and emergency response.
- Work closely with Security, Compliance and Audit teams to ensure Digibank Engineering systems, processes and policies adhere to and exceed the relevant regulatory requirements.
**Job Requirements**
- Strong track record of implementing AWS/GCP/Azure services in a variety of distributed computing environments, with a good understanding of Docker and Kubernetes
- Understanding of CNI/CNCF landscape is good to have
- Strong knowledge of runtimes of Storage/RDBMS and NoSQL databases.
- Experience in implementing multi-cloud networking and deployment architecture.
- Good understanding of the L3/4/7 network layers (including SDN)
- Hands-on design and coding in any one of Python, Shell, Go or Java.
- Strong debugging/troubleshooting skills.
- Experience implementing observability platforms using product suites such as Datadog, New Relic, ELK or Prometheus.
- Strong experience with infrastructure automation and monitoring tools (Terraform, Helm, Ansible, Puppet, Chef, etc.)
- Experience with modern cloud development practices (microservices architectures, REST interfaces, etc.)
- Deep working knowledge of Linux servers and networking.