Principal Site Reliability Engineer

2 days ago


Singapore GXS BANK PTE. LTD. Full time

We are living in exciting times. Technology is reshaping how we live and we want to redefine how financial services are offered, which is why Singtel and Grab are coming together. Singtel is Asia’s leading communications group connecting millions of consumers and enterprises to essential digital services while Grab is the leading technology company in Southeast Asia offering everyday services to consumers. Together, we have big dreams to unlock and financial inclusion for people in our region is just one. We want to build a digital bank with the right foundation - using data, technology and trust to solve problems and serve customers.

**Get to know the Role**:
As Principal Site Reliability Engineer, you will be one of the few key systems owners for the Digibank. You will work very closely with the Principal Software Engineers and other technical staff to ensure the architecture of our core systems meets world-class security, quick feature velocity, massive business scaling, and mission-critical stability requirements. With your ninja SRE & DevOps skills, you will help design multi-cloud network and deployment architecture, build infrastructure as a service, and implement the observability platform, security, and incident management.

**Some specific activities would include**:

- Implement/own secure and scalable ‘Infrastructure as a Service’ and network architecture of the bank, in a multi-cloud environment
- Act as an infrastructure expert for infrastructure, security and engineering teams in the planning, design and delivery of enterprise solutions.
- Troubleshoot connectivity, performance or failover issues with multi-cloud infrastructure, as needed.
- Lead the analysis of the current technology environment to detect critical deficiencies and recommend improvements, and analyse technology industry and market trends to determine their potential impact on the enterprise infrastructure architecture.
- Assist in designing the relevant financial regulatory activities associated with ensuring full compliance of the tech systems in the bank.
- Educate the wider engineering organization on design and operational best practices for distributed computing
- Help set SLAs for internal and external services and continually improve operational processes (weekly ops meetings, metrics, etc.)
- Develop or improve guidelines for using cloud services and on-premises data centers
- Represent overall company needs to cloud service providers and work with them to develop any unique features we need
- Build tools and automation to improve system observability, availability, reliability, performance/latency, monitoring, and emergency response.
- Work closely with Security, Compliance and Audit teams to ensure Digibank Engineering systems, processes and policies adhere to and exceed the relevant regulatory requirements.

**Job Requirements**
- Strong track record of implementing AWS/GCP/Azure services in a variety of distributed computing environments, with a good understanding of Docker and Kubernetes
- Understanding of CNI/CNCF landscape is good to have
- Strong knowledge of storage, RDBMS and NoSQL database runtimes.
- Experience in implementing multi-cloud networking and deployment architecture.
- Good understanding of the L3/4/7 network layers (including SDN)
- Hands-on design and coding experience in at least one of Python, Shell, Go or Java.
- Strong debugging/troubleshooting skills.
- Experience implementing observability platforms using product suites such as DataDog, NewRelic, ELK or Prometheus.
- Strong experience with infrastructure automation and monitoring tools such as Terraform, Helm, Ansible, Puppet and Chef.
- Experience with modern cloud development practices (microservices architectures, REST interfaces, etc.)
- Deep working knowledge of Linux servers and networking.



  • Singapore Oracle Full time

    Principal Site Reliability Engineer at Oracle. As a Site Reliability Engineer, you will be responsible for the operation of production environments, including systems and databases, supporting critical business operations for Singapore’s governmental sovereign cloud environment. You will be focused on automation and...


  • Singapore NXP Semiconductors Full time

    Senior Principal Reliability Engineer. Locations: Singapore. Time type: Full time. Posted on: Posted Today. Job requisition id: R- We are looking for a Reliability Engineer in preparation for the formation of the joint venture of NXP and VIS, known as VSMC. **Job Description** This posting is for a Senior...


  • Singapore RigNet Full time

    About us One team. Global challenges. Infinite opportunities. At Viasat, we’re on a mission to deliver connections with the capacity to change the world. For more than 35 years, Viasat has helped shape how consumers, businesses, governments and militaries around the globe communicate. We’re looking for people who think big, act fearlessly, and create an...


  • Singapore ABAXX SINGAPORE PTE. LTD. Full time

    Site Reliability Engineer - Networking. We are seeking a competent candidate to join our Infrastructure Team, whose mission is building and operating a MAS-regulated marketplace and clearing house. This role is ideal for someone with a strong foundation in AWS services, infrastructure as code, and cloud security, who is passionate about building scalable, secure,...


  • Singapore Jobgether Full time

    Service Reliability Engineer (Multiple locations). About Jobgether: Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching. About...


  • Singapore Abaxx Commodity Futures Exchange and Clearinghouse Full time

    Site Reliability Engineer - Networking. We are seeking a competent candidate to join our Infrastructure Team, whose mission is building and operating a MAS-regulated marketplace and clearing house. This role is ideal for someone with a strong foundation in AWS services, infrastructure as code, and cloud security, who is passionate about building scalable,...


  • Singapore NetEase Games Full time

    Site Reliability Engineer at NetEase Games. As a leading internet technology company based in China, NetEase provides premium online services centered around content creation and operates a broad gaming ecosystem. Job Description: Site Reliability Engineering (SRE) refers to using software engineering methods to manage...

