Principal Site Reliability Engineer

2 weeks ago


Singapore GXS BANK PTE. LTD. Full time

We are living in exciting times. Technology is reshaping how we live and we want to redefine how financial services are offered, which is why Singtel and Grab are coming together. Singtel is Asia’s leading communications group connecting millions of consumers and enterprises to essential digital services while Grab is the leading technology company in Southeast Asia offering everyday services to consumers. Together, we have big dreams to unlock and financial inclusion for people in our region is just one. We want to build a digital bank with the right foundation - using data, technology and trust to solve problems and serve customers.

**Get to know the Role**:
As Principal Site Reliability Engineer, you will be one of the few key systems owners for the Digibank You will work very closely with the Principal Software Engineers and other technical staff in ensuring the architecture of our core systems meets world-class security, quick feature velocity, massive business scaling, and mission-critical stability requirements. WIth your ninja SRE & DevOps skills, you will help in designing multi cloud network/deployment architecture, building infrastructure as service, implementation of Observability platform, Security and incident management.

**Some specific activities would include**:

- Implement/own secure and scalable ‘Infrastructure as a Service’ and network architecture of the bank, in a multi-cloud environment
- Acts as an infrastructure expert for infrastructure, security and engineering teams in the plan, design and delivery of enterprise solutions.
- Troubleshoot the connectivity, performance or failover issues with Multi-Cloud infrastructure, as needed.
- Lead the analysis of the current technology environment to detect critical deficiencies and recommend solutions for improvement and lead the analysis of technology industry, market trends to determine their potential impact on the enterprise infrastructure architecture.
- Assist in designing the relevant financial regulatory activities associated with ensuring full compliance of the tech systems in the bank.
- Educate the wider engineering organization on design and operational best practices for distributed computing
- Helping set SLAs for internal and external services and continual improvement of operational processes (weekly ops meetings, metrics, etc)
- Developing or improving guidelines for using cloud services and on-premises data centers
- Representing overall company needs to cloud service providers and working with them to develop any unique features we need
- Build tools and automation to improve system's observability, availability, reliability, performance/latency, monitoring, emergency response.
- Work closely with Security, Compliance and Audit teams to ensure Digibank Engineering systems, processes and policies adhere to and exceed the relevant regulatory requirements.

**Job Requirements**
- Strong track record of implementing AWS/GCP/Azure services in a variety of distributed computing environments, with good understanding on Docker, Kubernetes
- Understanding of CNI/CNCF landscape is good to have
- Strong knowledge of runtimes of Storage/RDBMS and No-SQL databases.
- Experience in implementing multi cloud networking and deployment architecture.
- Good understanding of the L3/4/7 network layers (including SDN)
- Hand on design, coding on any one of - Python, Shell, Go or Java.
- Strong debugging/troubleshooting skills.
- Experience on implementing observability platforms using any of products suites like DataDog, NewRelic, ELK, Prometheus.
- Strong Experience with infrastructure automation and monitoring tools
- Terraform, Helm, Ansible, Puppet, Chef, etc.
- Experience with modern cloud development practices (microservices architectures, REST interfaces, etc. )
- Deep working knowledge on Linux servers and networking.



  • Singapore NXP Semiconductors Full time

    Senior Principal Reliability Engineer page is loaded## Senior Principal Reliability Engineerlocations: Singaporetime type: Full timeposted on: Posted Todayjob requisition id: R- We are looking for Reliability Engineer role in preparation for the formation of the joint venture of NXP and VIS, known as VSMC.**Job Description**This posting is for a Senior...


  • Singapore Oracle Full time

    Job Description At Oracle Cloud Infrastructure (OCI), we build the more intelligent future of cloud. OCI Sovereign Cloud is a team of smart, motivated, and diverse people that are focused on bringing the world's most important work to OCI. We build and operate our government, classified, and sovereign cloud regions to be reliable and high performance, just...


  • Singapore Sea Limited Full time

    Engineering and Technology - Infrastructure, Singapore - Entry Level Our DevOps Engineering team plays an important role in developing and maintaining the internal systems and tools for the Infrastructure team. As a Site Reliability Engineer, you are responsible for improving the availability and reliability of our Infrastructure services. - Responsible for...


  • Singapore Hyphen Connect Full time

    Site Reliability Engineer (Crypto Trading) Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect Site Reliability Engineer (Crypto Trading) 2 days ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect We are hiring for one of our ecosystem projects in...


  • Singapore Vega Solutions Full time

    Join to apply for the Site Reliability Engineer role at Vega Solutions Join to apply for the Site Reliability Engineer role at Vega Solutions Get AI-powered advice on this job and more exclusive features. Tokka Labs | Singapore | Full-TimeTokka Labs is a proprietary trading firm with a focus on close collaboration, rigorous research, and cutting-edge...


  • Singapore DHATCH CONSULTANCY PTE. LTD. Full time

    Site Reliability Engineer: **Preferred Qualifications** - 3+ years of experience in site reliability engineering, DevOps, or software engineering roles. - Proven skills in: - Monitoring & alerting tools (Grafana, New Relic) - CI/CD pipelines (Git, Jenkins, GitHub Actions, etc.) - Container orchestration (Docker, Kubernetes) - Infrastructure-as-code...


  • Singapore TEAMLEASE DIGITAL CONSULTING PTE. LTD. Full time

    As a Site Reliability Engineer, you will be filling a mission-critical role ensuring that our systems are healthy, monitored, automated, fault-tolerant and designed to scale. You will collaborate and work closely with engineering teams to continually improve our production services, facilitating fast delivery of new products, and reducing downtime. Key...


  • Singapore HCLTech Full time

    Get AI-powered advice on this job and more exclusive features. This role combines software and systems engineering to build run, and maintain high performant, distributed, fault tolerant and resilient financial systems. Site Reliability Engineers focus on ensuring a joyful customer journey. As a Site Reliability Engineer you will be filling a...


  • Singapore Vega Solutions Full time

    Join to apply for the Site Reliability Engineer role at Vega SolutionsJoin to apply for the Site Reliability Engineer role at Vega SolutionsGet AI-powered advice on this job and more exclusive features.Tokka Labs | Singapore | Full-TimeTokka Labs is a proprietary trading firm with a focus on close collaboration, rigorous research, and cutting-edge...


  • Singapore Tardis Group Full time

    Direct message the job poster from Tardis Group Recruiter at Tardis Group | Finding Top Talent in Tech & Quant About the Company A rapidly growing technology firm operating at the forefront of artificial intelligence and advanced software solutions. The company fosters a fast-paced, collaborative, and innovation-driven culture, uniting talent across...