Avp, Site Reliability Engineer

1 week ago


Singapore DBS Bank Full time

**Business Function**

Group Technology and Operations (T&O) enables and empowers the bank with an efficient, nimble and resilient infrastructure through a strategic focus on productivity, quality & control, technology, people capability and innovation. **In Group T&O, we manage the majority of the Bank's operational processes and inspire to delight our business partners through our multiple banking delivery channels.**

**Responsibilities**
- The Role engineers/leads initiatives for stability and resiliency of Production environments across CORE Banking; involves identification of focus areas, solution development, Observability and automation including AIOPS wherever applicable.
- Application improvements ranging from performance and operational improvements, identification and remediation of system and automate Toils.
- Automation of manual activities/ processes and System Health checks for Production teams. (Automation experience required) and ensuring SLIs/ SLOs are met.
- Working in a stretch role to take forward production issues from analysis to production fix/deployment
- Follow Production Support Processes and giving input to strengthen time to time
- Providing status to leads, stakeholders and working with vendors to review the design/fix/enabling for production deployment
- Own communication for Incidents (SLA breaches, Application Major Incidents, Logistics issue) and responsible for communications with management
- Coordinate recurring issues and ensure long-term resolution through proper Incident and Problem Management
- Working with various teams like Infrastructure, development team to resolve, analysis of root cause for complex issues and outages
- Strong stakeholder management skills with focus on continuous service improvement, consistent delivery, and stability of production.
- Drives Root Cause Analysis with technology partners, post incident resolution and facilitates RCA reviews.
- Work with Risk team to respond timely to Audit & Risk RFIs. Manage Audit walkthroughs
- Must have Good functional knowledge of FINACLE v10.x & v11.x in Payments (Retail and Corporate), Loans, Collaterals, CASA/ TD, WMS and interfaces modules.
- Good to have: Functional knowledge of CRM systems - Finacle/ non-Finacle (e.g., Microsoft CRM, Salesforce CRM etc. or in-house Customer Master/ MDM systems)
- Analyze incidents in Finacle and Customer master systems and providing solution/work around for the issues independently.

**Requirements**:

- An undergraduate degree or higher
- 8-15 years of strong experience in the Banking industry with minimum 3+ years in Run-the-Bank (RTB) lead role with a proven track record of working in Finacle environment
- SRE. Implement Site Reliability Engineering principles with regards to performance, reliability, monitoring, alerting and maintenance in Production environment. Pro-active Capacity monitoring & Observability of production Infrastructure, automated alerting, performance monitoring and reporting tools
- Automation of manual tasks in a CORE Banking ecosystem
- Build and maintain Production monitoring and automation solutions
- Build and implement Service improvements. Identify, measure and report performance trends - SLIs/ SLOs/ SLAs periodically and improve systems performance and associated performance KPIs
- Good knowledge of infrastructure technologies used, with focus on AIX/ Linux/ Openshift/ Oracle/ Postgres/ MariaDB/ Java in a large Banking environment.
- Solid understanding of BAU support, incident, problem management processes as well as escalation management across a diversified environment
- Strong team player, effective at communicating internationally and used to working closely with remote teams
- Understanding of Risk Management, Disaster Recovery, Business Continuity, IT Security Architecture, and IT Regulatory Compliance.
- Present facts and recommendations effectively in oral and written form
- Pro-active, independent, resourceful, and able to work in a team
- High attention to detail with focus on understanding the issues with finding solutions

**Technology Requirements**
- Hands-on SRE/ Production support experience on Core Banking platforms - Finacle and Customer master systems
- Strong technical skills, e.g., scripting or programming experience, DBA/SA skills etc.
- Strong transformation and change management experience
- Software Configuration Mgmt., Quality Control Mgmt, Version Control Mgmt
- Operating System - AIX / Linux
- Cloud platforms. Openshift/ AWS/ PCF/ Kubernetes
- Database - Oracle / Postgres/ EDB/ MariaDb, In-memory database - Redis
- Application Servers - IBM WebSphere, JBoss
- Middleware technology - MQ, File transfers
- Eventing systems - Kafka
- Working Knowledge of Microservices/ API Mgmt
- Scheduling software - Tivoli
- Knowledge of Python. Exposure to building Dashboards
- Good knowledge of Machine Learning Algorithms like Regression, Classification, Decision Tree, Random Forest, Bagging & Boosting Techniques like XGBoost



  • Singapore IDEMIA Full time

    Join to apply for the Site Reliability Engineer role at IDEMIA Join to apply for the Site Reliability Engineer role at IDEMIA Get AI-powered advice on this job and more exclusive features. PurposeThis role plays a critical part in ensuring reliability, scalability, and performance of our systems and services. You will work closely with development and...


  • Singapore IDEMIA Full time

    Join to apply for the Site Reliability Engineer role at IDEMIA Join to apply for the Site Reliability Engineer role at IDEMIA Get AI-powered advice on this job and more exclusive features. PurposeThis role plays a critical part in ensuring reliability, scalability, and performance of our systems and services. You will work closely with development and...


  • Singapore Ethos BeathChapman Full time

    **Job Details**: **Location** Singapore **Salary** Competitive Salary **Job Type** Permanent **Ref** BH-17681 **Contact** Zain Hussain- **Posted** about 3 hours ago - We are partnering with a global bank, who are rapidly expanding their infrastructure team in Singapore and looking to hire multiple Site Reliability Engineers.**Responsibilities**: -...


  • Singapore IDEMIA Full time

    Join to apply for the Site Reliability Engineer role at IDEMIA Join to apply for the Site Reliability Engineer role at IDEMIA Get AI-powered advice on this job and more exclusive features. Purpose This role plays a critical part in ensuring reliability, scalability, and performance of our systems and services. You will work closely with development and...


  • Singapore beBeeSiteReliability Full time $90,000 - $120,000

    Unlock Your Full Potential in Site Reliability EngineeringAbout the RoleThis is an exciting opportunity to work with a global banking institution, leveraging your skills in production management and site reliability engineering to drive business growth.Develop and implement proactive, predictive models for shift production management using SRE...


  • Singapore beBeeSiteReliability Full time

    Unlock Your Full Potential in Site Reliability Engineering About the Role This is an exciting opportunity to work with a global banking institution, leveraging your skills in production management and site reliability engineering to drive business growth. Develop and implement proactive, predictive models for shift production management using SRE...


  • Singapore Hyphen Connect Full time

    Site Reliability Engineer (Crypto Trading) Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect Site Reliability Engineer (Crypto Trading) 2 days ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect We are hiring for one of our ecosystem projects in...


  • Singapore DHATCH CONSULTANCY PTE. LTD. Full time

    Site Reliability Engineer: **Preferred Qualifications** - 3+ years of experience in site reliability engineering, DevOps, or software engineering roles. - Proven skills in: - Monitoring & alerting tools (Grafana, New Relic) - CI/CD pipelines (Git, Jenkins, GitHub Actions, etc.) - Container orchestration (Docker, Kubernetes) - Infrastructure-as-code...


  • Singapore Hyphen Connect Full time

    Site Reliability Engineer (Crypto Trading) Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect Site Reliability Engineer (Crypto Trading) 2 days ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect We are hiring for one of our ecosystem...


  • Singapore HCLTech Full time

    Get AI-powered advice on this job and more exclusive features. This role combines software and systems engineering to build run, and maintain high performant, distributed, fault tolerant and resilient financial systems. Site Reliability Engineers focus on ensuring a joyful customer journey. As a Site Reliability Engineer you will be filling a...