Vp, Platform Sre Engineer, Sre

2 weeks ago


Singapore DBS Bank Full time

Job ObjectiveDBS Bank is looking for a Platform SRE Engineer with experience working on enterprise level data engineering, analytics, and observability applications. The SRE engineer would be responsible for ensuring high availability of the platform services and perform continuous improvements to increase the platform’s efficiency and resiliency. The SRE engineer will also perform automation development tasks to remove toil and increase the team’s productivity.Roles and Responsibilities* Develop monitoring and onboarding guidelines for various applications using observability platform stack, ensuring accurate monitoring and data collection.* Drive Observability standards, best practices, operations and processes for the Enterprise in AppDynamics & other observability tools* Automate routine tasks and reporting processes using APIs and scripting, reducing manual effort and improving efficiency in AppDynamics & other observability tools* Identify and resolve performance issues through detailed analysis of transaction traces, application logs, and system metrics.* Collaborate with stakeholders to define performance metrics and monitoring requirements aligned with business goals.* Contribute to internal knowledge bases, create documentation, and share insights with the team to promote a culture of learning and collaboration.* Design and implement monitoring solutions to track application performance, identifying bottlenecks and optimising system efficiency.* Conduct performance tuning and capacity planning to ensure applications meet scalability and reliability requirements.* Develop custom dashboards and reports to provide actionable insights and drive decision-making processes.* Collaborate with development and operations teams to integrate Observability platform stack with CI/CD pipelines and other DevOps tools.* Configure and fine-tune alerts to proactively detect and address performance issues before they impact end-users.* Continuously review and enhance monitoring processes and methodologies to improve efficiency and effectiveness.* Work with application teams to develop long-term monitoring strategies that align with business goals and technology roadmaps.* Create data retention polices and access controls (RBAC) to manage user permissions.* Perform application maintenance, patching, upgrading controller versions, agents etc and ensure EOS/EOL is maintained.Deliverables* Ensure on-time delivery of tasks and projects.* Ensure continuous uptime of applications and services.* Ensure no security or audit issues.Job Dimensions* Comply to bank standards to track and follow up on the assigned projects.* Cover all areas in application and infrastructure operations of the platform.Requirements* You should be a university graduate (computer science or related field) with good experience working with contemporary technologies and scripting languages.* Strong communication skills and ability to explain protocol and processes with team and management* A passion for learning and using new technologies in the open-source communities.* A passion for coding.* Min 10 years of IT work experience.* Working knowledge in AppDynamics, ELK Stack, Grafana, Open Telemetry (OTEL)* In-depth experience in Unix/Linux/Shell/Python scripting with quality, scalability, and extensibility.* Experience in triaging and troubleshooting application problems quickly in monitoring tools by using various techniques - Transaction snapshots, Diagnostic Sessions, Data Collectors* Knowledgeable and experienced in SRE (Site Reliability Engineering) practices covering monitoring, observability, performance management, automation, and resiliency.* Knowledge in Confluent Kafka, Prometheus & other APM tools (Dynatrace, Datadog, New Relic, Splunk) is a plus.* Knowledge in AI/ML capabilities to automate RCA’s and shorter MTTR when issues arise.* Good understanding of Network routing, Load balancing and Networking protocols; a base knowledge of TCP/IP, with an understanding of HTTP and DNS* Ability to contribute to discussions on design and strategy.* Adequate knowledge of database systems (RDBMS, MariaDB, SQL, NOSQL), Object Oriented Programming and web application development.* Good problem diagnosis and creative problem-solving skills* Experience in NodeJS, Spring boot could be a plus.en



  • Singapore DBS Bank Limited Full time

    VP, Technology Risk Manager, SRE&Governance, Group Technology VP, Technology Risk Manager, SRE&Governance, Group Technology VP, Technology Risk Manager, SRE&Governance, Group Technology Business Function Group Technology enables and empowers the


  • Singapore DBS Bank Full time

    VP, Technology Risk Manager, SRE&Governance, Group Technology Join to apply for the VP, Technology Risk Manager, SRE&Governance, Group Technology role at DBS

  • Sre/devops Engineer

    2 weeks ago


    Singapore Skill Quotient Technologies Inc Full time

    **Role **: SRE/DevOps Engineer **Location **:Singapore **Payroll**: Skill Quotient **Experience** : 5-10 years **Requirements**: - **Experience**: 5+ years as a Platform Engineer or in a similar role like DevOps,SRE. - **Cloud Proficiency**: Strong experience with AWS or equivalent cloud environments. - **Operating Systems**: Expertise in Windows and...

  • Cloud SRE Engineer

    2 days ago


    Singapore OCBC Full time

    Join to apply for the Cloud SRE Engineer - Linux role at OCBC 2 days ago Be among the first 25 applicants Join to apply for the Cloud SRE Engineer - Linux role at OCBC Who We AreAs Singapore's longest established

  • Sre Manager

    2 weeks ago


    Singapore TikTok Full time

    Responsibilities About TikTok TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul, and Tokyo. Why Join Us Creation is the core of TikTok's purpose. Our platform is built to help...

  • Engineer, Sre

    2 weeks ago


    Singapore Sea Limited Full time

    The SRE and Infrastructure teams in Sea Labs manage thousands of servers which serve millions of users. As an SRE Engineer, you will work with the team to improve the availability and reliability of our services, and drive our service management to the next level. - Engage in the design, implementation, testing and operation of our on-prem Kubernetes...


  • Singapore Citi Full time

    **Overview of Citi**: Citi, the world leading global bank, has approximately 200 million customer accounts and a presence in more than 160 countries and jurisdictions worldwide. Citi provides consumers, corporations, governments, and institutions with a broad range of financial products and services, including consumer banking and credit, corporate and...


  • Singapore NodeFlair Full time

    **Job Summary**: **Salary** S$6,500 - S$8,000 / Monthly **Job Type** **Seniority** Mid **Years of Experience** At least 5 years **Tech Stacks** Container Powershell GitLab AWS Terraform Jenkins play GitLab CI CI ELK Git Azure Grafana Prometheus Splunk Kubernetes Ansible Python **Company Overview**: We are a leading technology fintech company at the...

  • Public Cloud Sre

    2 weeks ago


    Singapore DBS Bank Full time

    Role Responsibilities - ; Partner with DBS development teams to help reproduce and resolve public cloud platform issues. - ; Taking ownership of incidents reported and coordinating with L3 and engineering teams for resolution - ; Constantly learn and use cutting edge cloud technologies - ; Leverage your extensive customer support experience to provide...


  • Singapore ByteDance Full time

    About the Company Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content. Why Join...