Observability Engineer/SRE

7 days ago


Singapore AVENSYS CONSULTING PTE. LTD. Full time

Avensys is a reputed global IT professional services company headquartered in Singapore. Our service spectrum includes enterprise solution consulting, business intelligence, business process automation and managed services. Given our decade of success we have evolved to become one of the top trusted providers in Singapore and service a client base across banking and financial services, insurance, information technology, healthcare, retail, and supply chain. We are currently looking to hire Observability Engineer/SRE . This is an exciting opportunity to expand your skill set, achieve job satisfaction and work-life balance. More details as below. Role Overview We are looking for a highly motivated Observability Engineer / SRE to maintain, enhance, and optimize our enterprise-grade monitoring, logging, and tracing platforms. The ideal candidate will have strong hands‐on experience with open‐source observability tools, Kubernetes, and CI/CD pipelines, and will play a key role in driving observability practices across development and operations teams. Key Responsibilities Maintain and optimize open‐source monitoring infrastructure ; perform migration or enhancements as required. Support application teams with OpenShift/Kubernetes deployments , including upgrades, troubleshooting, and automation. Implement application instrumentation using OpenTelemetry and related frameworks. Manage metrics data stores (e.g., Prometheus ) and perform cardinality and resource optimization . Administer and tune distributed tracing infrastructure (Jaeger, Zipkin, OpenTelemetry). Provide production support for logging platforms such as ELK Stack and Grafana Loki ; manage Index Lifecycle in Elasticsearch. Configure and integrate alerting systems (PagerDuty, MS Teams); define alert rules with application teams. Deploy and administer visualization tools (Grafana, Kibana); create reusable dashboards and implement RBAC . Promote observability best practices — define SLIs, SLOs, error budgets, and help reduce MTTD/MTTR. Troubleshoot and secure observability infrastructure on Linux VMs and Kubernetes Pods (TLS, OAuth, LDAPS, MFA). Configure and enhance CI/CD pipelines for monitoring infrastructure across multiple environments. Technical Skills Required Elasticsearch / Kibana – Cluster Management, Search Optimization Prometheus / Grafana – Metrics & Visualization OpenTelemetry – Instrumentation & Tracing Kubernetes / OpenShift – Deployments, CI/CD Integration Linux OS – Troubleshooting and Performance Tuning Good understanding of SRE and Observability principles (SLI/SLOs, error budgets, alerting practices)Ideal Candidate 2–6 years of experience in Observability / SRE / DevOps / Platform Engineering . Strong problem‐solving and troubleshooting skills in production environments. Excellent communication and collaboration skills with development and infrastructure teams. WHAT'S ON OFFER You will be remunerated with an excellent base salary and entitled to attractive company benefits. Additionally, you will get the opportunity to enjoy a fun and collaborative work environment, alongside a strong career progression. To submit your application, please apply online or email your UPDATED CV in Microsoft Word format to Your interest will be treated with strict confidentiality. CONSULTANT DETAILS Consultant Name: Deepa Shivakoti Reg No: R Avensys Consulting Pte Ltd EA Licence 12C5759Privacy Statement: Data collected will be used for recruitment purposes only. Personal data provided will be used strictly in accordance with the relevant data protection law and Avensys' privacy policy. #J-18808-Ljbffr


  • AVP/VP, Observability

    2 weeks ago


    Singapore GIC Full time

    AVP/VP, Observability & SRE Engineering, Technology Group Join to apply for the AVP/VP, Observability & SRE Engineering, Technology Group role at GIC. GIC is one of the world’s largest sovereign wealth funds. With over 2,000 employees across 11 locations, we invest in more than 40 countries globally across asset classes and businesses. Working at GIC gives...


  • Singapore GIC Full time

    AVP/VP, Observability & SRE Engineering, Technology Group Join to apply for the AVP/VP, Observability & SRE Engineering, Technology Group role at GIC. GIC is one of the world's largest sovereign wealth funds. With over 2,000 employees across 11 locations, we invest in more than 40 countries globally across asset classes and businesses. Working at GIC gives...


  • Singapore DBS Bank Full time

    AVP, SRE Observability Platform Engineer, SRE & Governance, Group Technology Join to apply for the AVP, SRE Observability Platform Engineer, SRE & Governance, Group Technology role at DBS


  • Singapore AVENSYS CONSULTING PTE. LTD. Full time

    Roles & Responsibilities Avensys is a reputed global IT professional services company headquartered in Singapore. Our service spectrum includes enterprise solution consulting, business intelligence, business process automation and managed services. Given our decade of success we have evolved to become one of the top trusted providers in Singapore and service...

  • AVP/VP, Observability

    2 weeks ago


    Singapore GIC Private Limited Full time

    Overview GIC is one of the world's largest sovereign wealth funds. With over 2,000 employees across 11 locations around the world, we invest in more than 40 countries globally across asset classes and businesses. Working at GIC gives you exposure to an extraordinary network of the world's industry leaders. As a leading global long-term investor, we Work at...


  • Singapore OPENSOURCE PTE. LTD. Full time

    Position: Site Reliability Engineer (SRE Support) **Responsibilities**: Demonstrate proficiency in automating manual tasks using Terraform scripting and other automation tools. Utilize Datadog as an observability tool to monitor and analyze system performance. Technical Skill Set: Strong expertise in Site Reliability Engineering (SRE) principles and...


  • Singapore Opensource Pte Ltd. Full time

    Position: Site Reliability Engineer (SRE Support) **Responsibilities**: Demonstrate proficiency in automating manual tasks using Terraform scripting and other automation tools. Utilize Datadog as an observability tool to monitor and analyze system performance. Technical Skill Set: Strong expertise in Site Reliability Engineering (SRE) principles and...


  • Singapore DBS Bank Full time

    Job ObjectiveDBS Bank is looking for a Platform SRE Engineer with experience working on enterprise level data engineering, analytics, and observability applications. The SRE engineer would be responsible for ensuring high availability of the platform services and perform continuous improvements to increase the platform’s efficiency and resiliency. The SRE...


  • Singapore Krisvconsulting Services Pte Ltd Full time

    We are seeking talented and driven professionals to join our Site Reliability Engineering (SRE) team. This role involves helping organizations enhance the availability, performance, and resilience of their applications and services through the deployment and administration of Observability Platforms Responsibilities:- Deploy and manage Observability...


  • Singapore Krisvconsulting Services Pte Ltd Full time

    Overview We are seeking talented and driven professionals to join our Site Reliability Engineering (SRE) team. This role involves helping organizations enhance the availability, performance, and resilience of their applications and services through the deployment and administration of Observability Platforms Responsibilities Deploy and manage Observability...