Information Technology Observability Engineer
2 weeks ago
Assurity Trusted Solutions (ATS) is a wholly owned subsidiary of the Government Technology Agency (GovTech). As a Trusted Partner over the last decade, ATS offers a comprehensive suite of products and services ranging from infrastructure and operational services, authentication services, governance and assurance services as well as managed processes. In a dynamic digital and cyber landscape, where trust & collaboration are key, ATS continues to drive mutually beneficial business outcomes through collaboration with GovTech, government agencies and commercial partners to mitigate cyber risks and bolster security postures.
We are looking for an experienced Observability Engineer to lead and evolve our monitoring, logging and observability strategy and implementation across various projects. You will play a pivotal role in defining and translating observability and monitoring requirements into scalable technical solutions, ensuring that our systems are reliable, performant, and transparent. You’ll partner closely with internal engineering, operations, and product teams, as well as external contractors, to drive efficiency and improve user experience through data-driven insights.
You should treat observability as a product: designing, owning, and continuously improving our telemetry stack to support a proactive and performance-oriented culture.
**Responsibilities**
- Translate business and operational goals into technical specifications and instrumentation strategies with actionable telemetry (metrics, logs, traces, events).
- Work with internal engineers and external contractors to develop and maintain dashboards, alerts, and log pipelines to support proactive monitoring, troubleshooting, and incident response.
- Champion observability best practices (metrics, logs, traces, events) across teams and advocate for Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets to drive operational maturity.
- Own and evolve the observability platform as a product—including roadmap planning, stakeholder engagement, user enablement, and adoption tracking.
- Monitor and continuously improve the reliability, performance and scalability of the observability stack.
- Identify and address gaps in visibility that impact operational efficiency and user experience.
- Train and enable engineers, developers and operators to effectively use observability tools and data for infrastructure troubleshooting, debugging and performance optimization.
**Requirements**
- 5+ years in infrastructure monitoring, observability, or SRE-focused roles.
- Deep understanding of observability principles, including telemetry collection, correlation, and analysis (metrics, logs, traces).
- Experience with observability stacks and tools such as Prometheus, Grafana, OpenTelemetry, Elastic Stack, Datadog, New Relic, or similar.
- Strong systems thinking with the ability to break down complex technical requirements and communicate effectively with both technical and non-technical stakeholders.
- Proven experience working with or managing external contractors and vendors.
- Experience defining and managing SLOs and SLIs to drive operational excellence.
- A product mindset—proactive in identifying improvement opportunities, user pain points, and driving platform adoption.
- Strong understanding of infrastructure monitoring concepts, including node health, resource utilization, system-level metrics, and log aggregation at scale.
**Nice to Have**
- Background in infrastructure engineering.
- Familiarity with incident management, root cause analysis, and chaos engineering practices.
- Exposure to distributed systems, networking, compute or storage observability.
- Knowledge of compliance, security, and governance considerations related to telemetry data.
- Hands-on experience monitoring infrastructure services (e.g. compute, storage, network and containers).
Join us and discover a meaningful and exciting career with Assurity Trusted Solutions.
**Benefits**
- A wholly-owned subsidiary of GovTech.
- We promote a learning culture and encourage you to grow and learn.