Observability Manager
1 week ago
LOVE WHAT YOU DO? THERE IS A PLACE FOR YOU HERE
Be part of our diverse and inclusive team.
Job Summary
Job Responsibilities
Observability & Monitoring
- Lead the implementation and optimization of observability platforms (e.g., ITRS Geneos, Dynatrace, etc).
- Develop dashboards, alerts, and telemetry pipelines to ensure real-time visibility into system health and performance.
- Drive adoption of observability best practices across IT teams.
Automation & Operational Efficiency
- Automate routine operational tasks to reduce manual intervention and human error.
- Integrate observability tools with CI/CD pipelines and incident management platforms (e.g., ServiceNow).
- Promote the use of Infrastructure as Code (IaC) and configuration management tools (e.g., Ansible, Terraform).
Security & Compliance
- Collaborate with IT security teams to identify and remediate vulnerabilities through observability insights.
- Conduct security assessments and support remediation efforts.
- Promote secure configurations and compliance with standards such as SOX and PCI DSS.
Continuous Improvement & Innovation
- Stay current with industry trends, emerging technologies, and best practices in observability and IT operations.
- Lead initiatives to improve system reliability, reduce MTTR (Mean Time to Recovery), and enhance incident response capabilities.
- To work closely with Sands global Observability team to foster a culture of learning, collaboration, and operational excellence.
Job Requirements
Education & Certification
- Bachelor’s degree in computer science, Information Technology, or a related field (or equivalent practical experience).
- Preferred certifications: Azure Architect, CISSP, or equivalent.
Experience
- Minimum 8 years of experience in IT operations, systems monitoring, or infrastructure engineering.
- Familiarity with Site Reliability Engineering (SRE) principles and DevSecOps practices.
- Ability to identify and remediate vulnerabilities using observability insights.
- Knowledge of security frameworks and compliance standards (e.g., ISO 27001, PCI DSS).
- Experience with security monitoring, anomaly detection, and automated alerting.
Technical Skills
- Proven experience with observability and monitoring tools such as Dynatrace,ITRS Geneos or similar.
- Strong understanding of telemetry pipelines, log aggregation, metrics collection, and distributed tracing.
- Proficiency in scripting and automation (e.g. PowerShell) to support monitoring and alerting automation.
- Solid grasp of cloud infrastructure (Azure or Alicloud), network protocols, and operating systems (Linux/Windows).
- Working knowledge of Kubernetes (Azure Stack/Red Hat Open Shift) services would be beneficial
- Experience integrating observability with CI/CD pipelines and incident management platforms (e.g., ServiceNow).
- Experiences with relational databases (eg. SQL,MySQL, Oracle)
Other Prerequisites
- Strong analytical and problem-solving skills with a proactive, data-driven mindset.
- Excellent communication skills, with the ability to explain complex technical concepts to non-technical stakeholders.
- Demonstrated leadership in cross-functional teams and the ability to drive adoption of observability practices.
- Familiarity with ITIL processes and experience managing service requests, incidents, problems, and changes.
Marina Bay Sands is committed to building a diverse, equitable and inclusive workforce, providing equal opportunities as we grow our talent base to match our growth ambitions in Singapore. Our employees are committed to adhere to and abide by all rules, regulations, policies and procedures, including the rules of conduct and business ethics of the Company.
-
Observability Solutions Specialist
1 week ago
Singapore CISCO SYSTEMS (USA) PTE. LTD. Full timeObservability Solution Specialist Role The primary function of the Observability Solution Specialist organization is to unify, advance and expand the value of the Splunk portfolio by providing deep domain expertise to drive execution across Splunk's Observability Portfolio. The solution specialists are responsible for being the technical experts in...
-
Site Reliability Engineer
5 days ago
Singapore Krisvconsulting Services Pte Ltd Full timeOverview We are seeking talented and driven professionals to join our Site Reliability Engineering (SRE) team. This role involves helping organizations enhance the availability, performance, and resilience of their applications and services through the deployment and administration of Observability Platforms Responsibilities Deploy and manage Observability...
-
Senior Observability
2 days ago
Singapore GIC Full timeA leading global investment firm in Singapore seeks an experienced AVP / VP, Observability & SRE Engineering to develop enterprise observability and service reliability strategies. The ideal candidate will have expertise in Datadog and AWS, with a deep understanding of SRE principles. This role encompasses the management of observability platforms and...
-
Senior Infra Engineer: Automation
2 days ago
Singapore Tech Full timeA technology company in Singapore is seeking a skilled Ansible Automation and Elastic Observability Engineer for a 2-year contract. You will design and maintain infrastructure automation and observability solutions using the Elastic Stack. The ideal candidate has 3+ years of experience with Elasticsearch, Logstash, and proficiency in Python or Java. The role...
-
Information Technology Observability Engineer
2 weeks ago
Singapore Assurity Trusted Solutions Full timeAssurity Trusted Solutions (ATS) is a wholly owned subsidiary of the Government Technology Agency (GovTech). As a Trusted Partner over the last decade, ATS offers a comprehensive suite of products and services ranging from infrastructure and operational services, authentication services, governance and assurance services as well as managed processes. In a...
-
Singapore Assurity Trusted Solutions Pte Ltd Full timeAssurity Trusted Solutions (ATS) is a wholly owned subsidiary of the Government Technology Agency (GovTech). As a Trusted Partner over the last decade, ATS offers a comprehensive suite of products and services ranging from infrastructure and operational services, authentication services, governance and assurance services as well as managed processes. In a...
-
Backend Software Engineer
6 days ago
Singapore TikTok Full timeResponsibilities TikTok will be prioritizing applicants who have a current right to work in Singapore, and do not require TikTok's sponsorship of a visa. About TikTok TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris,...
-
Cloud Observability Platform Engineer
1 week ago
Singapore ByteDance Full timeA leading technology company in Singapore is looking for a Software Engineer to join its Cloud Native-Observability Team. The role involves delivering user-oriented operation products and developing cloud-native intelligent management platforms. Candidates should have 5+ years of experience, a Bachelor's in Computer Science, and proficiency in Go/Java. Join...
-
Singapore ByteDance Full timeResponsibilities About the Team: Join ByteDance's Cloud Native-Observability Team! We build the data infrastructure for the company's full-stack observability products and deliver VolcanoEngine's observability capabilities. Leveraging k8s, Docker, Spring, and time-series engine tech, we provide high-quality, cost-effective, and high-performance time-series...
-
Head of Observability Asia
3 days ago
Singapore Splunk Full time**Role**: Due to our expansive growth we are seeking an exceptional leader to join our team as Head of Observability for Asia coverings ASEAN, India, HK, Taiwan, China and Korea. In addition to passion, skills, and experience, you will have a proven track record in selling enterprise SaaS solutions, experience in successfully developing go to market...