Site Reliability Engineer
2 weeks ago
We are seeking talented and driven professionals to join our Site Reliability Engineering (SRE) team. This role involves helping organizations enhance the availability, performance, and resilience of their applications and services through the deployment and administration of Observability Platforms.
Key Responsibilities
- Deploy and manage Observability platforms and agents for ingesting metrics, logs, and traces from various sources.
- Parse and organize logs to extract relevant fields and data for processing and filtering.
- Assist developers in instrumenting application code to collect custom Application Performance Monitoring (APM) data.
- Record, script, and manage synthetic monitors for testing purposes.
- Capture user sessions and data for real user monitoring (RUM).
- Set up alerts and notifications for proactive monitoring.
- Generate dashboards, visualizations, and reports to provide actionable insights.
- Participate in and support root cause analysis (RCA) and application/service profiling sessions.
- Educate and assist teams in leveraging observability tools effectively.
Requirements
- Diploma or Degree in Computer Science, Information Technology, or related disciplines.
- Experience working with modern observability platforms
Preferred Qualifications
Observability Certifications:
- Elastic Certified Observability Engineer.
- Dynatrace Associate/Professional.
- Splunk O11y Cloud Certified Metrics User.
Cloud/Developer Certifications:
- AWS Developer Associate.
- Azure Developer Associate
Scalability
Splunk
Kubernetes
Azure
AWS
Root Cause Analysis
Scripting
Administration
Information Technology
Reliability
Reliability Engineering
Networking
Python
Telemetry
Dynatrace
Docker
Ansible
Java
Linux
-
Site Reliability Engineer
7 days ago
Singapore This is an IT support group Full timeAs a key member of our team, you will play a crucial role in ensuring the reliability and efficiency of our plant's utilities and facilities.About the RoleThe Site Reliability Engineer (SRE) ensures that all utilities and facilities within the plant are functioning optimally. This includes managing utilities such as water, electricity, HVAC, compressed air,...
-
Site Reliability Engineer Leader
13 hours ago
Singapore OCBC Full timeJob Description:We are seeking a Site Reliability Engineer Leader to join our team at OCBC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our infrastructure. This role requires strong expertise in automating releases, continuous integration/delivery systems, and relevant infrastructure...
-
Site Reliability Engineer
2 weeks ago
Singapore FUNFLY PTE. LTD. Full timeRoles & ResponsibilitiesPosition OverviewAs a site reliability engineer, you will be responsible for ensuring the smooth operation of game services by maintaining, monitoring, and responding to faults daily. They will develop automation tools to enhance operational efficiency and manage game servers for optimal performance. The role includes collaborating...
-
Senior Site Reliability Engineer
2 weeks ago
Singapore GK CONSULTING PTE. LTD. Full timeRoles & ResponsibilitiesWe're seeking an experienced Senior Site Reliability Engineer to ensure the reliability, availability, and performance of our cloud-based internet services.Key Responsibilities1. Own reliability, availability, and user experience for assigned cloud services2. Develop and implement service governance initiatives to increase reliability...
-
Site Reliability Engineer
3 weeks ago
Singapore TRINITY CONSULTING SERVICES PTE. LTD. Full timeRoles & Responsibilities· Must have minimum 5 years' experience.· Strong technical knowledge and experience in supporting enterprise-level applications.· Proficiency in troubleshooting application issues, performing log analysis, and using monitoring tools.· Experience with databases and SQL query language.· Familiarity with software development life...
-
Site Reliability Engineer
1 week ago
Singapore FLOWDESK ASIA PTE. LTD. Full timeRoles & ResponsibilitiesAbout the jobAre you passionate about maintaining robust and high-performing infrastructures? Do you thrive in managing complex network environments and ensuring system reliability?Join our infrastructure team and help us elevate operational excellence to new heights.As a Site Reliability Engineer at Flowdesk, you will be at the heart...
-
Site Reliability Engineer
1 week ago
Singapore HELLO PLANET PTE. LTD. Full timeRoles & ResponsibilitiesWe are a global dating app created to give everyone a chance at love. The sense of belonging and connectedness we get from relationships helps us survive and thrive, and we're working to make it a little easier for people to find that. We're inspired by the stories we hear from employees, friends, and family who have used our app to...
-
Site Reliability Engineer
1 week ago
Singapore PATSNAP PTE. LTD. Full timeRoles & ResponsibilitiesAbout the RoleWe are looking for a skilled and experienced DevOps Engineer / Site ReliabilityEngineer (SRE) to ensure the high availability, stability, and performance of ourbusiness platform. This role will be responsible for designing and implementing scalableand maintainable DevOps architecture and automation systems to...
-
Site Reliability Manager
4 days ago
Singapore Oxford Knight Full timeRequirementsOxford Knight seeks a highly motivated and experienced Senior Site Reliability Engineer with a strong background in Linux administration, cloud computing, and programming languages (preferably Python). The ideal candidate should have a degree in Computer Science or a related field and excellent communication skills.Key Skills and Qualifications5+...
-
Site Reliability Engineer
4 days ago
Singapore TIKTOK PTE. LTD. Full timeRoles & ResponsibilitiesTikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo.Why Join UsAt TikTok, our people are humble, intelligent, compassionate and creative. We create...
-
Site Reliability Engineer
5 days ago
Singapore Gravitas Recruitment Group Full timeOur client, a leading investor in financial markets, are looking for an autonomous, critical thinking, Site Reliability Engineer to join their team in Singapore. The ideal candidate must have a strong academic background, having graduated from a top university with a bachelor's degree in computer science. This degree should have been applied...
-
Site Reliability Engineer
4 weeks ago
Singapore SCIENTE INTERNATIONAL PTE. LTD. Full timeRoles & ResponsibilitiesJob Summary:We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. As an SRE, you will play a critical role in enhancing system reliability, performance, and scalability while ensuring the seamless functioning of our production environments. This is an opportunity to work in a fast-paced, dynamic environment...
-
GEL – Site Reliability Engineer
3 weeks ago
Singapore TOSS-EX PTE. LTD. Full timeRoles & ResponsibilitiesRoles & ResponsibilitiesJob PurposeThe Site Reliability Engineer (SRE) combines software development and system engineering to build and run distributed solutions in a secured multi-tier heterogeneous environment to safeguard, provide and continuously improve the software and systems behind the organization’s cloud platform...
-
GEL – Site Reliability Engineer
3 weeks ago
Singapore TOSS-EX PTE. LTD. Full timeRoles & ResponsibilitiesRoles & ResponsibilitiesJob PurposeThe Site Reliability Engineer (SRE) combines software development and system engineering to build and run distributed solutions in a secured multi-tier heterogeneous environment to safeguard, provide and continuously improve the software and systems behind the organization's cloud platform...
-
Senior Site Reliability Engineer
5 days ago
Singapore GXS Bank Full timeAbout the Team: Our team treats infrastructure and operations as software engineering problems. We are responsible for building and progressing software platforms that enable the provisioning and management of all Digibank services in safe, reliable, and scalable ways. We consistently challenge the status quo and use new technologies to build platforms...
-
Site Reliability Engineer
2 weeks ago
Singapore SOURCEO PTE. LTD. Full timeRoles & ResponsibilitiesRequired Expertise and ExperienceAt least 3 years of experience in SRE, DevOps, or a related engineering role. Proficiency in Infrastructure as Code (IaC) using Terraform to manage complex infrastructure. Hands-on experience with log analytics and observability tools, including ELK (Elasticsearch, Logstash, Kibana) and the Grafana...
-
Site Reliability Engineer
4 weeks ago
Singapore BYTEDANCE PTE. LTD. Full timeRoles & ResponsibilitiesAbout Doubao (Seed)Founded in 2023, the ByteDance Doubao (Seed) Team, is dedicated to pioneering advanced AI foundation models. Our goal is to lead in cutting-edge research and drive technological and societal advancements.With a strong commitment to AI, our research areas span deep learning, reinforcement learning, Language, Vision,...
-
Associate VP
3 days ago
Singapore DBS Bank Limited Full timeBusiness Function Group Technology enables and empowers the bank with an efficient, nimble and resilient infrastructure through a strategic focus on productivity, quality & control, technology, people capability and innovation. In Group Technology, we manage the majority of the Bank's operational processes and inspire to delight our business partners...
-
Reliability Engineer
4 weeks ago
Singapore UNITED MICROELECTRONICS CORPORATION (SINGAPORE BRANCH) Full timeRoles & ResponsibilitiesJob Summary:We are seeking a motivated and detail-oriented Fab Reliability Engineer to join our dynamic team. The ideal candidate will play a crucial role in ensuring the reliability and performance of our manufacturing processes. This position involves process qualification, conformance, process change management, new process...
-
Associate VP
4 days ago
Singapore DBS Bank Limited Full timeBusiness Function Group Technology enables and empowers the bank with an efficient, nimble and resilient infrastructure through a strategic focus on productivity, quality & control, technology, people capability and innovation. In Group Technology, we manage the majority of the Bank's operational processes and inspire to delight our business partners...