Site Reliability Engineer
6 days ago
Overview We are seeking talented and driven professionals to join our Site Reliability Engineering (SRE) team. This role involves helping organizations enhance the availability, performance, and resilience of their applications and services through the deployment and administration of Observability Platforms Responsibilities Deploy and manage Observability platforms and agents for ingesting metrics, logs, and traces from various sources. Parse and organize logs to extract relevant fields and data for processing and filtering. Assist developers in instrumenting application code to collect custom Application Performance Monitoring (APM) data, record, script, and manage synthetic monitors for testing purposes. Capture user sessions and data for real user monitoring (RUM). Set up alerts and notifications for proactive monitoring. Generate dashboards, visualizations, and reports to provide actionable insights. Participate in and support root cause analysis (RCA) and application/service profiling sessions. Educate and assist teams in leveraging observability tools effectively. Qualifications Diploma or Degree in Computer Science, Information Technology, or related disciplines, at least 2-5 years of experience working with modern observability platforms. Familiarity with observability concepts and standards such as OpenTelemetry . Experience with observability tools like the Elastic Stack for monitoring cloud infrastructure and application performance. Knowledge of developing, instrumenting, and profiling applications to enhance performance and reliability. Observability Certifications - Elastic Certified Observability Engineer, Dynatrace Associate/Professional, Splunk O11y Cloud Certified Metrics User. Cloud/Developer Certifications - AWS Developer Associate, Azure Developer Associate. #J-18808-Ljbffr
-
Site Reliability Engineer
5 days ago
Singapore TRUEWATCH TECHNOLOGY INC PTE. LTD. Full time**Responsibility**: - Run production environment by monitoring availability and taking a holistic view of the system health. - Achieve site reliability automation, minimize system downtime, and reduce site reliability cost. - Manage risks and resolves issues that affect the release scope, schedule and quality. - Suggest architecture improvements, push for...
-
Site Reliability Engineer
2 weeks ago
Singapore ETEAM WORKFORCE PTE. LTD. Full timePosition: Site Reliability Engineer (SRE) Work Mode - Onsite/Hybrid Timing - 9am to 6 pm Duration – 1 Year (Highly extendable) Salary: 6018 SGD Work Location: Robinson Road, Singapore About the Role We are looking for a seasoned Site Reliability Engineer (SRE) with 5+ years of experience to join our Platform Engineering team. This role is ideal for someone...
-
Site Reliability Engineer
7 days ago
Singapore JJ Consulting Services Full timeOur Client is a fast growing company in Singapore, who is seeking to recruit a Site Reliability Engineer. **Site Reliability Engineer** **Key Roles & Responsibilities** - Providing ancillary support of Enterprise-Grade Products and solutions at customer's sites - Ironing out deployment issues or challenges that our customers may face - Responsible for...
-
Site Reliability Engineer
3 days ago
Singapore Qlik Full time**What makes us Qlik?** A Gartner® Magic Quadrant Leader for 14 years in a row, Qlik transforms complex data landscapes into actionable insights, driving strategic business outcomes. Serving over 40,000 global customers, our portfolio leverages pervasive data quality and advanced AI/ML capabilities that lead to better decisions, faster. We excel in...
-
Site Reliability Engineer
3 days ago
Singapore Adyen Full time**This is Adyen** Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition. For our teams, we create an environment with opportunities for our people to succeed, backed by the...
-
Site Reliability Engineer
2 weeks ago
Singapore Crystal Equation Corporation Full timeWe are seeking a skilled Site Reliability Engineer (SRE) to join our team. SRE will be responsible for keeping all internal user-facing applications and other production systems running smoothly. This hybrid role involves a combination of both development and operations skills to build and manage systems that are both efficient and reliable. The Enterprise...
-
Site Reliability Engineer
2 weeks ago
Singapore Point72 Full timeJoin to apply for the Site Reliability Engineer role at Point72 About the role As part of Point72’s Technology Team, you will focus on developing and maintaining complex, distributed, real-time systems that support our Global Macro business. Your responsibilities will include optimizing operations through automation, building foundational SRE components,...
-
Site Reliability Engineer
6 days ago
Singapore APPLE SOUTH ASIA PTE. LTD. Full timeSummary At Apple, new ideas have a way of becoming excellent products, services, and customer experiences very quickly. Bring passion and dedication to your job and there’s no telling what you could accomplish. The people here at Apple don’t just build products - they craft the kind of wonder that’s revolutionized entire industries. It’s the...
-
Site Reliability Engineer
2 weeks ago
Singapore DT One Full timeAbout DT One DT One was founded to provide mobile carriers with the infrastructure and services they need to help migrant workers stay in touch with their family and friends back home. Today we operate a leading global network for mobile top‑up solutions, innovative mobile rewards, and Phone‑to‑Phone solutions. Our global network delivers better...
-
Site Reliability Engineer
4 days ago
Singapore Second Talent Full timeInfrastructure Platform Development Design, build, and enhance infrastructure operation platforms Develop and maintain systems for infrastructure management, CI/CD pipelines, monitoring/alerting, and centralized logging Drive platform standardization and automation initiatives High Availability & Reliability Ensure maximum uptime for production services...