Site Reliability Engineer
7 days ago
Job Title: Site Reliability Engineer Location: Singapore Job Type: Full-timeResponsibility: Cluster Operations & ManagementManage and maintain container clusters (Kubernetes, Docker) and open-source component clusters (Kafka, Redis, Elasticsearch) across multiple business unitsEnsure optimal performance, scalability, and reliability of distributed systemsInfrastructure Platform DevelopmentDesign, build, and enhance infrastructure operation platformsDevelop and maintain systems for infrastructure management, CI/CD pipelines, monitoring/alerting, and centralized loggingDrive platform standardization and automation initiativesHigh Availability & ReliabilityEnsure maximum uptime for production services through proactive monitoring and incident responseContinuously optimize service architecture, deployment strategies, and operational processesImplement and maintain SLA/SLO frameworks and reliability engineering practicesAutomation & Process ImprovementLead the development of automated operations and maintenance systemsCreate self-service tools and workflows to improve team productivityEstablish best practices for infrastructure such as code and configuration management Required Qualifications Experience & Education2+ years of hands-on experience in Systems Operations, DevOps, or Site Reliability Engineering (SRE)Bachelor's degree in Computer Science, Engineering, or related technical field preferred Cloud & InfrastructureExperience with public cloud platforms (AWS, Azure, or GCP) is highly valuedStrong understanding of large-scale internet architecture and distributed systemsProven experience with infrastructure monitoring, logging, and observability tools Technical SkillsProficiency in scripting and automation using Shell, Python, or similar languagesStrong knowledge of containerization technologies (Kubernetes, Docker)Hands-on experience operating production-grade container clusters and managing CI/CD pipelinesStrong familiarity with common infrastructure components: Nginx, MySQL, Redis, Kafka, Elasticsearch Advanced Networking (Preferred) Experience with Service Mesh architectures, Cilium CNI, and eBPF technologiesUnderstanding network security, load balancing, and traffic managementKnowledge of cloud-native networking patterns and best practices
-
Site Reliability Engineer
2 weeks ago
Singapore TRUEWATCH TECHNOLOGY INC PTE. LTD. Full time**Responsibility**: - Run production environment by monitoring availability and taking a holistic view of the system health. - Achieve site reliability automation, minimize system downtime, and reduce site reliability cost. - Manage risks and resolves issues that affect the release scope, schedule and quality. - Suggest architecture improvements, push for...
-
Site Reliability Engineer
2 days ago
North-East Singapore PERSOLKELLY Full timeThe Site Reliability Engineer is responsible for ensuring the reliability, scalability, and efficiency of our systems and infrastructure. This role involves monitoring, troubleshooting, and resolving issues to maintain optimal performance. The engineer will also collaborate with cross-functional teams to automate processes and improve system reliability....
-
Site Reliability Engineer
2 weeks ago
Singapore Qlik Full time**What makes us Qlik?** A Gartner® Magic Quadrant Leader for 14 years in a row, Qlik transforms complex data landscapes into actionable insights, driving strategic business outcomes. Serving over 40,000 global customers, our portfolio leverages pervasive data quality and advanced AI/ML capabilities that lead to better decisions, faster. We excel in...
-
Site Reliability Engineer
2 weeks ago
Singapore Adyen Full time**This is Adyen** Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition. For our teams, we create an environment with opportunities for our people to succeed, backed by the...
-
Site Reliability Engineer
1 week ago
Singapore Viasat Full timeAbout us One team. Global challenges. Infinite opportunities. At Viasat, we’re on a mission to deliver connections with the capacity to change the world. For more than 35 years, Viasat has helped shape how consumers, businesses, governments and militaries around the globe communicate. We’re looking for people who think big, act fearlessly, and create an...
-
Site Reliability Engineer
3 days ago
Singapore RigNet Full timeAbout us One team. Global challenges. Infinite opportunities. At Viasat, we're on a mission to deliver connections with the capacity to change the world. For more than 35 years, Viasat has helped shape how consumers, businesses, governments and militaries around the globe communicate. We're looking for people who think big, act fearlessly, and create an...
-
Site reliability Engineering
3 days ago
Singapore RigNet Full timeAbout us One team. Global challenges. Infinite opportunities. At Viasat, we're on a mission to deliver connections with the capacity to change the world. For more than 35 years, Viasat has helped shape how consumers, businesses, governments and militaries around the globe communicate. We're looking for people who think big, act fearlessly, and create an...
-
Site Reliability Engineer
7 days ago
Singapore Viasat Full timeAbout us One team. Global challenges. Infinite opportunities. At Viasat, we're on a mission to deliver connections with the capacity to change the world. For more than 35 years, Viasat has helped shape how consumers, businesses, governments and militaries around the globe communicate. We're looking for people who think big, act fearlessly, and create an...
-
Site Reliability Engineer
5 days ago
Singapore NTT Data Singapore Full time $120,000 - $200,000 per yearAs a Site Reliability Engineer you will be filling a mission-critical role ensuring that our systems are healthy, monitored, automated, fault tolerant and designed to scale. You will collaborate and work closely with engineering teams to continually improve our production services, facilitating fast delivery of new products, and reducing downtime. Key...
-
Site Reliability Engineer
6 days ago
Singapore Rapsys Technologies Full timeDrive the Site Reliability Engineering agenda forward at an Enterprise Level to improve availability, reliability, and performance of services. - Drive cross-team efforts in resiliency assessment exercises and reporting - Draft and/or contribute to internal SRE training materials - Support services before they go live through activities such as Chaos testing...