Site Reliability Engineer

9 hours ago


Singapore Unison Group Full time

Site Reliability Engineer (SRE) - OpenShift & Release

We are looking for an experienced DevOps & Release Management Engineer to manage patching, infrastructure upgrades, OpenShift platforms, and enterprise‑scale release processes. The ideal candidate will have strong expertise in Linux, container platforms, automation, CI/CD, and cross‑functional collaboration to ensure secure, stable, and efficient application delivery.

Key Responsibilities

Patch & Infrastructure Management
- Implement and manage comprehensive patch management strategies for operating systems, applications, and network devices to ensure security and compliance.
- Plan, execute, and oversee infrastructure upgrades across hardware, software, and network components while minimizing downtime and ensuring compatibility.
- Develop and enhance pre- and post‑patching processes to ensure zero‑disruption execution and risk‑based patch prioritization.
- Support Linux environment scaling, tuning, automation, patching, and compliance auditing.
- Administer and maintain operating systems, network infrastructure, and security patches.

OpenShift (OCP) Administration
- Deploy, manage, and maintain OpenShift Container Platform (OCP) clusters, including installation, configuration, scaling, and troubleshooting.
- Perform OpenShift cluster maintenance such as upgrades, patching, monitoring, and performance optimization.
- Monitor cluster health and ensure high availability, reliability, and compliance with enterprise standards.

Network & Systems Operations
- Monitor and manage network performance, capacity, and security.
- Troubleshoot network, hardware, and software‑related issues to ensure business continuity.
- Ensure all network changes and upgrades comply with security policies, best practices, and organizational standards.

Release Management
- Lead the planning, coordination, and execution of software releases across environments and teams.
- Develop and manage release plans, schedules, and budgets aligned with business goals.
- Design and implement automated build, test, and deployment pipelines to accelerate software delivery.
- Manage and maintain version control systems (e.g., Git), ensuring proper branching, merging, and tagging strategies.
- Coordinate release activities with Development, QA, and Operations to ensure smooth and timely deployments.
- Troubleshoot build and deployment failures, identifying root causes and implementing preventive actions.
- Maintain clear and updated documentation of release processes, pipelines, and tools.
- Implement and enforce CI/CD best practices to ensure consistency and reliability.
- Monitor production environments post‑release to ensure stability and address immediate issues when required.
- Manage risks impacting release scope, schedule, and quality, escalating issues when necessary.
- Ensure adherence to technical standards and governance requirements, including prototype vehicle build support when applicable.
- Lead complex deployments in distributed, load‑balanced, and service‑oriented architectures.

Required Skills & Experience
- Strong experience in OpenShift or Kubernetes administration
- Hands‑on experience with patch management and infrastructure updates
- Good understanding of Linux systems, networking, and security concepts
- Expertise in CI/CD pipelines, automation, and DevOps tools
- Proficiency in Git and version control management
- Strong troubleshooting and problem‑solving skills
- Ability to work in cross‑functional teams and manage release cycles end‑to‑end

Seniority level: Mid‑Senior level
Employment type: Full‑time
Job function: Other
Industries: IT Services and IT Consulting



  • Singapore TRUEWATCH TECHNOLOGY INC PTE. LTD. Full time

    **Responsibility**: - Run the production environment by monitoring availability and taking a holistic view of system health. - Achieve site reliability automation, minimize system downtime, and reduce site reliability cost. - Manage risks and resolve issues that affect the release scope, schedule, and quality. - Suggest architecture improvements, push for...


  • Singapore ETEAM WORKFORCE PTE. LTD. Full time

    Position: Site Reliability Engineer (SRE) Work Mode - Onsite/Hybrid Timing - 9am to 6 pm Duration – 1 Year (Highly extendable) Salary: 6018 SGD Work Location: Robinson Road, Singapore About the Role We are looking for a seasoned Site Reliability Engineer (SRE) with 5+ years of experience to join our Platform Engineering team. This role is ideal for someone...


  • Singapore JJ Consulting Services Full time

    Our client is a fast-growing company in Singapore that is seeking to recruit a Site Reliability Engineer. **Site Reliability Engineer** **Key Roles & Responsibilities** - Providing ancillary support for enterprise-grade products and solutions at customers' sites - Ironing out deployment issues or challenges that our customers may face - Responsible for...


  • Singapore Qlik Full time

    **What makes us Qlik?** A Gartner® Magic Quadrant Leader for 14 years in a row, Qlik transforms complex data landscapes into actionable insights, driving strategic business outcomes. Serving over 40,000 global customers, our portfolio leverages pervasive data quality and advanced AI/ML capabilities that lead to better decisions, faster. We excel in...


  • Singapore Adyen Full time

    **This is Adyen** Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition. For our teams, we create an environment with opportunities for our people to succeed, backed by the...


  • Singapore ABAXX SINGAPORE PTE. LTD. Full time

    Site Reliability Engineer - Networking We are seeking a competent candidate to join our Infrastructure Team, whose mission is building and operating a MAS-regulated marketplace and clearing house. This role is ideal for someone with a strong foundation in AWS services, infrastructure as code, and cloud security, who is passionate about building scalable, secure,...


  • Singapore Crystal Equation Corporation Full time

    We are seeking a skilled Site Reliability Engineer (SRE) to join our team. The SRE will be responsible for keeping all internal user-facing applications and other production systems running smoothly. This hybrid role combines development and operations skills to build and manage systems that are both efficient and reliable. The Enterprise...


  • Singapore Point72 Full time

    About the role As part of Point72’s Technology Team, you will focus on developing and maintaining complex, distributed, real-time systems that support our Global Macro business. Your responsibilities will include optimizing operations through automation, building foundational SRE components,...


  • Singapore APPLE SOUTH ASIA PTE. LTD. Full time

    Summary At Apple, new ideas have a way of becoming excellent products, services, and customer experiences very quickly. Bring passion and dedication to your job and there’s no telling what you could accomplish. The people here at Apple don’t just build products - they craft the kind of wonder that’s revolutionized entire industries. It’s the...


  • Singapore DT One Full time

    About DT One DT One was founded to provide mobile carriers with the infrastructure and services they need to help migrant workers stay in touch with their family and friends back home. Today we operate a leading global network for mobile top‑up solutions, innovative mobile rewards, and Phone‑to‑Phone solutions. Our global network delivers better...