Site Reliability Engineer

2 weeks ago


Singapur, Singapore Crystal Equation Corporation Full time

Overview We are seeking a skilled Site Reliability Engineer (SRE) to join our team. SRE will be responsible for keeping all internal user-facing applications and other production systems running smoothly. This hybrid role involves a combination of both development and operations skills to build and manage systems that are both efficient and reliable. The Enterprise Platforms Integration team (EPI) handles onboarding, deployment, ongoing support, automation and integrations of tools used by internal customer teams (engineering and non-engineering). We own over 30 third party applications with a combined user base of tens of thousands. EPI builds and supports the backend and infrastructure that the applications run on, supports users and collaborates with other teams to continue to refine, improve, and expand the products and their user bases. EPI has a rotating on-call assignment to help triage and resolve ad-hoc issues. In addition to this on-call rotation, our SREs are typically involved in any number of projects (to varying degrees) at any given time. These projects can often involve working with other internal teams as well as external vendors. Being able to effectively manage and maintain these cross-functional relationships while driving projects forward in a timely manner is critical. Thus, good interpersonal and organizational skills as well as an attention to detail are essential traits for success. While our environment is dynamic and our workload is high-demand, we foster the kind of collaboration and teamwork that has earned us a reputation for consistent success. If you want the next step of your career to include challenging work and supportive teammates, get in touch with us. Responsibilities Develop and maintain internal tooling that automates the provisioning, configuration and monitoring of the infrastructure and services. Collaborate with software engineers to make applications resilient and scalable. Participate in on‑call rotations to ensure system uptime and performance. Troubleshoot and resolve issues related to application development, deployment, and operations. Reduce operational toil by automating repetitive tasks. Develop and maintain documentation and diagrams detailing the operational architecture and flow of web traffic in multi‑tiered application environments. Conduct post‑mortem reviews of incidents and implement preventive measures. Qualifications Bachelor's degree in Computer Science or Information Systems. 2+ years of proven experience as a Site Reliability Engineer or in a similar hybrid and software engineering role. Proficiency in at least one programming language such as Python, Go, Java, or Rust. Good understanding and practical knowledge of the SDLC, design patterns, architecture patterns, SOLID principles, API maintenance, CI/CD. Experience with automation and configuration management tools such as Chef, Puppet, Ansible. Experience supporting modern services and web applications on Linux and Windows environments. Experience with cloud platforms (AWS preferred). Hands‑on experience with containerization technologies (Docker, Kubernetes). Strong problem‑solving skills, with an ability to troubleshoot complex system issues and keen attention to detail. Excellent communication skills, with the ability to collaborate effectively with a team and vendors. Seniority level Associate Employment type Full‑time Job function Consulting Industries IT Services and IT Consulting #J-18808-Ljbffr



  • Singapur, Singapore NetEase Games Full time

    Overview Join to apply for the Site Reliability Engineer role at NetEase Games . As a leading internet technology company based in China, NetEase provides premium online services centered around content creation and operates a broad gaming ecosystem. Job Description Site Reliability Engineering (SRE) refers to using software engineering methods to manage...


  • Singapur, Singapore APPLE SOUTH ASIA PTE. LTD. Full time

    Summary At Apple, new ideas have a way of becoming excellent products, services, and customer experiences very quickly. Bring passion and dedication to your job and there’s no telling what you could accomplish. The people here at Apple don’t just build products - they craft the kind of wonder that’s revolutionized entire industries. It’s the...


  • Singapur, Singapore PERSOL SINGAPORE PTE. LTD. Full time

    Overview Site Reliability Engineer (SRE) – An excellent Site Reliability Engineer (SRE) opportunity is available in a cutting-edge, fast-growing cloud environment. Job Purpose Deliver reliable, secure, and scalable cloud services by managing and optimizing AWS infrastructure. Job Responsibilities Manage and support AWS services, ensuring uptime,...


  • Singapur, Singapore PERSOL SINGAPORE PTE. LTD. Full time

    Cloud Site Reliability Engineer (AWS) An excellent Cloud Site Reliability Engineer opportunity has just arisen in a global brand supporting mission‑critical government systems. Job Purpose Ensure reliable, secure, and automated cloud operations supporting mission‑critical systems and compliance needs. Responsibilities Manage and support AWS cloud...


  • Singapur, Singapore Thales Full time

    Overview Join to apply for the Site Reliability Engineer role at Thales . Location: Singapore, Singapore Thales is a global technology leader trusted by governments, institutions, and enterprises to tackle their most demanding challenges. From quantum applications and artificial intelligence to cybersecurity and 6G innovation, our solutions empower critical...


  • Singapur, Singapore E-Solutions Full time

    Job Title: Site Reliability Engineer (SRE) Experience: 8+ years (including 3+ years in Java) About the Role: We’re looking for a skilled Site Reliability Engineer with strong Java and cloud-native development experience to design, build, and maintain reliable, scalable systems on Kubernetes and AWS. You’ll work closely with development and platform teams...


  • Singapur, Singapore Razer Inc. Full time

    Join to apply for the Site Reliability Engineer role at Razer Inc. 3 weeks ago Be among the first 25 applicants Joining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work , offering you the opportunity to make an impact globally while working across a team located across 5 continents. Razer is...


  • Singapur, Singapore TikTok Full time

    Overview Responsibilities About the team TikTok Shop is a content e-commerce business utilising international short video products as carriers. Our aim is to become the preferred choice for users seeking to discover and purchase affordable, high-quality products. We provide users with tailored, vibrant, and efficient consumption experiences while enabling...


  • Singapur, Singapore Manpower Singapore Full time

    Site Reliability Engineer - Global Support Apply for the Site Reliability Engineer - Global Support role at Manpower Singapore . Responsibilities Deploy and manage overseas games infrastructure, including game monitor system and login services. Monitor and dashboard game observability to ensure reliability, scalability, and security. Analyze game...


  • Singapur, Singapore EXASOFT PTE. LTD. Full time

    Job Summary: We are seeking a Senior Site Reliability Engineer (SRE) with 10–15 years of proven experience in building, managing, and maintaining highly available, scalable, and secure infrastructure across multi-cloud and hybrid cloud environments—including on-premises data centers . The ideal candidate will have deep knowledge of SRE principles ,...