Senior Site Reliability Engineer

5 days ago


Singapore Vanguard Software Pte Ltd Full time

Job Summary

We are seeking a Senior Site Reliability Engineer (SRE) to join our growing engineering team. In this role, you will work independently to design, build, and optimize infrastructure and deployment pipelines that ensure the stability, scalability, and security of our systems. You will take full responsibility for automating workflows, improving observability, and enabling development teams to ship code faster and more safely. This is an excellent opportunity for an experienced engineer with at least 5 years of work experience who thrives on ownership, reliability, and technical leadership.

Key Responsibilities

  • Infrastructure & Automation: Design, implement, and maintain scalable cloud infrastructure using Infrastructure as Code (IaC) tools.
  • CI/CD Pipelines: Build and optimize automated pipelines for testing, deployment, and release management.
  • Monitoring & Reliability: Establish observability standards and implement monitoring, logging, and alerting systems to ensure system health.
  • Security & Compliance: Enforce best practices for cloud security, access control, and compliance across environments.
  • Collaboration: Partner with backend, frontend, and product teams to ensure smooth deployments and reliable system operations.
  • Process & Mentorship: Improve DevOps processes, share best practices, and mentor junior engineers.

Job Requirements

  • Bachelor's degree in Computing, Software Engineering, IT, or a related field.
  • Experience: Minimum 5 years of DevOps, Site Reliability Engineering (SRE), or related experience.
  • Tech Stack: Proficient with cloud platforms (AWS, GCP, or Azure), containerization (Docker, Kubernetes), IaC (Terraform, Ansible, Helm), and CI/CD tools (Jenkins, GitHub Actions, GitLab CI/CD, ArgoCD, etc.).
  • Systems Knowledge: Strong background in Linux administration, networking, and distributed systems.
  • Monitoring & Observability: Hands-on experience with tools like Prometheus, Grafana, ELK/EFK, or Datadog.
  • Scripting & Automation: Proficient in one or more languages (Python, Go, Bash, etc.).
  • Problem Solving: Skilled at diagnosing complex issues, ensuring high availability, and improving system performance.
  • System Design: Capable of designing fault-tolerant, secure, and scalable infrastructure with disaster recovery in mind.
  • Proficiency in written and spoken English and Mandarin is highly desirable, in order to liaise with Chinese-speaking clients and counterparts and understand their technical requirements.

Soft Skills

  • Team Mindset: Collaborate effectively across teams, proactively contributing to company goals.
  • Ownership: Take responsibility for infrastructure health and ensure continuous improvements.
  • Adaptability: Open to new technologies, evolving processes, and changing business needs.
  • Communication: Clearly explain technical topics to both engineers and non-technical stakeholders.

What We Offer

  • Technical Leadership Opportunities: Lead infrastructure design for high-impact projects and guide DevOps best practices.
  • Continuous Growth: Access to mentorship, certifications, and a clear career progression path.
  • High-Performance Collaboration: Work with a talented team in a modern DevOps environment (Agile/CI-CD, GitOps).
  • Flexibility and Trust: An open culture that values innovation, autonomy, and results-driven decision-making.



  • Singapore eTeam Full time

    Site Reliability Engineer (SRE). We are looking for a seasoned Site Reliability Engineer (SRE) with 5–10 years of experience to join our Platform Engineering team. This role is ideal for someone who thrives in a fast‐paced environment, is passionate about reliability, and enjoys solving complex challenges. You will play a key role in building...


  • Singapore Airwallex Full time

    Senior Site Reliability Engineer, Spend Foundations, at Airwallex. About...


  • Singapore EC1 Partners Full time

    Overview: EC1 Partners is working with a leading global eFX trading platform that is expanding its technology presence in Singapore. We are seeking an experienced Site Reliability Engineer (SRE) to join their team. This is a full-time, permanent role offering the opportunity to work in a fast-paced environment where scale, performance, and reliability are...


  • Singapore Qube Research & Technologies Full time

    DevOps / Site Reliability Engineer at Qube Research & Technologies. Qube Research & Technologies (QRT) is a global quantitative and systematic investment manager, operating in all liquid asset classes across the world. We are a technology and data driven group implementing a scientific approach to investing. Combining data, research,...


  • Singapore Crystal Equation Corporation Full time

    We are seeking a skilled Site Reliability Engineer (SRE) to join our team. The SRE will be responsible for keeping all internal user-facing applications and other production systems running smoothly. This hybrid role combines development and operations skills to build and manage systems that are both efficient and reliable. The Enterprise...


  • Singapore Oxford Knight Full time

    Senior Site Reliability Engineer - Singapore or Hong Kong. Salary: up to 250-275k SGD base. Summary: A high-frequency prop trading firm with offices worldwide is looking for a skilled Senior Site Reliability Engineer developer to support and maintain its Linux trading infrastructure on a day-to-day basis. This is a pivotal role where you will lead...


  • Singapore Ll Oefentherapie Full time

    At Oracle Cloud Infrastructure (OCI), we build the more intelligent future of cloud. OCI Sovereign Cloud is a team of smart, motivated, and diverse people that are focused on bringing the world's most important work to OCI. We build and operate our government, classified, and sovereign cloud regions to be reliable and high performance, just like our public...


  • Singapore DHATCH CONSULTANCY PTE. LTD. Full time

    Site Reliability Engineer: Preferred Qualifications
    - 3+ years of experience in site reliability engineering, DevOps, or software engineering roles.
    - Proven skills in:
      - Monitoring & alerting tools (Grafana, New Relic)
      - CI/CD pipelines (Git, Jenkins, GitHub Actions, etc.)
      - Container orchestration (Docker, Kubernetes)
      - Infrastructure-as-code...

  • Site Reliability

    1 day ago


    Singapore Canonical Full time

    Site Reliability / GitOps Engineer at Canonical. Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely...


  • Singapore Manpower Singapore Full time

    Site Reliability Engineer - Global Support. Responsibilities: Deploy and manage overseas games infrastructure, including the game monitor system and login services. Monitor and dashboard game observability to ensure reliability, scalability, and security. Analyze game...