Senior Site Reliability Engineer

5 days ago


Singapore Hyphen Connect Full time

Overview Senior Site Reliability Engineer (Crypto Exchange) – Hyphen Connect Join to apply for the Senior Site Reliability Engineer (Crypto Exchange) role at Hyphen Connect. We are working with a decentralised exchange which looks to innovate on providing the best of CEXs and DEXs, focusing on building a safe, simple and scalable platform for trading. They differentiate themselves by offering institutional level systems and support whilst remaining on-chain and decentralised. Responsibilities Design, implement, and maintain scalable infrastructure for a high‐performance, low‐latency trading platform. Operate and enhance Kubernetes and Nomad‐based environments to ensure system stability, scalability, and security. Develop infrastructure automation and deployment pipelines using Terraform, Ansible, ArgoCD, and GitHub Actions. Collaborate with engineering teams to streamline service onboarding, automate repetitive tasks, and improve deployment efficiency. Enhance observability and reliability through improved logging, metrics, tracing, and alerting using the Grafana ecosystem. Perform root cause analysis and postmortems for production incidents, driving continuous improvements in system resilience and incident response. Work with security and compliance teams to ensure infrastructure meets regulatory and organizational standards. Support multi‐environment deployments (dev, staging, testnet, mainnet) with a focus on safe rollouts, rollbacks, and configuration management. Contribute to capacity planning, cost optimization, and infrastructure scaling strategies to support platform growth. Qualifications 5+ years of relevant experience as DevOps/ SRE Engineers. Proven ability to participate in an on‐call rotation, demonstrating ownership in incident response and a focus on long‐term system stability. Extensive experience operating and maintaining low‐latency, distributed systems in production environments. Proficiency with cloud‐native platforms and container orchestration tools, including AWS, GCP, Kubernetes, and Nomad. Strong knowledge of Linux/Unix internals and the TCP/IP networking stack. Proficiency in one or more of: Bash, Go, or Python. Expertise in root cause analysis, performance tuning, and system‐level debugging in complex service architectures. Experience building and managing end‐to‐end infrastructure, including infrastructure as code, CI/CD pipelines, and monitoring systems. Familiarity with modern GitOps workflows and tools such as GitHub Actions, ArgoCD, Argo Workflows, and Argo Events. Ability to own production systems end‐to‐end, from infrastructure as code to automated monitoring and deployment workflows. Pragmatic approach with a focus on depth, ownership, and a bias for action over broad familiarity. Bonus: Experience with the Aeron messaging system is a strong advantage. Details Seniority level: Mid‐Senior level Employment type: Full‐time Job function: Engineering and Information Technology Industries: Staffing and Recruiting #J-18808-Ljbffr



  • Singapore Canonical Full time

    Senior Site Reliability / Gitops Engineer Join to apply for the Senior Site Reliability / Gitops Engineer role at Canonical Senior Site Reliability / Gitops Engineer 1 day ago Be among the first 25 applicants Join to apply for the Senior Site Reliability / Gitops Engineer role at Canonical Canonical is a leading provider of open source software and operating...


  • Singapore GK CONSULTING PTE. LTD. Full time

    We're seeking an experienced Senior Site Reliability Engineer to ensure the reliability, availability, and performance of our cloud-based internet services. Key Responsibilities 1. Own reliability, availability, and user experience for assigned cloud services 2. Develop and implement service governance initiatives to increase reliability and user...


  • Singapore Airwallex Full time

    Senior Site Reliability Engineer, Spend Foundations Join to apply for the Senior Site Reliability Engineer, Spend Foundations role at Airwallex Senior Site Reliability Engineer, Spend Foundations Join to apply for the Senior Site Reliability Engineer, Spend Foundations role at Airwallex Get AI-powered advice on this job and more exclusive features. About...


  • Singapore Qube Research & Technologies Full time

    Join to apply for the DevOps /Site Reliability Engineer role at Qube Research & Technologies Qube Research & Technologies (QRT) is a global quantitative and systematic investment manager, operating in all liquid asset classes across the world. We are a technology and data driven group implementing a scientific approach to investing. Combining data, research,...


  • Singapore Crystal Equation Corporation Full time

    We are seeking a skilled Site Reliability Engineer (SRE) to join our team. SRE will be responsible for keeping all internal user-facing applications and other production systems running smoothly. This hybrid role involves a combination of both development and operations skills to build and manage systems that are both efficient and reliable. The Enterprise...


  • Singapore TRUEWATCH TECHNOLOGY INC PTE. LTD. Full time

    **Responsibility**: - Run production environment by monitoring availability and taking a holistic view of the system health. - Achieve site reliability automation, minimize system downtime, and reduce site reliability cost. - Manage risks and resolves issues that affect the release scope, schedule and quality. - Suggest architecture improvements, push for...

  • Site Reliability

    2 weeks ago


    Singapore Canonical Full time

    Join to apply for the Site Reliability / Gitops Engineer role at Canonical 1 day ago Be among the first 25 applicants Join to apply for the Site Reliability / Gitops Engineer role at Canonical Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely...


  • Singapore Canonical Full time

    Overview Join to apply for the Senior Site Reliability Engineer role at Canonical . Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform is Ubuntu, widely used in enterprise initiatives such as public cloud, data science, AI, engineering innovation and IoT. We have 1200+...


  • Singapore Shopify Full time

    Company Description Shopify is the leading omni-channel commerce platform. Merchants use Shopify to design, set up, and manage their stores across multiple sales channels, including mobile, web, social media, marketplaces, brick-and-mortar locations, and pop-up shops. The platform also provides merchants with a powerful back-office and a single view of...


  • Singapore NetEase Games Full time

    Overview Join to apply for the Site Reliability Engineer role at NetEase Games . As a leading internet technology company based in China, NetEase, Inc. (NASDAQ: NTES and HKEX:9999) provides premium online services centered around content creation. NetEase develops and operates games and related services, with in-house R&D capabilities in China and globally....