Site Reliability Engineering Lead

2 weeks ago


Singapore Pinpoint Consulting Pte Ltd Full time

Our client is a leading web3 firm that offers a cutting‑edge, user‑friendly solution that combines industry‑leading security features with a powerful, intuitive interface in today's fast‑paced digital economy, managing your cryptocurrency assets with security and ease. Their platform and wallet empower you to store, send, and receive a wide range of digital assets effortlessly. Built with advanced encryption protocols to ensure your assets are always protected, giving you peace of mind in a constantly evolving market. They are presently expanding their business and looking for an experienced Site Reliability Engineer to join their exchange team. About the Role As an SRE Lead, forming and managing the SRE team will form part of the mandate. You will also need to establish a unified incident response system and promote a no‑responsibility review and systematic improvements. Key Responsibilities Strategy and Governance Team and Organization Cross‑team collaboration, working with R&D, architecture, DBA, network, security, legal/compliance, to drive the inclusion of reliability goals in the roadmap and KPIs. Platform and Engineering Implementation Exchange Scenario Special Project, like end‑to‑end latency SLI, matching confirmation and replay, serial number consistency and idempotence, isolation of hot trading pairs. Multi‑chain node operation and maintenance, congestion and reorg handling, MPC/HSM, risk control, and approval flow for coin withdrawal and deposit, closed loop for reconciliation errors. Security and Compliance: Audit of sensitive operations, meeting requirements such as SOC2/ISO 27001/PCI‑DSS. Requires Skills & Experience Over 8 years of experience in back‑end/platform/operation and maintenance engineering, over 4 years of SRE or production engineering experience, and over 2 years of team management/leadership experience. Having successful cases of stability governance and incident handling in high‑concurrency and low‑latency businesses (transactions/payments/advertising/large‑scale real‑time systems). SLO/SLI and incorrect budgeting practices, observability system construction (Prometheus/Grafana/ELK or similar, OpenTelemetry, Tracing). Kubernetes/Service Mesh, microservice gateway (Nginx/Envoy), CI/CD (GitHub Actions/GitLab CI, etc.), GitOps (Argo CD). Design and implementation of progressive delivery (Canary/Batch/feature Switch) and automatic rollback strategies. Data and Storage: MySQL/ Sharding/Replication and Failover, Redis/Kafka, Backup and Disaster Recovery Drills; Consistency and reconciliation thinking. Performance and Capacity Engineering: Stress testing, benchmarking, analysis, and tuning (flame diagram /CPU/GC/ Network /TCP kernel parameters, etc.). Event management: SEV grading, IM/IC command, cross‑team collaboration and communication, writing high‑quality retrospectives, and tracking action items. Preferred Experience Experience in exchange/matching/



  • Singapore JPMorganChase Full time

    Public Cloud SRE is responsible for engineering and operating the cloud infrastructure and platforms of JPMC ensuring reliability, resiliency, and security. We have a Senior Software Engineer, Site Reliability position to build the infrastructure and tooling for JPMC’s Public Cloud Platform. As a Lead Site Reliability Engineer at JPMorgan Chase within the...


  • Singapore ETEAM WORKFORCE PTE. LTD. Full time

    Position: Site Reliability Engineer (SRE) Work Mode - Onsite/Hybrid Timing - 9am to 6 pm Duration – 1 Year (Highly extendable) Salary: 6018 SGD Work Location: Robinson Road, Singapore About the Role We are looking for a seasoned Site Reliability Engineer (SRE) with 5+ years of experience to join our Platform Engineering team. This role is ideal for someone...


  • Singapore JPMorganChase Full time

    Lead Site Reliability Engineer, Electronic Trading Service Join to apply for the Lead Site Reliability Engineer, Electronic Trading Service role at JPMorgan Chase . Job Description Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability. As...


  • Singapore Qlik Full time

    **What makes us Qlik?** A Gartner® Magic Quadrant Leader for 14 years in a row, Qlik transforms complex data landscapes into actionable insights, driving strategic business outcomes. Serving over 40,000 global customers, our portfolio leverages pervasive data quality and advanced AI/ML capabilities that lead to better decisions, faster. We excel in...


  • Singapore TRUEWATCH TECHNOLOGY INC PTE. LTD. Full time

    **Responsibility**: - Run production environment by monitoring availability and taking a holistic view of the system health. - Achieve site reliability automation, minimize system downtime, and reduce site reliability cost. - Manage risks and resolves issues that affect the release scope, schedule and quality. - Suggest architecture improvements, push for...


  • Singapore Adyen Full time

    **This is Adyen** Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition. For our teams, we create an environment with opportunities for our people to succeed, backed by the...


  • Singapore APPLE SOUTH ASIA PTE. LTD. Full time

    Summary At Apple, new ideas have a way of becoming excellent products, services, and customer experiences very quickly. Bring passion and dedication to your job and there’s no telling what you could accomplish. The people here at Apple don’t just build products - they craft the kind of wonder that’s revolutionized entire industries. It’s the...


  • Singapore JJ Consulting Services Full time

    Our Client is a fast growing company in Singapore, who is seeking to recruit a Site Reliability Engineer. **Site Reliability Engineer** **Key Roles & Responsibilities** - Providing ancillary support of Enterprise-Grade Products and solutions at customer's sites - Ironing out deployment issues or challenges that our customers may face - Responsible for...


  • Singapore DT One Full time

    About DT One DT One was founded to provide mobile carriers with the infrastructure and services they need to help migrant workers stay in touch with their family and friends back home. Today we operate a leading global network for mobile top‑up solutions, innovative mobile rewards, and Phone‑to‑Phone solutions. Our global network delivers better...


  • Singapore Pinpoint Asia Full time

    Our client is a leading web3 firm that offers a cutting-edge, user-friendly solution that combines industry-leading security features with a powerful, intuitive interface in today's fast-paced digital economy, managing your cryptocurrency assets with security and ease. Their platform and wallet empower you to store, send, and receive a wide range of digital...