Site Reliability Engineering Lead

3 weeks ago


Singapur, Singapore Pinpoint Consulting Pte Ltd Full time

Our client is a leading web3 firm that offers a cutting‑edge, user‑friendly solution that combines industry‑leading security features with a powerful, intuitive interface in today's fast‑paced digital economy, managing your cryptocurrency assets with security and ease. Their platform and wallet empower you to store, send, and receive a wide range of digital assets effortlessly. Built with advanced encryption protocols to ensure your assets are always protected, giving you peace of mind in a constantly evolving market. They are presently expanding their business and looking for an experienced Site Reliability Engineer to join their exchange team. About the Role As an SRE Lead, forming and managing the SRE team will form part of the mandate. You will also need to establish a unified incident response system and promote a no‑responsibility review and systematic improvements. Key Responsibilities Strategy and Governance Team and Organization Cross‑team collaboration, working with R&D, architecture, DBA, network, security, legal/compliance, to drive the inclusion of reliability goals in the roadmap and KPIs. Platform and Engineering Implementation Exchange Scenario Special Project, like end‑to‑end latency SLI, matching confirmation and replay, serial number consistency and idempotence, isolation of hot trading pairs. Multi‑chain node operation and maintenance, congestion and reorg handling, MPC/HSM, risk control, and approval flow for coin withdrawal and deposit, closed loop for reconciliation errors. Security and Compliance: Audit of sensitive operations, meeting requirements such as SOC2/ISO 27001/PCI‑DSS. Requires Skills & Experience Over 8 years of experience in back‑end/platform/operation and maintenance engineering, over 4 years of SRE or production engineering experience, and over 2 years of team management/leadership experience. Having successful cases of stability governance and incident handling in high‑concurrency and low‑latency businesses (transactions/payments/advertising/large‑scale real‑time systems). SLO/SLI and incorrect budgeting practices, observability system construction (Prometheus/Grafana/ELK or similar, OpenTelemetry, Tracing). Kubernetes/Service Mesh, microservice gateway (Nginx/Envoy), CI/CD (GitHub Actions/GitLab CI, etc.), GitOps (Argo CD). Design and implementation of progressive delivery (Canary/Batch/feature Switch) and automatic rollback strategies. Data and Storage: MySQL/ Sharding/Replication and Failover, Redis/Kafka, Backup and Disaster Recovery Drills; Consistency and reconciliation thinking. Performance and Capacity Engineering: Stress testing, benchmarking, analysis, and tuning (flame diagram /CPU/GC/ Network /TCP kernel parameters, etc.). Event management: SEV grading, IM/IC command, cross‑team collaboration and communication, writing high‑quality retrospectives, and tracking action items. Preferred Experience Experience in exchange/matching/payment clearing and settlement/operation, and maintenance of securities firms or crypto wallets and chain nodes. Experience in implementing anti‑ddos, WAF, Bot management, rate limiting, and traffic governance systems. Experience in compliance systems (SOC2, ISO 27001, PCI‑DSS, SOX‑class controls), security audits, and evidence retention. Experience in multi‑region GSLB, cross‑cloud/multi‑cloud architecture, Chaos engineering, and GameDay organization. Go/Java optimization experience, practical experience in messaging systems (Kafka/RocketMQ/Pulsar) and storage (TiDB/Vitess/Citus/TDSQL, etc.). Have experience in cost optimization and FinOps. If this outstanding opportunity sounds like your next career move, please submit through "Apply Now" or send your resume in Word format to Luke Wang at and put SRE Lead - Top tier Crypto Exchange in the subject header. Data provided is for recruitment purposes only. #J-18808-Ljbffr



  • Singapur, Singapore JPMorganChase Full time

    Lead Site Reliability Engineer, Electronic Trading Service Join to apply for the Lead Site Reliability Engineer, Electronic Trading Service role at JPMorgan Chase . Job Description Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability. As...


  • Singapur, Singapore ABAXX SINGAPORE PTE. LTD. Full time

    Site Reliability Engineer - Networking We are seeking competent candidate joining our Infrastructure Team for the mission building and operating MAS regulated marketplace and clearing house. This role is ideal for someone with a strong foundation in AWS services, infrastructure as code, and cloud security, who is passionate about building scalable, secure,...


  • Singapur, Singapore APPLE SOUTH ASIA PTE. LTD. Full time

    Summary There is a lot that goes into building the most secure yet user-friendly devices in the world. We are a unique Software Development group with a charter to secure our platforms, which include iOS software, iOS Devices, and Mac. We build solutions that are used by our customers, engineering teams, and manufacturing environments. We are looking for...


  • Singapur, Singapore APPLE SOUTH ASIA PTE. LTD. Full time

    Summary Imagine what you could accomplish here. Bring your passion, creativity, and dedication, and there will be no limit to what you can achieve. This is not just another SRE role - it's a chance to help redefine how reliability engineering is practiced at hyper-scale. Our team is building the platforms that will autonomously operate Apple's core...


  • Singapur, Singapore DADACONSULTANTS PTE. LTD. Full time

    Site Reliability Engineer (SRE) Responsibilities Assist in deploying and managing microservices on Kubernetes cloud platforms. Work with Cloud and DevOps teams to deploy services across multiple cloud providers (AWS, OCI, Azure, GCP). Conduct load and chaos testing to ensure system scalability and reliability. Support disaster recovery planning and...


  • Singapur, Singapore APPLE SOUTH ASIA PTE. LTD. Full time

    Summary Imagine what you could accomplish here. Bring your passion, creativity, and dedication, and there will be no limit to what you can achieve. This is not just another SRE role-it's a chance to help redefine how reliability engineering is practiced at hyper-scale. Our team is building the platforms that will autonomously operate Apple's core information...


  • Singapur, Singapore Apple Inc. Full time

    Manager, Site Reliability Engineering - Information Security Singapore, Singapore Software and Services Imagine what you could accomplish here. Bring your passion, creativity, and dedication, and there will be no limit to what you can achieve. This is not just another SRE role—it’s a chance to help redefine how reliability engineering is practiced at...


  • Singapur, Singapore SYSTEMS ON SILICON MANUFACTURING COMPANY PTE LTD Full time

    SSMC (Systems on Silicon Manufacturing Company Pte. Ltd.), is a Joint Venture between NXP and TSMC. We offer flexible and cost-effective semiconductor fabrication solutions by maintaining fully equipped SMIF cleanroom environment, 100% equipment automation and proven wafer-manufacturing processes.We're looking for innovative, passionate, and talented people...

  • SITE ENGINEER

    2 weeks ago


    Singapur, Singapore SAI RKGL CONSTRUCTION & ENGINEERING PTE. LTD. Full time

    1. Manage Site operation issues2. Liaise with Consultants clients and main contractor3. Prepare Flow Chart and Methods of Construction Supervising Site Activities4. Computer knowledge with MS Office5. Able to lead Site team independently6. Able to work under pressure and meet deadlines7. May have to work OT as per site requirement8. Familiar with AUTOCAD9....

  • SITE ENGINEER

    2 weeks ago


    Singapur, Singapore SUSFORCE ENGINEERING PTE. LTD. Full time

    Site Engineer Responsibilities Inspect facilities and analyze operational data Maintain compliance with safety and regulatory requirements Compile estimates for technical and material requirements for project development Determine and present estimates of operating costs Evaluate operations and processes Suggest process and technical design changes to...