
Platform & Reliability Engineer
1 week ago
- **SLOs & error budgets**:
- Define, track, and evangelize latency and availability targets for our payment APIs.
- **Observability**:
- Deploy Cloud Monitoring, Cloud Trace, Error Reporting, and dashboards; integrate alerts via Incident.io and Slack for on-call.
- **Incident lifecycle**:
- Establish blameless postmortems, guardrails, and runbooks to drive learning and prevent recurrence.
- **CI/CD golden path**:
- Codify Cloud Build pipelines and automated canary rollouts for Cloud Functions / Cloud Run.
- **Infrastructure as Code**:
- Manage GCP resources; embed security, IAM least-privilege, and cost controls by default.
- **Performance & cost tuning**:
- Profile hot paths (BigQuery, Firestore, Pub/Sub) and implement caching or concurrency improvements to keep user latency < 100 ms.
- **Developer tooling**:
- Eliminate toil by improving local-to-prod parity and secrets management, and by making environments spin up with a single command.
- **Culture carrier**:
- Instill reliability thinking across engineering and product as the first platform-focused hire.
**Requirements**:
- 5+ years of experience building and operating production systems at scale, ideally on Google Cloud or a similar serverless stack and in fast-paced or startup settings.
- Hands‑on fluency with Firebase, Cloud Build, Cloud Run/Functions, Pub/Sub, Cloud SQL/Spanner, and VPC Service Controls.
- Strong coding in Python or Go for automation, with an eye on maintainability.
- Demonstrated record of driving observability, on‑call, and cost optimisation in a fast‑moving environment.
- Excellent collaboration and communication skills to work effectively with cross-functional teams.
- Experience in payments, PCI‑DSS, or crypto settlement flows is a bonus.
_**Tech note:** we are **99% serverless**. There are no pet VMs to patch, but the stakes are higher: every cold‑start, DB connection pool, and retry policy can impact real money transfers. You’ll architect for resiliency and velocity._
-
Platform & Reliability Engineer
1 day ago
Singapore, Breeze, Full time
Overview: Platform & Reliability Engineer at Breeze. Join a Sequoia-backed fintech startup building the universal