Site Reliability Engineer
2 weeks ago
Company Description
Higogame is a trailblazer in the mobile gaming and entertainment industry. Since our inception in late 2020, we have been dedicated to transforming the gaming landscape in Southeast Asia and beyond, delivering innovative and immersive experiences that engage millions of players around the globe.
- Our revenue has seen remarkable growth year after year, with operations extending across multiple regions worldwide.
- In just three years, we've risen to become one of the top two games of our kind in the local market.
- We proudly serve around 2 million active users daily and have a total monthly active user base of 5 million worldwide.
- Our team consists of over 200 talented employees, including a robust R&D division of more than 100 experts.
- We offer exceptional career development opportunities and foster a multinational culture that empowers everyone to reach their full potential.
Join us as we continue to push the boundaries of mobile gaming
Job Responabilities:
Responsible for the full lifecycle management of the company's global/multi-region infrastructure. Lead the setup of the Singapore physical data center and deep operations of Google Cloud (GCP) platform. Drive automation and intelligent operations systems to ensure high availability, low cost, and scalable business operations. The role requires both traditional data center operations experience and cloud-native technical vision, acting as a key technical backbone connecting physical resources with cloud capabilities.
I. Core Responsibilities
1. Physical Data Center Planning & Implementation
- Lead end-to-end management of self-built/hosted data centers: requirements analysis, architecture design (network/power/cooling/cabling), equipment selection (servers/switches/UPS), construction acceptance, and post-operations optimization.
- Design multi–data center disaster recovery architectures (e.g., active-active across two sites, three centers), including cross-site synchronization and failover strategies to ensure business continuity.
- Manage internal resource backup/disaster recovery, including art assets, code, and other data assets.
2. Google Cloud (GCP) Deep Operations & Optimization
- Design and manage GCP architecture (Compute Engine, VPC, Cloud Storage, GKE, BigQuery, etc.), supporting cloud migration and hybrid cloud deployment of core business systems.
- Lead full lifecycle management of cloud resources, including cost optimization (reserved instances, autoscaling, idle resource reclamation), performance tuning (network latency, storage IOPS, compute utilization), and security hardening (IAM governance, encryption policies, vulnerability scanning).
- Build cloud-native ops systems using Cloud Monitoring/Logging for real-time alerting and fault detection.
3. Automation & Intelligent Operations Systems
- Lead development and integration of operations toolchains (e.g., Ansible/Puppet automation, Prometheus+Grafana monitoring, ELK logging) to shift operations from manual to platform-based and intelligent.
- Integrate CI/CD pipelines with cloud platforms, optimizing deployment efficiency and stability of containerized (K8s) and serverless (Cloud Functions) workloads.
- Lead root cause analysis (RCA) and postmortems of major incidents, deliver improvement plans, and strengthen contingency planning and drills (e.g., data center power outage, cloud region failure).
4. Cross-Team Collaboration & Technical Enablement
- Collaborate with R&D, QA, and Product teams to provide infrastructure support for rapid business delivery.
- Develop operations standards and technical documentation, drive team knowledge sharing, and mentor junior engineers.
II. Requirements
Basic Qualifications
- Bachelor's degree or higher in Computer Science, Network Engineering, Cloud Computing, or related fields.
- 5+ years in IT operations, including 3+ years in physical data center build/ops, and 2+ years of hands-on GCP experience (must provide project examples).
- Experience in large-scale distributed systems, with solid knowledge of Linux, network protocols (TCP/IP, SDN), and high availability database architectures (MySQL/Redis).
- Must be able to converse in Mandarin due to the need to travel to China to communicate with Chinese speaking stakeholders
- Must be able to travel (up to 50% of the time)
Technical Skills
Data Center
- Familiar with infrastructure (power/cooling/fire safety/cabling), and optimization metrics like PUE/CUE.
- Experience in IDC hosting, custom data center builds, or third-party acceptance audits. Knowledge of industry standards (e.g., GB50174 Data Center Design Standard).
Google Cloud (GCP)
- Proficient in GCP core services: GCE, VPC, Cloud SQL/Spanner, GKE.
- Skilled in GCP cost management (Budgets & Alerts, preemptible VMs, storage tiers).
- Strong in GCP security: IAM, VPC Service Controls, Cloud Firewall, KMS.
Automation & Toolchains
- Skilled with Terraform/Ansible for IaC, scripting in Shell/Python/Go for ops tooling.
- Experienced in Prometheus+Grafana monitoring, ELK/OpenTelemetry for logging & tracing.
- Hands-on Kubernetes operations (scaling, node management, Helm) and CI/CD pipeline integration (Jenkins/GitLab CI).
Soft Skills
- Strong troubleshooting and resilience, able to quickly resolve complex incidents (e.g., data center outage, regional cloud failure).
- Excellent cross-team communication and project leadership skills.
- Fast learner, stays updated on cloud-native (CNCF), AIOps, and industry trends.
III. Nice-to-Haves
- GCP certifications (e.g., Professional Cloud Architect, Associate Cloud Engineer) or ITIL/ISO20000.
- Led/participated in large-scale data center builds (multi-million) or GCP ops projects with million+ annual cloud spend.
- Experience in hybrid cloud (GCP + on-premises) or edge computing ops.
- Published blogs, open-source contributions, or active participation in tech communities (GitHub, CNCF events).
IV. What We Offer
- Competitive salary
- Global platform: Participate in building multi-region intelligent operations infrastructure.
- Growth: Internal tech sharing, external conferences, certification & training support.
- Work environment: Flat management, flexible hours, free snacks, comprehensive medical, hospital and dental coverage
-
Reliability Engineer
2 weeks ago
Central Region, Singapore John Crane Full time $60,000 - $100,000 per yearCompany DescriptionFor more than 100 years, John Crane has equipped global process industries to meet mission-critical challenges. Our customers depend on John Crane to ensure their operations run efficiently and effectively. We deliver on that promise with technologies that maximize reliability, innovations that improve efficiency and services that enable a...
-
Site Reliability Engineer
5 days ago
Central Region, Singapore Manpower Staffing Services (S) Pte Ltd - Head Office Full time $90,000 - $120,000 per yearJoin a global leader in gaming to manage the reliability of game-related platforms and infrastructure across both cloud and on-premise environments. Responsibilities:Responsible for deployment, change, issues triage and infrastructure management of overseas games and relevant components and system, e.g. game monitor system, login services.Responsible for...
-
Reliability Engineer
2 weeks ago
Central Singapore Chevron Full timeAll interested applicants, please read the Data Privacy Notice Responsibilities for this position may include but are not limited to: - Facilitates & stewards the roll out of global reliability initiatives, such as Facility Integrity & Reliability Management (FIRM), within SMP by engaging cross functional stakeholder - Responsible for plant reliability KPI...
-
Site Engineer
7 days ago
Central Region, Singapore Private Advertiser Full time $40,000 - $60,000 per yearResponsibilities :Planning, coordination and supervision of technical aspects of construction projects.Assist Site Manager to manage, oversee and monitor construction activities to ensure design specifications and quality requirements are met.Conduct joint Inspection with RE and RTO.Responsible for all inspection activities are carried out.Assist the Site...
-
Site Resident Engineer
2 weeks ago
Central Region, Singapore GE Vernova - Gas Power Asia Full time $120,000 - $180,000 per yearJob Description SummaryProject Management/ Planning/ Coordinating activities typically in the context of a production or services project to fulfill a customer order or need according to financial/commercial parameters and ensuring customer satisfaction. Responsible for safe and reliable operation of customer's 2XGT26 (HE upgraded) by providing technical...
-
Engineer (Site)
2 weeks ago
Central Region, Singapore Tiong Seng Contractors (Pte) Ltd Full time $90,000 - $120,000 per yearJob Responsibilities:Assist Project Manager in all on-site technical/ engineering matters and authority submissions.Prepare submission of shop drawing / Materials / MOS/ Monthly Progress Claims and PM Reports.Lead in Site Coordination and execution relating to the construction project.Planning of micro scheduling with other trades/disciplines against Master...
-
Site Engineer #78274
7 days ago
Central Region, Singapore Anradus Pte. Ltd. Full time $40,000 - $60,000 per yearSite Engineer #78274Industry/ Organization TypeConstructionPosition TitleSite EngineerWorking LocationIslandwideWorking Hours5.5 days (Mon to Fri 8am - 6pm, Sat 8am - 12pm)Salary PackageBasic Salary up to $4,000 + Allowances + Variable BonusDurationPermanent Key ResponsibilitiesExecute, monitor and ensure the timely completion of the projectLiaise with the...
-
Site Engineer
3 days ago
Central Region, Singapore Adecco Personnel Pte Ltd. Full time $45,000 - $60,000 per year-The OpportunityExciting role with a specialist contractor in architectural glass systems, curtain walls / façades, tension membrane structures, metal & glass canopiesWork location: Multiple LocationBasic Salary up to $4500Adecco is partnering with a construction company specializing in building and architectural solutions. The company is renowned for...
-
Senior Site Engineer
2 weeks ago
Central Region, Singapore Woh Hup (Private) Limited Full time $80,000 - $120,000 per yearThe Senior Site Engineer is responsible for assisting the Project Manager on the progress of site works and manpower related aspects of the project.A summary of the responsibilities of this position are, and not limited to, as follows:Monitor and review master construction program and preparation of catch-up program, if applicable.Monitor and supervise...
-
Site Design Manager APAC
1 day ago
Central Region, Singapore Bridge Data Centres Malaysia Sdn Bhd Full time $120,000 - $180,000 per yearThe role requires a candidate with experience in managing the Data Centre design to meet reliability and availability requirement, efficiency & sustainability benchmark (PUE & WUE), international best practices, regulatory requirement, safety in design, cost target and speed to market.Key ResponsibilitiesHe/She is accountable for completing multiple projects...