Site Reliability Engineer

2 weeks ago


Singapore Ubisoft Full time

Company Description
**CREATOR OF WORLDS**

Ubisoft’s 20,000 team members, working across more than 40 locations around the world, are bound by a common mission to enrich players’ lives with original and memorable gaming experiences. Their dedication and talent has brought to life many acclaimed franchises such as Assassin’s Creed, Far Cry, Watch Dogs, Just Dance, Rainbow Six, and many more to come. Ubisoft is an equal opportunity employer that believes diverse backgrounds and perspectives are key to creating worlds where both players and teams can thrive and express themselves. If you are excited about solving game-changing challenges, cutting edge technologies and pushing the boundaries of entertainment, we invite you to join our journey and help us create the unknown.

Since opening its doors in 2008, Ubisoft Singapore has become the biggest AAA game development studio in Southeast Asia. The 500-strong studio is home to 35+ different nationalities focused on delivering ambitious gaming experiences to our players. Ubisoft Singapore has been contributing to all the Assassin’s Creed® titles since Assassin’s Creed® II. It innovated within the franchise as the studio behind the naval battle gameplay and water technology in Assassin’s Creed® III, Assassin’s Creed® IV Black Flag® and most recently in Assassin’s Creed® Valhalla. Its expertise in AAA and live operations, combined with a passion for naval gameplay, pushed the team to lead the development of Skull and Bones revealed at E3 in 2017.

**Job Description**:
**YOUR DAILY ADVENTURE**

The Site Reliability Engineer (SRE) is responsible of Ops and development tasks such as level 4 support and the implementation of highly scalable Game infrastructure. The SRE is working as the Infra services integrator that enables the production to build Games using principals of cloud-Native, DevOps and continuous Delivery. The SRE has a good development background with knowledge of infrastructure and automation.

**WHAT YOU WILL DO**
- Designing and/or implementing a highly scalable Cloud and Bare Metal server and network infrastructure
- Share responsibility and ownership of game functions and services with developers who create them
- Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity.
- Engage in and improve the whole lifecycle of services—from inception and design, through deployment, operation and refinement.
- Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
- Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
- Practice sustainable incident response and blameless postmortems.
- Ability to debug and optimize code and automate routine tasks (“toil”)
- Consulting on the game's software and data architecture to ensure maximum infrastructure scalability
- Ensuring reliability and consistency of game data
- Work with developers to develop adequate monitoring and monitor system events to ensure health, maximum system availability and service quality
- Assist in evaluating new requirements, technical design and standards
- Reduce the cost of failure for changes
- Define prescriptive ways to measure reliability

**Qualifications**:
**Education**:
A baccalaureate degree or equivalent experience in Computer Information Systems, Computer Science, Mathematics or a related field.

**Relevant experience**:
2+ years of experience with software development or 5+ years of automation focused system administration with Hybrid hosting solutions.
- Experience in one or more of the following is a plus: C, C++, C#, Java, Python, Go or Ruby.

**WHAT YOU BRING**

**Skills**:

- Self-driven, be slightly paranoid about system stability
- Be able to teach fundamental principles to other engineers/experts.
- Skill in developing techniques and methodologies to resolve unprecedented problems or situations
- Ability to make complex information accessible to non-technical people

**Knowledge**:

- In-depth knowledge of Linux system internals and operating system design
- In-depth understanding of Public Cloud providers (GCP, AWS) and Openstack platform
- In-depth knowledge on CI/CD, Gitlab, Change management
- In-depth knowledge on Infrastructure orchestration with Terraform
- Proficient knowledge in orchestration systems such as Kubernetes
- Proficient knowledge in Configuration Management tools such as Saltstack, Chef, Puppet & Ansible
- Proficient knowledge in Dashboards (Grafana), Alerting and Monitoring system
- Proficient knowledge in Promotheus
- Proficient knowledge in VictoriaMetrics
- Proficient knowledge in relational database systems like MySQL
- Proficient knowledge in document storage systems like MongoDB
- Proficient knowledge in Redis/PostGreSQL

Additional Information
**WHAT YOU’L



  • Singapore Sea Limited Full time

    Engineering and Technology - Infrastructure, Singapore - Entry Level Our DevOps Engineering team plays an important role in developing and maintaining the internal systems and tools for the Infrastructure team. As a Site Reliability Engineer, you are responsible for improving the availability and reliability of our Infrastructure services. - Responsible for...


  • Singapore JJ Consulting Services Full time

    Our Client is a fast growing company in Singapore, who is seeking to recruit a Site Reliability Engineer. **Site Reliability Engineer** **Key Roles & Responsibilities** - Providing ancillary support of Enterprise-Grade Products and solutions at customer's sites - Ironing out deployment issues or challenges that our customers may face - Responsible for...


  • Singapore Qlik Full time

    **What makes us Qlik?** A Gartner® Magic Quadrant Leader for 14 years in a row, Qlik transforms complex data landscapes into actionable insights, driving strategic business outcomes. Serving over 40,000 global customers, our portfolio leverages pervasive data quality and advanced AI/ML capabilities that lead to better decisions, faster. We excel in...


  • Singapore Adyen Full time

    **This is Adyen** Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition. For our teams, we create an environment with opportunities for our people to succeed, backed by the...


  • Singapore NodeFlair Full time

    **Job Summary**: **Salary** S$11,500 - S$16,500 / Monthly **Job Type** **Seniority** Senior **Years of Experience** At least 7 years **Tech Stacks** Microsoft Puppet Java Ansible Python **This is Adyen** Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the...


  • Singapore THALES SOLUTIONS ASIA PTE. LTD. Full time

    Roles & ResponsibilitiesDigital Competence Center (DCC)Thales IFE has decided to create a leading technology center in Singapore for its IFE Digital Engineering. It will leverage on unique digital skillset from Singapore and neighbouring countries on Cloud engineering. Thanks to a multi-year strategic plan, Thales is locating at WeWork@Suntec, a center that...


  • Singapore People Profilers Full time

    Job Description: **Responsibilities**: - Support services before they go live through activities such as system design consulting and launch reviews. - Develop and maintain tools, re-designing capacity planning infrastructure for greater scalability. - Troubleshooting, diagnosing and fixing software issues. - Suggesting architecture improvements, pushing...


  • Singapore Oxford Knight Full time

    Senior Site Reliability Engineer Job OverviewOxford Knight is seeking a highly skilled Senior Site Reliability Engineer to join our team and support our Linux trading infrastructure.Key ResponsibilitiesDesign and implement software components and systems to improve trading services.Provide level II support, including emergency response and advanced...


  • Singapore Gravitas Recruitment Group Full time

    Job details - Location - Singapore - Salary - S$9000 - S$13000 per month - Job Type - Permanent - Ref - BBBH137137_1690786002 - Posted - about 1 hour ago Job summary **Our client, a trading firm, is looking for a Site Reliability Engineer to join their team. They are seeking team players who demonstrate a creative approach to problem-solving and take...


  • Singapore AKAMAI TECHNOLOGIES APJ PTE. LTD. Full time

    As a Senior Site Reliability Engineer, you will influence a wide array of teams. You will be responsible for the performance and reliability of Akamai’s delivery products by working with the Product, Engineering and Support teams to diagnose, mitigate and solve outages. You will have to solve some of the most complex problems in distributed systems at...


  • Singapore NextWave Partners Full time

    Location: - Singapore- Job Type: - Permanent- Discipline: - Software Engineering- Salary: - Negotiable- Contact: - Chelsea Phan**Senior Site Reliability Engineer** **Singapore** **About the role** We are working with a climate technology, who is currently working on a smart carbon measurement, accounting, and management Saas platform that allows...


  • Singapore Sea Limited Full time

    Engineering and Technology - Infrastructure, Singapore - Experienced (Individual Contributor) Our DevOps Engineering team plays an important role in developing and maintaining the internal systems and tools for the Infrastructure team. As a Senior Site Reliability Operation Engineer, you are responsible for improving the availability and reliability of our...


  • Singapore SINGAPORE POWER LIMITED Full time

    **What You'll Do**: - Evangelist for Site Reliability Engineer (SRE) practices in SP Digital (SPD) - Maintain the Reliability tools with regular patching and upgrades - Mange and evolve the full stack observability tools used in SPD - Enhance the customer experience by simplifying the onboarding process and documentation - Work with teams to improve the...


  • Singapore Experis Full time

    **Site Reliability Engineer**: - Location- Singapore- Job reference- BBBH133946_1701163649- Salary- S$5000 - S$7000 per month- Consultant name - Siau Zianyi Consultant contact no. - EA License No. - 02C3423 - Consultant Registration No. - R23113527 **Responsibilities**: - Oversee deployment, change management, issue triage, and infrastructure management...


  • Singapore GXS BANK PTE. LTD. Full time

    **Job Description & Requirements**: Get to know the Role: - As a Site Reliability Engineer (SRE) you will help build a meaningful engineering discipline, combining software and systems to develop creative engineering solutions to operations problems. - Much of our support and software development focuses on optimizing existing systems, building...


  • Singapore Ambition Singapore Full time

    Ambition SingaporeAbout the CompanyAmbition Singapore is a top quantitative trading firm with a results-driven culture, seeking a Site Reliability Engineer to safeguard their innovative services and strategies.


  • Singapore DADACONSULTANTS PTE. LTD. Full time

    Roles & ResponsibilitiesSenior Site Reliability Engineer (SRE) | Big DataResponsibilities:Manage the full lifecycle of services, from design to deployment and maintenance.Develop and improve automation tools for scalability and reliability.Troubleshoot and resolve software and infrastructure issues, ensuring data security.Optimize system architecture and...


  • Singapore TOSS-EX PR PTE. LTD. Full time

    Roles & ResponsibilitiesWe are seeking talented and driven professionals to join our Site Reliability Engineering (SRE) team. This role involves helping organizations enhance the availability, performance, and resilience of their applications and services through the deployment and administration of Observability Platforms.Key ResponsibilitiesDeploy and...


  • Singapore INFOSYS COMPAZ PTE. LTD. Full time

    Roles & ResponsibilitiesJob DescriptionWe are seeking talented and driven professionals to join our Site Reliability Engineering (SRE) team. This role involves helping organizations enhance the availability, performance, and resilience of their applications and services through the deployment and administration of Observability PlatformsKey Responsibilities...


  • Singapore TOSS-EX PR PTE. LTD. Full time

    Roles & ResponsibilitiesWe are seeking talented and driven professionals to join our Site Reliability Engineering (SRE) team. This role involves helping organizations enhance the availability, performance, and resilience of their applications and services through the deployment and administration of Observability Platforms.Key ResponsibilitiesDeploy and...