Site Reliability Engineer

3 days ago


Singapore, Singapore Canonical Full time

Overview

Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers, and industry leaders in many sectors. The company is a pioneer of globally distributed collaboration, with 1200+ colleagues in 75+ countries and very few office-based roles. Teams meet two to four times yearly in person, in interesting locations around the world, to align on strategy and execution.

The company is founder-led, profitable, and growing.

We are hiring a Site Reliability Engineer

Our goal is to perfect enterprise infrastructure DevOps practices, raising the bar on what's possible with automation by embracing a model-driven approach, whether on-premises or on public clouds.

We run hundreds of private clouds, Kubernetes clusters, and applications for customers across both physical and public cloud estates. We identify and address incidents, monitor and observe applications, anticipate potential issues, and enable product refinement to ultimately achieve high-quality standards in our open source portfolio.

To succeed in this role, you need a strong background in Linux, Python, and networking, as well as knowledge of how clouds work. Your work will encompass the entire stack, from bare-metal networking and kernel up to Kubernetes and open source applications. You can expect to be trained in our core technologies like OpenStack, Kubernetes, security standards, open source products like Kubeflow, Kafka, OpenSearch, databases, and many others.

Automation for us is a software engineering problem that we approach with a scientific mindset to bring operations at scale, driven by metrics and code.

Location: Globally remote role

Responsibilities
  • Deploy and run OpenStack, Kubernetes, storage solutions, and open source applications, applying DevOps practices.
  • Identify and address incidents, monitor and observe applications, anticipate potential issues, and enable product refinement to achieve high-quality standards in our open source portfolio.
  • Work across the stack from bare-metal networking and kernel up to Kubernetes and open source applications; participate in training on core technologies like OpenStack, Kubernetes, security standards, Kubeflow, Kafka, OpenSearch, databases, and more.
  • Approach automation as a software engineering problem driven by metrics and code.
What we are looking for in you
  • Degree in software engineering or computer science
  • Python software development experience
  • Operational experience in Linux environments
  • Experience with Kubernetes deployment or operations
  • Excellent interpersonal skills, curiosity, flexibility, and accountability
  • Ability to travel internationally twice a year, for company events up to two weeks long
Bonus skills
  • Familiarity with OpenStack deployment or operations
  • Familiarity with public cloud deployment or operations
  • Familiarity with private cloud management
What we offer colleagues

We consider geographical location, experience, and performance in shaping compensation worldwide. We adjust compensation every 6 months to recognize outstanding performance, and in addition to base pay, we offer annual bonuses. We provide all team members with additional benefits reflecting our values and ideals. We balance our programs to meet local needs and ensure fairness globally.

  • Distributed work environment with twice-yearly team sprints in person
  • Personal learning and development budget of USD 2,000 per year
  • Compensation review every 6 months
  • Recognition rewards
  • Annual holiday leave
  • Maternity and paternity leave
  • Employee Assistance Programs
  • Opportunity to travel to new locations to meet your colleagues
  • Priority Pass and travel upgrades for long-haul company events
About Canonical

Canonical is a pioneering tech firm at the forefront of the global move to open source. As the company that publishes Ubuntu, one of the most important open source projects and the platform for AI, IoT, and the cloud, we are changing the world of software. We recruit on a global basis and set a very high standard for people joining the company. We expect excellence – in order to succeed, we need to be the best at what we do. Most colleagues at Canonical have worked from home since its inception in 2004. Working here is a step into the future, and will challenge you to think differently, work smarter, learn new skills, and raise your game.

Canonical is an equal opportunity employer. We are proud to foster a workplace free from discrimination. Diversity of experience, perspectives, and background creates a better work environment and better products. Whatever your identity, we will give your application fair consideration.


  • Singapore, Singapore IDEMIA Full time

    Purpose: This role plays a critical part in ensuring the reliability, scalability, and performance of our systems and services. You will work closely with development and...


  • Singapore, Singapore Beijing Foreign Enterprise Management Consultants Co.,Ltd. Full time

    On behalf of Huawei, a world-renowned information and communication technology company, we are seeking passionate and talented individuals to join our team as Site Reliability Engineer...


  • Singapore, Singapore Point72 Full time

    About the role: As part of Point72’s Technology Team, you will focus on developing and maintaining complex, distributed, real-time systems that support our Global Macro business. Your responsibilities will include optimizing operations through automation, building foundational SRE...


  • Singapore, Singapore WeChat International Pte. Ltd. Full time

    Site Reliability Engineer. Remote type: Onsite. Location: Singapore-CapitaSky. Time type: Full time. Posted: 30+ days ago. Job requisition ID: R. Business Unit: Technology Engineering Group (TEG) is responsible for supporting the company and its business groups on technology and operational platforms, as well as...

  • Site Reliability

    3 days ago


    Singapore, Singapore Canonical Full time

    Site Reliability / GitOps Engineer role at Canonical. Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely...


  • Singapore, Singapore Apple Inc. Full time

    There is a lot that goes into building the most secure yet user-friendly devices in the world. We are a unique Software Development group with a charter to secure our platforms, which include iOS software, iOS Devices, and Mac. We build solutions that are used by our customers, engineering teams, and manufacturing environments. We are looking for Site...


  • Singapore, Singapore IDEMIA Full time

    Overview: This role plays a critical part in ensuring the reliability, scalability, and performance of our systems and services. You will work closely with development and operations teams to build and maintain robust infrastructure and tools that support high availability, monitoring and rapid...


  • Singapore, Singapore RigNet Full time

    About us One team. Global challenges. Infinite opportunities. At Viasat, we’re on a mission to deliver connections with the capacity to change the world. For more than 35 years, Viasat has helped shape how consumers, businesses, governments and militaries around the globe communicate. We’re looking for people who think big, act fearlessly, and create an...


  • Singapore, Singapore Tower Research Capital Full time

    Tower Research Capital is a leading quantitative trading firm founded in 1998. Tower has built its business on a high-performance platform and independent trading teams. We have a 25+ year track...


  • Singapore, Singapore AvePoint Full time

    Site Reliability Engineer (SRE) (GovTech): We are seeking a skilled and passionate Engineer to join our team to build and operate a Whole-of-Government (WoG) runtime platform. As a Site Reliability Engineer, you will be responsible for designing and operating GitLab, AWS and Kubernetes-based infrastructure and solutions that power our platform, to ensure the...