Senior Site Reliability Engineer, Data Platform

2 days ago


Singapore Shopify Full time

Company Description

Shopify is the leading omni-channel commerce platform. Merchants use Shopify to design, set up, and manage their stores across multiple sales channels, including mobile, web, social media, marketplaces, brick-and-mortar locations, and pop-up shops. The platform also provides merchants with a powerful back office and a single view of their business, from payments to shipping. The Shopify platform was engineered for reliability and scale, making enterprise-level technology available to businesses of all sizes. Headquartered in Ottawa, Canada, Shopify currently powers millions of merchants' businesses in approximately 175 countries and is trusted by brands such as Allbirds, Gymshark, PepsiCo, Staples, and many more.

**Job Description**:
Our Data Platform Engineering group builds and maintains the platform that delivers accessible data to power decision-making at Shopify for over a million merchants. We’re hiring high-impact developers for our Reliability group:

- The Reliability group operates the data platform efficiently, consistently, and reliably. Its reliability engineers build tools and infrastructure with other teams within the Data Platform to encourage consistency, and they champion reliability across shared platform infrastructure. Think DevOps for Data.
- Data reliability engineers ensure that data flows through the platform on time and as expected. They ensure that data services perform according to their service level objectives, that all systems have service level indicators, and that recovery is swift when a system falls below its objectives.

**Qualifications**:
While our teams value specialized skills, they've also got a lot in common. We're looking for a(n):

- High-energy self-starter with experience and passion for data and big data scale processing. You enjoy working in fast-paced environments and love making an impact.
- Exceptional communicator with the ability to translate technical concepts into easy-to-understand language for our stakeholders.
- Enthusiastic collaborator who is excited to work with a remote team; you value collaborating on problems, asking questions, delivering feedback, and supporting others in their goals, whether they are in your vicinity or entire cities apart.
- Solid software backend engineer: experienced in building and maintaining systems at scale with reliability metrics as a default.

A Senior Site Reliability Engineer at Shopify typically has 4-6 years of experience in one or more of the following areas:

- Working with deployments of distributed query and compute engines (Spark, Flink, Druid, Trino/PrestoDB)
- Familiarity with modern Big-Data storage technologies (Iceberg, Hudi, Delta)
- Strong programming fundamentals, ideally in a variety of languages (Java, Scala, Python, Go)
- Measuring query latencies, resource allocation and management, and data lake performance (Presto, SQL)
- Familiarity with cloud infrastructure (Google Cloud, Kubernetes, Terraform)
- An understanding of operational toil, observability, performance, and scalability
- Familiarity with incident response and management tools like PagerDuty
- Kubernetes certifications (CKA/CKAD) are a plus
- If you don’t know all this stuff, don’t worry, we’ll teach you


**Additional Information**:
At Shopify, we understand that experience comes in many forms. We’re dedicated to adding new perspectives to the team, so if your experience is this close to what we’re looking for, please consider applying.


