Senior Site Reliability Engineer, Data Platform
2 days ago
Company Description
Shopify is the leading omni-channel commerce platform. Merchants use Shopify to design, set up, and manage their stores across multiple sales channels, including mobile, web, social media, marketplaces, brick-and-mortar locations, and pop-up shops. The platform also provides merchants with a powerful back-office and a single view of their business, from payments to shipping. The Shopify platform was engineered for reliability and scale, making enterprise-level technology available to businesses of all sizes. Headquartered in Ottawa, Canada, Shopify currently powers over millions of merchants' businesses in approximately 175 countries and is trusted by brands such as Allbirds, Gymshark, PepsiCo, Staples, and many more.
**Job Description**:
Our Data Platform Engineering group builds and maintains the platform that delivers accessible data to power decision-making at Shopify for over a million merchants. We’re hiring high-impact developers for our Reliability group:
- The Reliability group operates the data platform efficiently in a consistent and reliable manner. They are reliability engineers that build tools and infrastructure with other teams within the Data Platform to leverage and encourage consistency. They champion reliability across shared platform infrastructure. Think DevOps for Data.
- Data reliability engineers ensure that data flows through the platform on time and as expected. They ensure data services are performing according to service level objectives, that all systems have service level indicators, and if those systems fall below their objectives, recovery is swift.
**Qualifications**:
While our teams value specialized skills, they've also got a lot in common. We're looking for a(n):
- High-energy self-starter with experience and passion for data and big data scale processing. You enjoy working in fast-paced environments and love making an impact.
- Exceptional communicator with the ability to translate technical concepts into easy to understand language for our stakeholders.
- Excitement for working with a remote team; you value collaborating on problems, asking questions, delivering feedback, and supporting others in their goals whether they are in your vicinity or entire cities apart.
- Solid software backend engineer: experienced in building and maintaining systems at scale with reliability metrics as a default.
A Senior Site Reliability Engineer at Shopify typically has 4-6 years of experience in one or more of the following areas:
- Working with deployments of distributed query and compute engines (Spark, Flink, Druid, Trino/PrestoDB)
- Familiarity with modern Big-Data storage technologies (Iceberg, Hudi, Delta)
- Strong programming fundamentals, ideally in a variety of languages (Java, Scala, Python, Go)
- Measuring query latencies, resource allocation and management, and data lake performance (Presto, SQL)
- Familiarity with cloud infrastructure (Google Cloud, Kubernetes, Terraform)
- An understanding of operational toil, observability, performance, and scalability
- Familiarity with incident response and management tools like PagerDuty
- Kubernetes certifications (CKA/CKAD) are a plus
- If you don’t know all this stuff, don’t worry, we’ll teach you
Senior Software Developer
- Distributed Systems #Senior Software Developer
- Data Engineering #Senior Data App Developer #Senior Data Reliability Engineer #Senior Software Developer
- Reliability #SRE #DevOps #Data Infrastructure #Reliability Engineer
Additional Information- At Shopify, we understand that experience comes in many forms. We’re dedicated to adding new perspectives to the team - so if your experience is this close to what we’re looking for, please consider applying._
-
Singapore Razer Inc. Full timeSite Reliability Engineers/Platform Engineers (Mid/Senior)Joining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work , offering you the opportunity to make an impact globally while working across a global team located across 5 continents. Razer is also a great place to work, providing you the...
-
Site Reliability Engineer/data Engineer
2 weeks ago
Singapore NodeFlair Full time**Job Summary**: **Salary** S$7,000 - S$9,000 / Monthly **Job Type** **Seniority** Mid **Years of Experience** At least 3 years **Tech Stacks** Analytics Spring Elastic Shell OOP Logstash Chef Puppet Kibana Grafana Linux kafka Springboot Ansible Node.js Elasticsearch Python **Must have **:Elasticsearch(ELK) Skillset.**: **Role...
-
Senior Site Reliability Engineer
8 hours ago
Singapore AKAMAI TECHNOLOGIES APJ PTE. LTD. Full timeAs a Senior Site Reliability Engineer, you will influence a wide array of teams. You will be responsible for the performance and reliability of Akamai’s delivery products by working with the Product, Engineering and Support teams to diagnose, mitigate and solve outages. You will have to solve some of the most complex problems in distributed systems at...
-
Site Reliability Engineer
8 hours ago
Singapore Shopee Full timeDepartmentEngineering and Technology- LevelExperienced (Individual Contributor)- LocationSingaporeThe Engineering and Technology team is at the core of the Shopee platform development. The team is made up of a group of passionate engineers from all over the world, striving to build the best systems with the most suitable technologies. Our engineers do not...
-
Site Reliability Engineers/Platform Engineers
2 weeks ago
Singapore Razer Full timeJoining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work, offering you the opportunity to make an impact globally while working across a global team located across 5 continents. Razer is also a great place to work, providing you the unique, gamer-centric #LifeAtRazer experience that will put...
-
Site Reliability Engineer Sre
2 weeks ago
Singapore NodeFlair Full time**Job Summary**: **Salary** S$7,000 - S$9,000 / Monthly **Job Type** **Seniority** Mid **Years of Experience** At least 4 years **Tech Stacks** Analytics Spring Shell OOP Logstash Chef Puppet UNIX Kibana Grafana Linux kafka Springboot Ansible Node.js Elasticsearch Python **NTT DATA Singapore PTE Ltd is a wholly owned subsidiary of NTT DATA Corp, a part...
-
Singapore TIKTOK PTE. LTD. Full timeA leading content e-commerce platform in Singapore is looking for a skilled Site Reliability Engineer to provide SRE solutions, enhance infrastructure capabilities, and ensure system reliability. The ideal candidate will have a Bachelor's degree in a relevant field, at least 5 years of programming experience, and familiarity with e-commerce systems. Join...
-
Site Reliability Engineer, Traffic Platform
1 week ago
Singapore ByteDance Full timeResponsibilities About ByteDance Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create...
-
Site Reliability Engineer
2 weeks ago
Singapore Crystal Equation Corporation Full timeWe are seeking a skilled Site Reliability Engineer (SRE) to join our team. SRE will be responsible for keeping all internal user-facing applications and other production systems running smoothly. This hybrid role involves a combination of both development and operations skills to build and manage systems that are both efficient and reliable. The Enterprise...
-
Site Reliability Engineer
6 days ago
Singapore Rapsys Technologies Full time**Roles and Responsibilities**: 2. Set up and operate the server infrastructure and software (Linux, Elasticsearch, Logstash, Grafana, Kibana, Kafka, Nginx) based on bank’s security standards and industry’s security standards. 3. Perform continuous improvement for the platform covering areas such as: capacity planning, observability, monitoring,...