ASE - Site Reliability Engineering Manager

2 weeks ago


Singapore APPLE SOUTH ASIA PTE. LTD. Full time
Roles & Responsibilities

Job Summary

Apple Services Engineering team is one of the most exciting examples of Apple’s long-held passion for combining art and technology. Join Apple Services Engineering Cloud Service Infrastructure team, as a Site Reliability Engineering Manager, to help support and scale cloud services for millions of Apple users. This is a hands-on role, to establish SRE practices for a private cloud service, to accelerate our ability to reliably and consistently deliver thousands of applications. You will lead a team of Site Reliability Engineers who thrive in a fast-paced workplace, where drive and collaboration are the keys to success.

Key Qualifications
  • 8+ years in critical, large scale distributed systems experience, combining Hardware, Operating Systems and Software
  • 3+ years experience building and leading engineering teams; ideally SRE or Production Engineering
  • Strong emphasis on SRE as an engineering subject area, with proficiency in at least in one of the following languages (Golang, Rust, Python, Swift)
  • Understanding of SRE principals, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts, with a keen eye for opportunities to eliminate toil by code and process improvements
  • Superb interpersonal skills, capable of working with multi-functional technical and business teams and varying levels of management, influencing decision making
Description

The Apple Services Engineering Cloud Services SRE organization is looking for a strong, hands-on leader. The leader will lead a platform focused SRE team, and be responsible for the reliability of the platform. The platform serves workloads that provide our organization and our customers with their favorite applications, services, and tools.

We are domain experts in fleet management, systems, and software engineering. We build automations, instrument reliability tools, and respond to alerts and incidents which may pose a risk to the reliability of the platform. Team’s focus is on infrastructure capabilities and processes, improving the reliability and efficiency of the systems, at scale.

RESPONSIBILITIES INCLUDE:

  • Act as the Service Owner, designing and mapping key performance indicators to achieve the organization’s mission
  • Lead the definition of requirements, priorities and planning of engineering deliverables
  • Implement structured engineering and operations processes
  • Lead the team in daily agile SRE practices, ensuring proper team focus on priorities, achievements, and deliverables
  • Optimize velocity and efficiency of delivery, and drive continuous improvement

Success depends on strong understanding of SRE principles and practices, combined with a track record of resolving issues in a live production environment, and implementing strategies to minimize them while driving clear action plans for the team.

The successful candidate will be highly self-motivated with a passion for excellence, quality, and detail. As a leader, they are responsible for coaching and mentoring their team members, helping them achieve service goals, and build career paths in alignment. It’s imperative for the leader to empower their team by providing appropriate context and timely feedback.

The leader will not only own the service, but will also collaborate with other teams within Apple. They will build trust with stakeholders and partner through diplomacy, discussion, and follow-through. This is a broad cross-organization role with high-visibility, collaborating with multiple teams. They are expected to invest in and build good relations with key partners. Their collaboration with internal customers, product engineering, and development groups is critical to success.

Education

Bachelors or Masters in Computer Science, Computer Engineering, or equivalent experience.

Additional Requirements

Apple is an Equal Opportunity Employer that is committed to inclusion and diversity. We also take affirmative action to offer employment and advancement opportunities to all applicants, including minorities, women, protected veterans, and individuals with disabilities. Apple will not discriminate or retaliate against applicants who inquire about, disclose, or discuss their compensation or that of other applicants. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.


Tell employers what skills you have

Hardware
Interpersonal Skills
Process Improvements
ability to influence
Fault Analysis
Software
Team Leading
monitoring
building team
Distributed Systems
Python
Operating Systems
Site Reliability Engineering
Rust
Production Engineering

  • Singapore Apple South Asia Pte. Ltd. Full time

    Job SummaryApple Services Engineering team is one of the most exciting examples of Apple's long-held passion for combining art and technology. Join Apple Services Engineering Cloud Service Infrastructure team, as a Site Reliability Engineering Manager, to help support and scale cloud services for millions of Apple users. This is a hands-on role, to establish...


  • Singapore APPLE SOUTH ASIA PTE. LTD. Full time

    Roles & ResponsibilitiesJob SummaryApple Services Engineering team is one of the most exciting examples of Apple’s long-held passion for combining art and technology. Join Apple Services Engineering Cloud Service Infrastructure team, as a Site Reliability Engineer, to help support and scale cloud services for millions of Apple users. We are building and...


  • Singapore APPLE SOUTH ASIA PTE. LTD. Full time

    Roles & ResponsibilitiesJob SummaryThe Apple Services Engineering (ASE) team is one of the most exciting examples of Apple’s long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books. And they do it on a massive scale, meeting Apple’s high expectations with...


  • Singapore Apple South Asia Pte. Ltd. Full time

    Job SummaryApple Services Engineering team is one of the most exciting examples of Apple's long-held passion for combining art and technology. Join Apple Services Engineering Cloud Service Infrastructure team, as a Site Reliability Engineer, to help support and scale cloud services for millions of Apple users. We are building and supporting new and existing...


  • Singapore Apple South Asia Pte. Ltd. Full time

    Job SummaryThe Apple Services Engineering (ASE) team is one of the most exciting examples of Apple's long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books. And they do it on a massive scale, meeting Apple's high expectations with high performance to deliver a...


  • Singapore ADYEN SINGAPORE PTE. LTD. Full time

    Roles & ResponsibilitiesThis is AdyenAdyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition.For our teams, we create an environment with opportunities for our people to succeed,...


  • Singapore Adyen Singapore Pte. Ltd. Full time

    This is AdyenAdyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition.For our teams, we create an environment with opportunities for our people to succeed, backed by the culture and...


  • Singapore Wipro Limited Full time

    Job Role  : Site Reliability Engineer Location : SingaporeExperience : 2+ Years of relevant experience Job Description : Responsibilities : Hands-on design, implement, and extend automation tools for infrastructure, application, and container management. Monitor Staging, Test and Development environments for a myriad of Products in an agile and dynamic...


  • Singapore LIVERAMP PTE. LTD. Full time

    Roles & ResponsibilitiesABOUT THIS JOBThe SRE team is responsible for owning and supporting deployments of global products, and providing first line operational support. We are looking for a Site Reliability engineer who is excited about establishing and advocating for best practices for product deployments and SRE. You will be able to leverage your software...


  • Singapore Liveramp Pte. Ltd. Full time

    ABOUT THIS JOBThe SRE team is responsible for owning and supporting deployments of global products, and providing first line operational support. We are looking for a Site Reliability engineer who is excited about establishing and advocating for best practices for product deployments and SRE. You will be able to leverage your software engineering expertise...


  • Singapore ADECCO PERSONNEL PTE LTD Full time

    Roles & ResponsibilitiesResponsibilitiesTo be responsible for reliability, availability, user experience, capacity planning, toil reduction, process enhancement and digitalization of the cloud-based internet services.Handle SRE role for assigned cloud services owning the KPIs for reliability, issue to resolution, service deployment, business continuity...


  • Singapore Adecco Personnel Pte Ltd Full time

    ResponsibilitiesTo be responsible for reliability, availability, user experience, capacity planning, toil reduction, process enhancement and digitalization of the cloud-based internet services.Handle SRE role for assigned cloud services owning the KPIs for reliability, issue to resolution, service deployment, business continuity management, security policy...


  • Singapore A-IT SOFTWARE SERVICES PTE LTD Full time

    Roles & ResponsibilitiesRole: Site Reliability EngineerJob Level: 3-5 years of relevant experience (L2)Job DescriptionJob Title: Site Reliability EngineerJob ObjectivesThe Site Reliability Engineer/Software Engineer is a contract position responsible software and systems engineering to build and run large-scale, distributed, fault-tolerant systems.As a...


  • Singapore Sciente Consulting Full time

    Mandatory Skill-set Bachelor's degree in Computer Science, Mathematics, Engineering, or any related field; Has 3 to 4 years of proven experience in monitoring application and systems; Expertise in Grafana, Elastic Stack (Elasticsearch, Logstash, Kibana, Beats), and Kafka, including setup, configuration, upgrades, patching, data management, monitoring,...

  • Site Engineer

    2 days ago


    Singapore SHANGHAI TUNNEL ENGINEERING CO (SINGAPORE) PTE LTD Full time

    Roles & ResponsibilitiesMain Duties: Overseeing the construction activities and progress, planning, implementation and monitoring work schedules in accordance to the master and detailed work programme Liaise with Professional Engineer on the Temporary works Liaise with consultants (QPS) for technical issues and coordinate the site activities with...


  • Singapore Shopee Full time

    Job Description:Set up, deploy and configure marketplace services in the private cloud platform.Continuously improve the marketplace services in the private cloud, including but not limited to stress test automation, capacity management, service autoscaler, disaster recovery, chat operations, knowledge base management, SOP automation, dynamic service...


  • Singapore SYGNUM PTE. LTD. Full time

    Roles & ResponsibilitiesAbout The RoleWe’re seeking a Site Reliability Engineer who is ready to work with new technologies and architectures in a forward-thinking organization, especially blockchain that’s always pushing boundaries. Here, you will take complete, end-to-end ownership of our applications. You will have experience building products across...


  • Singapore RECRUIT EXPRESS PTE LTD Full time

    Roles & ResponsibilitiesMy client is looking for a looking for an experienced individual to join the SRE team. The individual will support production monitoring and is expected to be hands-on using technology.Job Requirements: Java Programming Experience (2+ years) or equivalent level of coding knowledge Python/Shell Scripting (2+ years) or data...


  • Singapore Shopee Full time

    Job Description:Fun and energetic team culture with strong emphasis on learning, sharing and growth.Learning programme / roadmap for all new hires (applicable for both fresh / experienced).Wide exposure to enable rapid growth in personal skills and career.Deep dive into Marketplace core product lines.50:50 time spent between technical operations and software...


  • Singapore Sygnum Pte. Ltd. Full time

    About The RoleWe're seeking a Site Reliability Engineer who is ready to work with new technologies and architectures in a forward-thinking organization, especially blockchain that's always pushing boundaries. Here, you will take complete, end-to-end ownership of our applications. You will have experience building products across the stack and a firm...