ASE - Site Reliability Engineering Manager
2 weeks ago
Job Summary
Apple Services Engineering team is one of the most exciting examples of Apple’s long-held passion for combining art and technology. Join Apple Services Engineering Cloud Service Infrastructure team, as a Site Reliability Engineering Manager, to help support and scale cloud services for millions of Apple users. This is a hands-on role, to establish SRE practices for a private cloud service, to accelerate our ability to reliably and consistently deliver thousands of applications. You will lead a team of Site Reliability Engineers who thrive in a fast-paced workplace, where drive and collaboration are the keys to success.
Key Qualifications- 8+ years in critical, large scale distributed systems experience, combining Hardware, Operating Systems and Software
- 3+ years experience building and leading engineering teams; ideally SRE or Production Engineering
- Strong emphasis on SRE as an engineering subject area, with proficiency in at least in one of the following languages (Golang, Rust, Python, Swift)
- Understanding of SRE principals, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts, with a keen eye for opportunities to eliminate toil by code and process improvements
- Superb interpersonal skills, capable of working with multi-functional technical and business teams and varying levels of management, influencing decision making
The Apple Services Engineering Cloud Services SRE organization is looking for a strong, hands-on leader. The leader will lead a platform focused SRE team, and be responsible for the reliability of the platform. The platform serves workloads that provide our organization and our customers with their favorite applications, services, and tools.
We are domain experts in fleet management, systems, and software engineering. We build automations, instrument reliability tools, and respond to alerts and incidents which may pose a risk to the reliability of the platform. Team’s focus is on infrastructure capabilities and processes, improving the reliability and efficiency of the systems, at scale.
RESPONSIBILITIES INCLUDE:
- Act as the Service Owner, designing and mapping key performance indicators to achieve the organization’s mission
- Lead the definition of requirements, priorities and planning of engineering deliverables
- Implement structured engineering and operations processes
- Lead the team in daily agile SRE practices, ensuring proper team focus on priorities, achievements, and deliverables
- Optimize velocity and efficiency of delivery, and drive continuous improvement
Success depends on strong understanding of SRE principles and practices, combined with a track record of resolving issues in a live production environment, and implementing strategies to minimize them while driving clear action plans for the team.
The successful candidate will be highly self-motivated with a passion for excellence, quality, and detail. As a leader, they are responsible for coaching and mentoring their team members, helping them achieve service goals, and build career paths in alignment. It’s imperative for the leader to empower their team by providing appropriate context and timely feedback.
The leader will not only own the service, but will also collaborate with other teams within Apple. They will build trust with stakeholders and partner through diplomacy, discussion, and follow-through. This is a broad cross-organization role with high-visibility, collaborating with multiple teams. They are expected to invest in and build good relations with key partners. Their collaboration with internal customers, product engineering, and development groups is critical to success.
EducationBachelors or Masters in Computer Science, Computer Engineering, or equivalent experience.
Additional RequirementsApple is an Equal Opportunity Employer that is committed to inclusion and diversity. We also take affirmative action to offer employment and advancement opportunities to all applicants, including minorities, women, protected veterans, and individuals with disabilities. Apple will not discriminate or retaliate against applicants who inquire about, disclose, or discuss their compensation or that of other applicants. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
Tell employers what skills you have
Hardware
Interpersonal Skills
Process Improvements
ability to influence
Fault Analysis
Software
Team Leading
monitoring
building team
Distributed Systems
Python
Operating Systems
Site Reliability Engineering
Rust
Production Engineering
-
ASE - Site Reliability Engineering Manager
2 weeks ago
Singapore Apple South Asia Pte. Ltd. Full timeJob SummaryApple Services Engineering team is one of the most exciting examples of Apple's long-held passion for combining art and technology. Join Apple Services Engineering Cloud Service Infrastructure team, as a Site Reliability Engineering Manager, to help support and scale cloud services for millions of Apple users. This is a hands-on role, to establish...
-
ASE - Site Reliability Engineer
2 weeks ago
Singapore APPLE SOUTH ASIA PTE. LTD. Full timeRoles & ResponsibilitiesJob SummaryApple Services Engineering team is one of the most exciting examples of Apple’s long-held passion for combining art and technology. Join Apple Services Engineering Cloud Service Infrastructure team, as a Site Reliability Engineer, to help support and scale cloud services for millions of Apple users. We are building and...
-
ASE - Service Reliability Engineer
2 weeks ago
Singapore APPLE SOUTH ASIA PTE. LTD. Full timeRoles & ResponsibilitiesJob SummaryThe Apple Services Engineering (ASE) team is one of the most exciting examples of Apple’s long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books. And they do it on a massive scale, meeting Apple’s high expectations with...
-
ASE - Site Reliability Engineer
2 weeks ago
Singapore Apple South Asia Pte. Ltd. Full timeJob SummaryApple Services Engineering team is one of the most exciting examples of Apple's long-held passion for combining art and technology. Join Apple Services Engineering Cloud Service Infrastructure team, as a Site Reliability Engineer, to help support and scale cloud services for millions of Apple users. We are building and supporting new and existing...
-
ASE - Service Reliability Engineer
2 weeks ago
Singapore Apple South Asia Pte. Ltd. Full timeJob SummaryThe Apple Services Engineering (ASE) team is one of the most exciting examples of Apple's long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books. And they do it on a massive scale, meeting Apple's high expectations with high performance to deliver a...
-
Site Reliability Engineer
4 weeks ago
Singapore ADYEN SINGAPORE PTE. LTD. Full timeRoles & ResponsibilitiesThis is AdyenAdyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition.For our teams, we create an environment with opportunities for our people to succeed,...
-
Site Reliability Engineer
4 weeks ago
Singapore Adyen Singapore Pte. Ltd. Full timeThis is AdyenAdyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition.For our teams, we create an environment with opportunities for our people to succeed, backed by the culture and...
-
Site Reliability Engineer
3 weeks ago
Singapore Wipro Limited Full timeJob Role : Site Reliability Engineer Location : SingaporeExperience : 2+ Years of relevant experience Job Description : Responsibilities : Hands-on design, implement, and extend automation tools for infrastructure, application, and container management. Monitor Staging, Test and Development environments for a myriad of Products in an agile and dynamic...
-
Site Reliability Engineer
2 weeks ago
Singapore LIVERAMP PTE. LTD. Full timeRoles & ResponsibilitiesABOUT THIS JOBThe SRE team is responsible for owning and supporting deployments of global products, and providing first line operational support. We are looking for a Site Reliability engineer who is excited about establishing and advocating for best practices for product deployments and SRE. You will be able to leverage your software...
-
Site Reliability Engineer
3 weeks ago
Singapore Liveramp Pte. Ltd. Full timeABOUT THIS JOBThe SRE team is responsible for owning and supporting deployments of global products, and providing first line operational support. We are looking for a Site Reliability engineer who is excited about establishing and advocating for best practices for product deployments and SRE. You will be able to leverage your software engineering expertise...
-
Site Reliability Engineer
1 week ago
Singapore ADECCO PERSONNEL PTE LTD Full timeRoles & ResponsibilitiesResponsibilitiesTo be responsible for reliability, availability, user experience, capacity planning, toil reduction, process enhancement and digitalization of the cloud-based internet services.Handle SRE role for assigned cloud services owning the KPIs for reliability, issue to resolution, service deployment, business continuity...
-
Site Reliability Engineer
7 days ago
Singapore Adecco Personnel Pte Ltd Full timeResponsibilitiesTo be responsible for reliability, availability, user experience, capacity planning, toil reduction, process enhancement and digitalization of the cloud-based internet services.Handle SRE role for assigned cloud services owning the KPIs for reliability, issue to resolution, service deployment, business continuity management, security policy...
-
Site Reliability Engineer
11 hours ago
Singapore A-IT SOFTWARE SERVICES PTE LTD Full timeRoles & ResponsibilitiesRole: Site Reliability EngineerJob Level: 3-5 years of relevant experience (L2)Job DescriptionJob Title: Site Reliability EngineerJob ObjectivesThe Site Reliability Engineer/Software Engineer is a contract position responsible software and systems engineering to build and run large-scale, distributed, fault-tolerant systems.As a...
-
Site Reliability Engineer
3 weeks ago
Singapore Sciente Consulting Full timeMandatory Skill-set Bachelor's degree in Computer Science, Mathematics, Engineering, or any related field; Has 3 to 4 years of proven experience in monitoring application and systems; Expertise in Grafana, Elastic Stack (Elasticsearch, Logstash, Kibana, Beats), and Kafka, including setup, configuration, upgrades, patching, data management, monitoring,...
-
Site Engineer
2 days ago
Singapore SHANGHAI TUNNEL ENGINEERING CO (SINGAPORE) PTE LTD Full timeRoles & ResponsibilitiesMain Duties: Overseeing the construction activities and progress, planning, implementation and monitoring work schedules in accordance to the master and detailed work programme Liaise with Professional Engineer on the Temporary works Liaise with consultants (QPS) for technical issues and coordinate the site activities with...
-
Site Reliability Expert Engineer
3 weeks ago
Singapore Shopee Full timeJob Description:Set up, deploy and configure marketplace services in the private cloud platform.Continuously improve the marketplace services in the private cloud, including but not limited to stress test automation, capacity management, service autoscaler, disaster recovery, chat operations, knowledge base management, SOP automation, dynamic service...
-
Senior Site Reliability Engineer
1 week ago
Singapore SYGNUM PTE. LTD. Full timeRoles & ResponsibilitiesAbout The RoleWe’re seeking a Site Reliability Engineer who is ready to work with new technologies and architectures in a forward-thinking organization, especially blockchain that’s always pushing boundaries. Here, you will take complete, end-to-end ownership of our applications. You will have experience building products across...
-
Site Reliability Engineer #IAC
2 weeks ago
Singapore RECRUIT EXPRESS PTE LTD Full timeRoles & ResponsibilitiesMy client is looking for a looking for an experienced individual to join the SRE team. The individual will support production monitoring and is expected to be hands-on using technology.Job Requirements: Java Programming Experience (2+ years) or equivalent level of coding knowledge Python/Shell Scripting (2+ years) or data...
-
Senior Site Reliability Engineer
4 weeks ago
Singapore Shopee Full timeJob Description:Fun and energetic team culture with strong emphasis on learning, sharing and growth.Learning programme / roadmap for all new hires (applicable for both fresh / experienced).Wide exposure to enable rapid growth in personal skills and career.Deep dive into Marketplace core product lines.50:50 time spent between technical operations and software...
-
Senior Site Reliability Engineer
5 days ago
Singapore Sygnum Pte. Ltd. Full timeAbout The RoleWe're seeking a Site Reliability Engineer who is ready to work with new technologies and architectures in a forward-thinking organization, especially blockchain that's always pushing boundaries. Here, you will take complete, end-to-end ownership of our applications. You will have experience building products across the stack and a firm...