Service Reliability Senior Administrator
1 month ago
We are seeking a highly skilled Service Reliability Senior Administrator to join our team at Riot Games. As a key member of our operations team, you will be responsible for ensuring the reliability and availability of our services.
Responsibilities
- Triage and investigate live incidents to identify root causes and implement corrective actions.
- Execute technical return to service actions in a fast-paced, distributed systems environment to quickly restore service and protect player experience.
- Monitor the health of our distributed services using observability tools and identify gaps in alerting, runbook steps, processes, or tools.
- Execute and maintain runbooks to keep documentation up to date.
- Onboard new team members and provide support and coordination during major launches, events, and release deployments.
- Contribute to project work with some guidance to develop automation scripts, utilities, and new processes to continuously improve the incident management process.
- Document details of incident response as needed to identify problems and improve overall incident management/response.
- Participate in post-incident RCA meetings as required.
Required Qualifications
- Computer Science/IT Systems/Information Technology diploma or equivalent.
- 2+ years of Service Reliability Administration or equivalent role (System Analyst, System Administrator/Engineer, Live Operations, Network Administrator, NOC Engineer, etc).
- Experience with incident management and a good understanding of ITIL processes.
- Familiarity with the core concepts of operating systems, networking, SDLC, and Agile methodologies.
- Good troubleshooting skills, including triaging incidents in a high-capacity, high-availability, and highly distributed environment.
- Experience with monitoring solutions (e.g., Datadog, New Relic, Nagios, Elasticsearch, Grafana), event management tools (e.g., BigPanda, Moogsoft), and ITIL-based ticketing systems (e.g., ServiceNow, Jira).
Desired Qualifications
- Computer Science/IT Systems/Information Technology degree or equivalent.
- Understanding of relational databases (e.g., MySQL), CI/CD pipelines (especially Jenkins), and experience working on deployments in a live environment.
- Experience working in container-based ecosystems (e.g., Docker) and with a container scheduler (e.g., Kubernetes, Amazon EKS/ECS, or GKE).
- AWS Cloud Services experience/certification/training or equivalent, Linux+, and Network+ or equivalents.
- Experience building automation scripts/utilities/jobs using Python, PowerShell, JavaScript, or Bash.
- Familiarity with Site Reliability Engineering (SRE) principles and best practices.
Benefits
- Full health insurance for you, your spouse, and children.
- Open paid time off.
- Retirement benefits with company matching.
- Life insurance, parental leave, plus short-term and long-term disability.
- Play Fund to broaden and deepen your knowledge of our players and community through games.
- We will double down on your donations of time and money to non-profits.
We will certainly be looking at your past studies and experience, but for this role, we also look for dedicated people with a personal relationship with games. If you embody player empathy and care about the experiences of players, this could be the role for you.
Don't forget to include a resume and cover letter. We receive many applications, but we'll notice a fun, well-written intro that shows us you Dare to Dream and Execute with Excellence.
-
Service Reliability Senior Administrator
4 months ago
Singapore, Singapore | Riot Games | Full time
Responsibilities: Triage and investigation of live incidents. Execute technical return to service actions in a fast-paced, distributed systems environment, specifically microservices, to quickly restore service and protect player experience. Monitor the health of Riot’s distributed services using observability tools, identify gaps with alerting,...
-
Senior Reliability Engineer
1 month ago
Singapore, Singapore | DBS Bank | Full time
Job Title: SVP/VP, Specialist, SRE, EASRE, Technology. About the Role: DBS Bank is seeking a highly experienced and skilled Senior Reliability Engineer to lead our production operations and ensure high availability and reliability of services to internal and external customers. As a key member of our Technology and Operations team, you will work closely with...
-
Senior Cloud Reliability Engineer
1 month ago
Singapore, Singapore | NodeFlair | Full time
Senior Site Reliability Engineer. We are working with a leading pioneer in the Cryptocurrency space, utilizing one of the largest data platforms, and as part of their continued growth, NodeFlair has been engaged to search for a Senior Site Reliability Engineer to join their Singapore/Remote team. Key Responsibilities: Collaborate with the team on software...
-
Senior Engineer, Electrical Reliability
4 months ago
Singapore, Singapore | Celanese Corporation | Full time
Responsibilities: Job Description - Senior Reliability Engineer (Electrical) / Electrical - Subject Matter Expert. Electrical Reliability and Maintenance: - Provide technical subject matter expertise to enhance electrical reliability and ensure all KPIs are met. - Improve reliability of electrical equipment by implementing repair...
-
Senior CMOS Reliability Engineer
1 month ago
Singapore, Singapore | Micron | Full time
Transforming the Future of Information. Micron is a leader in the development of innovative memory and storage solutions. We are seeking a highly skilled Senior DDQA CMOS Reliability Engineer to join our team. About the Role: As a Senior DDQA CMOS Reliability Engineer, you will be responsible for developing and optimizing quality and reliability criteria related...
-
Senior Administration Executive
1 month ago
Singapore, Singapore | Flintex Consulting Pte Ltd | Full time
Job Description. Key Responsibilities: As a Senior Administration Executive, you will be responsible for providing administrative support to the Singapore Office, ensuring the facilitation and provision of effective, efficient, and reliable administrative logistical support for the company. Key Responsibilities: 1. Manage the administration function of the...
-
Reliability Engineering Lead
4 weeks ago
Singapore, Singapore | IHiS | Full time
Job Overview: The Reliability Lead will collaborate with senior management to discuss strategies for improving application and system reliability. This role will also manage the reliability team and ensure that existing site reliability engineering initiatives are on track. Key Responsibilities: Strive for automation in production systems. Identify significant...
-
Senior Grid Reliability Engineer
1 month ago
Singapore, Singapore | SP Group | Full time
At SP Group, we are seeking a highly motivated and experienced Senior Grid Reliability Engineer to join our team. As a key member of our distribution network planning team, you will be responsible for planning and designing the distribution network to meet customer demand and ensuring the reliability of our grid. Key Responsibilities: Develop and implement...
-
Senior Site Reliability Engineer
1 month ago
Singapore, Singapore | Shopee | Full time
About the Role: We are seeking a highly skilled Senior Site Reliability Engineer to join our Engineering and Technology team in Singapore. As a key member of our team, you will be responsible for managing the technical operations of Shopee's core marketplace businesses, including product lines such as Shopee voucher management, Shopee discount/coins...
-
Singapore, Singapore | Flintex Consulting Pte Ltd | Full time
Job Description: Mon to Fri, 8.30am - 5.30pm. Key Job Objective and Job Responsibility: To properly discharge responsibilities in undertaking the Singapore Office general office administration function in order to ensure the facilitation and provision of effective, efficient and reliable administrative logistical support for the company with a view to achieving the...
-
Site Reliability Specialist
6 months ago
Singapore, Singapore | IHiS | Full time
Position Overview: The Reliability Lead will support the reliability principal with senior management in strategy discussion for application & system improvement, and will also manage the reliability team. He/She will ensure that the existing site reliability engineering (SRE) initiatives, such as monitoring availability, uplifting capability and automation...
-
Senior DDQA NAND CMOS Reliability Engineer
5 months ago
Singapore, Singapore | Micron | Full time
Our vision is to transform how the world uses information to enrich life for all. Join an inclusive team passionate about one thing: using their expertise in the relentless pursuit of innovation for customers and partners. The solutions we build help make everything from virtual reality experiences to breakthroughs in neural networks possible. We do it...
-
Singapore, Singapore | Celanese Corporation | Full time
Job Summary: We are seeking a highly skilled Principal Engineer/Senior Engineer to join our team as a Fixed Equipment Reliability Expert. The successful candidate will be responsible for developing and implementing effective reliability strategies to improve the reliability of static equipment. Key Responsibilities: Develop and implement reliability strategies to...
-
Site Reliability Engineer
1 month ago
Singapore, Singapore | Sea | Full time
Job Title: Site Reliability Engineer. At Sea, our Infrastructure team is responsible for providing end-to-end managed services and solutions for our entire Internet infrastructure. We excel in building architecture, providing solutions, and operating data centers, connectivity, cloud, networking, systems, storage, and security. As a Site Reliability Engineer,...
-
Site Reliability Engineer
1 month ago
Singapore, Singapore | Sea | Full time
Our Infrastructure team provides the end-to-end managed services and solutions for the Group's entire Internet infrastructure alongside running business applications. We excel in building the architecture, providing solutions and operations of data centre, connectivity, cloud, networking, system, storage and security. We are a proud provider of high-quality...
-
Expert/Senior Site Reliability Engineer
2 months ago
Singapore, Singapore | Sea | Full time
Our Infrastructure team provides the end-to-end managed services and solutions for the Group's entire Internet infrastructure alongside running business applications. We excel in building the architecture, providing solutions and operations of data centre, connectivity, cloud, networking, system, storage and security. We are a proud provider of high-quality...
-
Electrical Reliability Engineer
4 weeks ago
Singapore, Singapore | Celanese Corporation | Full time
Job Summary: Celanese Corporation is seeking a highly skilled Electrical Reliability Engineer to join our team. As a key member of our electrical discipline, you will be responsible for enhancing electrical reliability and ensuring all KPIs are met. Key Responsibilities: Provide technical subject matter expertise to enhance electrical reliability and ensure all...
-
Senior Site Reliability Engineer
1 month ago
Singapore, Singapore | Sea | Full time
About Sea Labs: At Sea Labs, we're at the forefront of the Sea platform's development, supporting diverse business lines across e-commerce, supply chain, games, payment, and finance. Our strong growth and unique positioning have led to the launch of Sea Labs Indonesia, where passionate engineers drive the best experience for our users in Indonesia and...
-
Senior Office Administrator
1 month ago
Singapore, Singapore | moomoo | Full time
About Moomoo: Moomoo Financial Singapore Pte. Ltd. is a leading financial technology company that is revolutionizing the investing experience. Our digitalized brokerage and wealth management platform, moomoo, offers a seamless and intuitive experience for users. With a strong focus on user experience and risk management, we are committed to providing the best...
-
Technical Officer
1 month ago
Singapore, Singapore | SP Group | Full time
About the Role: SP Group is seeking a highly motivated and experienced Technical Officer to join our team. As a key member of our grid reliability team, you will play a critical role in upholding Singapore's world-class grid reliability and contributing to a sustainable energy future. Key Responsibilities: Implement service connection projects to provide timely...