Senior Staff Engineer, Site Reliability Engineering

1 week ago


Singapore Google Full time

Google will be prioritizing applicants who have a current right to work in Singapore, and do not require Google's sponsorship of a visa.

Minimum qualifications:

- 15 years of relevant work experience in a production environment.
- Experience programming in one or more of the following languages: C, C++, Java, Python, Go, Perl, or Ruby.
- Experience architecting, developing, and troubleshooting systems.
- Experience with algorithms and data structures and/or Unix/Linux systems internals (e.g., filesystems, system calls) and administration.

Preferred qualifications:

- Bachelor's degree in Computer Science, similar technical field of study, or equivalent practical experience.
- 10 years of experience in distributed systems, storage systems, or databases.
- Experience designing, analyzing, and troubleshooting large-scale distributed systems.
- Systematic problem-solving approach, coupled with excellent communication skills and a sense of ownership and drive.

**About the job**:
Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google's services—both our internally critical and our externally-visible systems—have reliability, uptime appropriate to users' needs and a fast rate of improvement. Additionally SRE’s will keep an ever-watchful eye on our systems capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation.

On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Google, while using your expertise in coding, algorithms, complexity analysis and large-scale system design.

SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

**To learn more**: check out our books on Site Reliability Engineering, watch a recorded Hangout on Air to meet some of our SREs, or read a career profile about why a Software Engineer chose to join SRE.

Behind everything our users see online is the architecture built by the Technical Infrastructure team to keep it running. From developing and maintaining our data centers to building the next generation of Google platforms, we make Google's product portfolio possible. We're proud to be our engineers' engineers and love voiding warranties by taking things apart so we can rebuild them. We keep our networks up and running, ensuring our users have the best and fastest experience possible.

**Responsibilities**:

- Lead designs of major software components, systems, and features to improve the availability, scalability, latency, and efficiency of Google's services.
- Lead sustainable incident response, blameless postmortems, and production improvements that result in direct business opportunities for Google.
- Provide guidance to other team members on managing end-to-end availability and performance of mission critical services, on building automation to prevent problem recurrence, and building automated responses for non-exceptional service conditions.
- Mentor and train other team members on design techniques and coding standards, and to cultivate innovation and collaboration across multiple teams.
- Manage individual projects priorities, deadlines, and deliverables.

Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also Google's EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know by completing our Accommodations for Applicants form.



  • Singapore Oxford Knight Full time

    Senior Site Reliability Engineer Job OverviewOxford Knight is seeking a highly skilled Senior Site Reliability Engineer to join our team and support our Linux trading infrastructure.Key ResponsibilitiesDesign and implement software components and systems to improve trading services.Provide level II support, including emergency response and advanced...


  • Singapore Sea Limited Full time

    Engineering and Technology - Infrastructure, Singapore - Experienced (Individual Contributor) Our DevOps Engineering team plays an important role in developing and maintaining the internal systems and tools for the Infrastructure team. As a Senior Site Reliability Operation Engineer, you are responsible for improving the availability and reliability of our...


  • Singapore AKAMAI TECHNOLOGIES APJ PTE. LTD. Full time

    **Join our Site Reliability team**: **Help us shape the future of the Internet**: As a Senior Site Reliability Engineer, you will be responsible for: - Deploying, managing, and operating scalable, highly available, and fault-tolerant systems on the Akamai Zero Trust Cloud Platform - Analysing and improving security, stability, speed, and capacity of Akamai...


  • Singapore NodeFlair Full time

    **Job Summary**: **Salary** S$11,500 - S$16,500 / Monthly **Job Type** **Seniority** Senior **Years of Experience** At least 7 years **Tech Stacks** Microsoft Puppet Java Ansible Python **This is Adyen** Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the...


  • Singapore NextWave Partners Full time

    Location: - Singapore- Job Type: - Permanent- Discipline: - Software Engineering- Salary: - Negotiable- Contact: - Chelsea Phan**Senior Site Reliability Engineer** **Singapore** **About the role** We are working with a climate technology, who is currently working on a smart carbon measurement, accounting, and management Saas platform that allows...


  • Singapore Sea Limited Full time

    Engineering and Technology - Infrastructure, Singapore - Entry Level Our DevOps Engineering team plays an important role in developing and maintaining the internal systems and tools for the Infrastructure team. As a Site Reliability Engineer, you are responsible for improving the availability and reliability of our Infrastructure services. - Responsible for...


  • Singapore GK CONSULTING PTE. LTD. Full time

    Roles & ResponsibilitiesWe're seeking an experienced Senior Site Reliability Engineer to ensure the reliability, availability, and performance of our cloud-based internet services.Key Responsibilities1. Own reliability, availability, and user experience for assigned cloud services2. Develop and implement service governance initiatives to increase reliability...


  • Singapore IFUN GAMES Full time

    **Responsibilities** - Design, implement, and maintain tools and processes for monitoring, alerting, and incident response - Collaborate with developers to improve the design and operation of systems, with a focus on reliability, performance, and scalability - Participate in on-call rotations to respond to incidents and handle escalations - Analyze system...


  • Singapore SILICON BOX PTE. LTD. Full time

    **Position Summary** **Senior/Principal Engineer (Reliability) **is responsible for supporting and managing activities of Reliability lab. **Key Responsibilities** - Schedule and Support Reliability Test Request raised by internal and external customers. - Provide machine readiness and perform buyoff for new Reliability tools and capacity ramp-up. -...


  • Singapore ONE STOP ENGINEERING PTE. LTD. Full time

    Title**:Reliability Engineer Purpose Statement (2-3 Sentences): - Ensures reliability and maintainability of equipment, processes, utilities, facilities and controls with an objective to constantly improve site production and cost performance. - Develops engineering solutions to repetitive failures and all other problems that adversely affect plant...


  • Singapore Oxford Knight Full time

    Senior Site Reliability Engineer - Singapore or Hong Kong **Salary**: up to 250-275k SGD base **Summary** High-frequency prop trading firm with offices worldwide looking for skilled Senior Site Reliability Engineer developer to support and maintain their Linux trading infrastructure on a day-to-day basis. This is a pivotal role where you will lead...


  • Singapore Shopify Full time

    Company Description Shopify is the leading omni-channel commerce platform. Merchants use Shopify to design, set up, and manage their stores across multiple sales channels, including mobile, web, social media, marketplaces, brick-and-mortar locations, and pop-up shops. The platform also provides merchants with a powerful back-office and a single view of...


  • Singapore ASIA GULF CLOUD PTE. LTD. Full time

    Roles & ResponsibilitiesPosition Summary:We are looking for a skilled and driven Senior Site Reliability Engineer (SRE) / Team Lead to join our digital banking platform. In this leadership role, you will manage a team of 5–6 engineers and take end-to-end ownership of system reliability, scalability, and operational excellence. You'll work closely with...


  • Singapore ASIA GULF CLOUD PTE. LTD. Full time

    Roles & ResponsibilitiesPosition Summary:We are looking for a skilled and driven Senior Site Reliability Engineer (SRE) / Team Lead to join our digital banking platform. In this leadership role, you will manage a team of 5–6 engineers and take end-to-end ownership of system reliability, scalability, and operational excellence. You'll work closely with...


  • Singapore Hays Full time

    **Your new company** A global leading provider of self‐developed PC‐client and mobile games to worldwide users. **Your new role** As a Senior Site Reliability Engineer, you will be responsible for the construction and maintenance of the network and weak current system in the office, including switch and firewall configuration and tuning....


  • Singapore DADACONSULTANTS PTE. LTD. Full time

    Roles & ResponsibilitiesSenior Site Reliability Engineer (SRE) | Big DataResponsibilities:Manage the full lifecycle of services, from design to deployment and maintenance.Develop and improve automation tools for scalability and reliability.Troubleshoot and resolve software and infrastructure issues, ensuring data security.Optimize system architecture and...


  • Singapore ST Engineering Full time

    Company OverviewWe are a leading global engineering company that provides innovative solutions to meet the evolving needs of our customers. As a System Reliability Engineer, you will play a key role in ensuring the effectiveness of our products and services.Job DescriptionThe primary responsibility of this role is to perform Reliability, Maintainability, and...


  • Singapore The Edge Asia Full time

    Our client is a US hedge fund and their Technology group is constantly improving the company’s IT infrastructure, positioning them at the forefront of a rapidly evolving technology landscape. They are a team of experts experimenting, discovering new ways to harness the power of open-source solutions, and embracing enterprise agile methodology. Their...


  • Singapore Gravitas Recruitment Group Full time

    Job details - Location - Singapore - Salary - S$9000 - S$13000 per month - Job Type - Permanent - Ref - BBBH137137_1690786002 - Posted - about 1 hour ago Job summary **Our client, a trading firm, is looking for a Site Reliability Engineer to join their team. They are seeking team players who demonstrate a creative approach to problem-solving and take...


  • Singapore Quess Corp Limited Full time

    **Job Information**: Industry **Insurance** *** Salary **6000 - 8000** *** Work Experience **2-4 Years** *** City **singapore** *** State/Province **singapore** *** Country **Singapore** *** Zip/Postal Code **189557** *** - Income IT is adopting site reliability engineering (SRE) principles to implement continuous operation support to business...