Site Reliability Engineer

3 weeks ago


Singapore GXS BANK PTE. LTD. Full time
Roles & Responsibilities

Job Description & Requirements

Get to know the Role:

  • As a Site Reliability Engineer (SRE) you will help build a meaningful engineering discipline, combining software and systems to develop creative engineering solutions to operations problems.
  • Much of our support and software development focuses on optimizing existing systems, building infrastructure and reducing work through automation.
  • You’ll join a team of curious problem solvers with a diverse set of perspectives who are thinking big and taking risks.
  • As an SRE you’ll be focused on running better production applications and systems.
  • SRE is a key contributor to core infrastructure and functional development teams throughout the life cycle to help support software for reliability and scale.
  • Key areas of focus include automation, application/platform uptime and quality, packaging/distribution techniques, platform design “operability”, analytics, deployment, adoption, and tool development, among others.
  • The position will wear many hats from owning day to day health and performance, to identifying incidents/developing remediation plans, to working with open source software and experienced packaging techniques, to working with development teams and contributing to the strategic roadmap and execution.
  • Candidates from a variety of software, platform, or automation engineering backgrounds will be considered for this position.

The day-to-day activities:

  • Design, code, test and deliver software to automate manual operational work
  • Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents
  • Engage with development team throughout the life cycle to help develop software for reliability and scale, ensuring minimal refactoring or changes
  • Perform analytics on previous incidents and usage patterns to better predict issues and take proactive actions
  • Perform the L1/L2/L3 support activities for the Production Support project with analysis and design work, including impact of requirements across all system components
  • Build and drive adoption for greater self-healing and resiliency patterns
  • Design automated software and product upgrades, change management, and release management solutions
  • Participate in the 24x7 support coverage as needed

The must haves:

  • Bachelor's degree in information systems, information technology, computer science, or similar.
  • 1-3+ years professional experience in a software management position.
  • Experience with dockers / containers / k8s.
  • Direct production operations experience in a cloud environment.
  • Experience contributing to technology and product strategy.
  • Experience leading capability building initiatives across diverse areas such as infrastructure and operations automation, software quality, delivery automation and other core engineering.
  • Demonstrated experience of driving operational efficiency and transparency of a growing engineering organization.

Tell employers what skills you have

Remediation
Kubernetes
Change Management
Open Source Software
Release Management
Transparency
Information Technology
Reliability
Strategy
Networking
Packaging
Python
Ansible
Java
Linux
Software Development

  • Singapore NLS Full time

    My client, a global hedge fund, is actively seeking a hands on a highly skilled and motivated SRE to join their team. As an SRE, you will play a critical role in driving the adoption of Site Reliability Engineering practices within their organization. The ideal candidate will have a strong technical background and a passion for driving operational...


  • Singapore PERSOLKELLY Full time

    We have partnered with a renowned global leader in information and communications technology (ICT) infrastructure and smart devices. They are providing full-stack, all-scenario solutions for products and services carriers, enterprises, governments, and individual consumers worldwide. Our client is looking for an enthusiastic Site Reliability Engineer to...


  • Singapore LUXOFT MALAYSIA SDN. BHD. Full time

    With award-winning mobile banking apps and trading systems, our technology platforms help Bank deliver best-in-class products to clients. Naturally, we make sure that the phones work, emails are delivered and PCs run - but we also develop innovative collaboration platforms and workspaces that help our people share their knowledge, their expertise and their...


  • Singapore Jpmorgan Chase Bank, N.a. Full time

    Job SummaryJpmorgan Chase Bank, N.a. seeks a skilled Site Reliability Leader to assume a critical role in defining the future of the firm and drive significant impact across site reliability.About the RoleThis position involves leading initiatives to improve application reliability and stability using data-driven analytics to enhance service levels....


  • Singapore GXS Bank Full time

    Site Reliability Engineer Apply Location: Singapore, Singapore Time Type: Full Time Posted On: Posted 30+ Days Ago Job Requisition ID: R-2024-11-101247About the Team: Our team treats infrastructure and operations as software engineering problems. We are responsible for building and progressing software platforms that enable the provisioning and management...


  • Singapore Qlik Full time

    Job DescriptionThe Regional Director, Site Reliability Engineer (SRE) RoleOur organization is seeking an experienced and execution-minded Regional Director of Site Reliability Engineering (SRE) to lead and build a robust regional SRE organization. This role will be instrumental in aligning the local SRE teams with global SRE strategies while fostering...


  • Singapore GXS Bank Full time

    We're seeking a highly motivated Site Reliability Expert to join our team at GXS Bank. As a Site Reliability Expert, you'll be responsible for ensuring the reliability and scalability of our software platforms, using your expertise in infrastructure and operations as software engineering problems.You'll participate in designing and architecting new...


  • Singapore LUXOFT INFORMATION TECHNOLOGY (SINGAPORE) PTE. LTD. Full time

    Roles & ResponsibilitiesWith award-winning mobile banking apps and trading systems, our technology platforms help Bank deliver best-in-class products to clients. Naturally, we make sure that the phones work, emails are delivered and PCs run - but we also develop innovative collaboration platforms and workspaces that help our people share their knowledge,...


  • Singapore LUXOFT INFORMATION TECHNOLOGY (SINGAPORE) PTE. LTD. Full time

    Roles & ResponsibilitiesWith award-winning mobile banking apps and trading systems, our technology platforms help Bank deliver best-in-class products to clients. Naturally, we make sure that the phones work, emails are delivered and PCs run - but we also develop innovative collaboration platforms and workspaces that help our people share their knowledge,...


  • Singapore GK CONSULTING PTE. LTD. Full time

    Roles & ResponsibilitiesWe're seeking an experienced Senior Site Reliability Engineer to ensure the reliability, availability, and performance of our cloud-based internet services.Key Responsibilities1. Own reliability, availability, and user experience for assigned cloud services2. Develop and implement service governance initiatives to increase reliability...


  • Singapore GK CONSULTING PTE. LTD. Full time

    Roles & ResponsibilitiesWe're seeking an experienced Senior Site Reliability Engineer to ensure the reliability, availability, and performance of our cloud-based internet services.Key Responsibilities1. Own reliability, availability, and user experience for assigned cloud services2. Develop and implement service governance initiatives to increase reliability...


  • Singapore This is an IT support group Full time

    Job OverviewA critically important role awaits an exceptional candidate in the IT support group. This individual will have a profound impact on shaping the future of a globally recognized firm.Key Responsibilities:Conduct resiliency design reviews to ensure high levels of reliability and stability for applications and platforms.Collaborate with team members...


  • Singapore Jpmorgan Chase Bank, N.a. Full time

    Company Overview:JPMorgan Chase Bank, N.A. is a leading global financial services firm committed to delivering innovative solutions and exceptional client experiences.Job Description:As a Lead Site Reliability Engineer within the Infrastructure Platforms team, you will play a critical role in defining the future of our globally recognized firm. You will hold...


  • Singapore JPMorganChase Full time

    Job OverviewWe're seeking an experienced Chief Site Reliability Architect to join our Infrastructure Platforms team at JPMorgan Chase. As a key member of our team, you'll play a critical role in designing and implementing reliable and scalable systems that meet the needs of our clients.Key Responsibilities* Lead initiatives to improve application reliability...


  • Singapore TIKTOK PTE. LTD. Full time

    About UsTikTok PTE. LTD. is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy.At TikTok, we believe that creation is at the core of our purpose. We empower our employees to thrive in an environment that fosters innovation and collaboration. As a Site Reliability Engineer on our Recommendation Architecture...


  • Singapore Oxford Knight Full time

    Senior Site Reliability Engineer - Singapore or Hong Kong **Salary**: up to 250-275k SGD base **Summary** High-frequency prop trading firm with offices worldwide looking for skilled Senior Site Reliability Engineer developer to support and maintain their Linux trading infrastructure on a day-to-day basis. This is a pivotal role where you will lead...


  • Singapore TREEBOX SOLUTIONS PTE. LTD. Full time

    Roles & ResponsibilitiesJob descriptionAs a Site Reliability Engineer at TreeBox Solutions, you will be collaborating with cross-functional teams to design, build, and operate reliable and highly scalable infrastructure for internal and customer-facing applications. You must be self-motivated and has proactive approach to problem-solving. You must also...


  • Singapore TREEBOX SOLUTIONS PTE. LTD. Full time

    Roles & ResponsibilitiesJob descriptionAs a Site Reliability Engineer at TreeBox Solutions, you will be collaborating with cross-functional teams to design, build, and operate reliable and highly scalable infrastructure for internal and customer-facing applications. You must be self-motivated and has proactive approach to problem-solving. You must also...


  • Singapore PERSOLKELLY SINGAPORE PTE. LTD. Full time

    Roles & ResponsibilitiesResponsibilities• To be responsible for reliability, availability, user experience, capacity planning, toil reduction, process enhancement and digitalization of the cloud-based internet services.• Handle SRE role for assigned cloud services owning the KPIs for reliability, issue to resolution, service deployment, business...


  • Singapore PERSOLKELLY SINGAPORE PTE. LTD. Full time

    Roles & ResponsibilitiesResponsibilities• To be responsible for reliability, availability, user experience, capacity planning, toil reduction, process enhancement and digitalization of the cloud-based internet services.• Handle SRE role for assigned cloud services owning the KPIs for reliability, issue to resolution, service deployment, business...