Lead Site Reliability Engineer, Cloud Technology

7 hours ago


Singapore JPMorganChase Full time

Public Cloud SRE is responsible for engineering and operating the cloud infrastructure and platforms of JPMC ensuring reliability, resiliency, and security. We have a Senior Software Engineer, Site Reliability position to build the infrastructure and tooling for JPMC’s Public Cloud Platform.

As a Lead Site Reliability Engineer at JPMorgan Chase within the Cloud Reliability Services, you hold a leadership role in your team, demonstrate strong knowledge across multiple technical domains, and advise others on the technical and business issues facing them. Take lead and conduct resiliency design reviews, break up complex problems into digestible work for other engineers, act as a technical lead for medium to large-sized products, and provide advice and mentoring to other engineers.

**Job responsibilities**
- Engage in and improve the lifecycle of cloud services from inception, design, deployment, and operation
- Automate repeated manual tasks, develop tools and automation to improve the efficiency of the platform and infrastructure.
- Analyze defects, propose improvements and drive efficiencies in systems and processes.
- Helps to develop new cloud engineering strategies and implementations for the firm
- As part of Site Reliability, you have the responsibility of ensuring the reliability, availability, and performance of the cloud infrastructure and platform.
- Demonstrates site reliability principles and practices every day and champions the adoption of site reliability throughout your team
- Develop observability and telemetry tools.
- Author and improve the quality of technical engineering documentation
- Debug and solve issues in a production environmentParticipates in SRE on-call rotations and escalation workflows.

**Required qualifications, capabilities, and skills**
- Formal training or certification on software engineering or site reliability engineering and 5+ years applied experience
- Bachelor’s Degree in Computer Science or equivalent
- Expertise in building solutions with AWS cloud services.
- Knowledge in Infrastructure as Code, tools such as Terraform
- Fluency in at least one programming language such as Python and Java.
- Proficiency and experience in observability such as white and black box monitoring, SLO alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc.
- Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, Terraform, etc.)
- Experience with container and container orchestration (e.g., ECS, Kubernetes, Docker, etc.)
- Experience with troubleshooting common networking technologies and issues
- Ability to identify and solve problems related to complex data structures and algorithms
- Drive to self-educate and evaluate new technology
- Ability to teach new programming languages to team members
- Ability to expand and collaborate across different levels and stakeholder groups
- Excellent communication skills working with stakeholders and domain experts across the company to design solutions to user problems
- Self-disciplined, self-managed, self-motivated and strong sense of ownership, urgency, and drive

**Preferred qualifications, capabilities, and skills**
- AWS certifications will be a bonus.



  • Singapore JPMorganChase Full time

    Lead Site Reliability Engineer, Cloud Technology Join to apply for the Lead Site Reliability Engineer, Cloud Technology role at JPMorganChase Lead Site Reliability Engineer, Cloud Technology 20 hours ago Be among the first 25 applicants Join to apply for the Lead Site Reliability Engineer, Cloud Technology role at JPMorganChase Get AI-powered advice on...


  • Singapore JPMorganChase Full time

    Public Cloud SRE is responsible for engineering and operating the cloud infrastructure and platforms of JPMC ensuring reliability, resiliency, and security. We have a Senior Software Engineer, Site Reliability position to build the infrastructure and tooling for JPMC's Public Cloud Platform.As a Lead Site Reliability Engineer at JPMorgan Chase within the...


  • Singapore JPMorganChase Full time

    Public Cloud SRE is responsible for engineering and operating the cloud infrastructure and platforms of JPMC ensuring reliability, resiliency, and security. We have a Senior Software Engineer, Site Reliability position to build the infrastructure and tooling for JPMC's Public Cloud Platform.As a Lead Site Reliability Engineer at JPMorgan Chase within the...


  • Singapore JPMorganChase Full time

    Public Cloud SRE is responsible for engineering and operating the cloud infrastructure and platforms of JPMC ensuring reliability, resiliency, and security. We have a Senior Software Engineer, Site Reliability position to build the infrastructure and tooling for JPMC's Public Cloud Platform. As a Lead Site Reliability Engineer at JPMorgan Chase within the...


  • Singapore TRUEWATCH TECHNOLOGY INC PTE. LTD. Full time

    **Responsibility**: - Run production environment by monitoring availability and taking a holistic view of the system health. - Achieve site reliability automation, minimize system downtime, and reduce site reliability cost. - Manage risks and resolves issues that affect the release scope, schedule and quality. - Suggest architecture improvements, push for...


  • Singapore ASTEK SINGAPORE INNOVATION TECHNOLOGY PTE. LTD. Full time

    Astek is proposing an opportunity for **Site Reliability Engineer (Alibaba Cloud) **to support our project based in Singapore. **Responsibilities** - Build cloud resources in Alibaba and Azure. - Build up IaaS/PaaS service on cloud and compliant with the company’s naming convention and security regulations. - Setup the networking and security...


  • Singapore NodeFlair Full time

    **Job Summary**: **Salary** S$11,250 - S$22,500 / Monthly **Job Type** **Seniority** Lead **Years of Experience** At least 16 years **Tech Stacks** Strategy Container AWS Terraform Jenkins Go Camunda Fluentd RHEL Windows Server Jaeger Elastic CI ubuntu ELK Git Azure Java Grafana Prometheus Splunk Kubernetes Springboot SQL **Description** **Some...


  • Singapore DORMAKABA PRODUCTION GMBH & CO. KG. Full time

    Site Reliability Engineer is responsible for keeping all Cloud Platform Services and Solutions (CPSS) services and other cloud solutions running smoothly. You will be a key contributor on a dynamic team, expand your skillset and become an expert in the most popular cloud software development strategies for dormakaba. We are looking for an independent,...


  • Singapore TikTok Full time

    Site Reliability Engineer, Monetization TechnologySite Reliability Engineer, Monetization Technology1 week ago Be among the first 25 applicantsGet AI-powered advice on this job and more exclusive features.ResponsibilitiesTikTok will be prioritizing applicants who have a current right to work in Singapore, and do not require TikTok sponsorship of a...


  • Singapore Tardis Group Full time

    Direct message the job poster from Tardis Group Recruiter at Tardis Group | Finding Top Talent in Tech & Quant About the Company A rapidly growing technology firm operating at the forefront of artificial intelligence and advanced software solutions. The company fosters a fast-paced, collaborative, and innovation-driven culture, uniting talent across...