Lead Site Reliability Engineer, AI/ML Platform
1 week ago
**JOB DESCRIPTION**
Take on a critical role in shaping the future of a globally recognized firm and make a direct, significant impact in an environment built for top achievers in site reliability.
As a **Lead Site Reliability Engineer** at JPMorgan Chase within the Chief Technology Office, AI/ML Technology team, you hold a leadership role in your team, demonstrate strong knowledge across multiple technical domains, and advise others on the technical and business issues facing them. You lead and conduct resiliency design reviews, break complex problems into digestible work for other engineers, act as a technical lead for medium to large-sized products, and provide advice and mentoring to other engineers.
**Job responsibilities**
- Demonstrates and champions site reliability culture and practices and exerts technical influence throughout your team
- Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate
- Collaborates with team members to identify comprehensive service level indicators, and with stakeholders to establish reasonable service level objectives and error budgets with customers
- Demonstrates a high level of technical expertise within one or more technical domains and proactively identifies and solves technology-related bottlenecks in your areas of expertise
- Documents and shares knowledge within your organization via internal forums and communities of practice
- Understands service level indicators and utilizes service level objectives to proactively resolve issues before they impact customers
- Supports adoption of site reliability engineering best practices within your team
**Required qualifications, capabilities, and skills**
- Bachelor’s Degree in Computer Science / Information Systems / Engineering or related disciplines
- At least 5 years of site reliability engineering or related experience
- Fluency in at least one programming language or framework (e.g., Python, Java with Spring Boot, .NET)
- Proficiency and experience in observability, including white-box and black-box monitoring, SLO alerting, and telemetry collection, using tools such as Grafana, Dynatrace, Prometheus, Datadog, and Splunk
- Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, Terraform)
- Experience with containers and container orchestration (e.g., Docker, Kubernetes, ECS)
- Experience with troubleshooting common networking technologies and issues
- Ability to identify and solve problems related to complex data structures and algorithms
- Ability to extend your influence and collaborate across different levels and stakeholder groups
- Strong AWS and Python skills needed
**ABOUT US**
J.P. Morgan is a global leader in financial services, providing strategic advice and products to the world’s most prominent corporations, governments, wealthy individuals and institutional investors. Our “first-class business in a first-class way” approach to serving clients drives everything we do. We strive to build trusted, long-term partnerships to help our clients achieve their business objectives.
We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. In accordance with applicable law, we make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as any mental health or physical disability needs.
**ABOUT THE TEAM**
Our professionals in our Corporate Functions cover a diverse range of areas from finance and risk to human resources and marketing. Our corporate teams are an essential part of our company, ensuring that we’re setting our businesses, clients, customers and employees up for success.