AIML - Site Reliability Engineer, ML Platform &

3 days ago


Singapore NodeFlair Full time

**Salary**
S$10,000 - S$20,000 / Monthly

**Job Type**
Full time

**Seniority**

Senior

**Years of Experience**
At least 10 years

**Tech Stacks**
Go, play, Datadog, ELK, Splunk, Kubernetes, Python

**Job Summary**:
Apple is a place where extraordinary people gather to do their best work. Together we create products and experiences people once couldn’t have envisioned, and now can’t imagine living without. If you’re excited by the idea of making an impact and joining a team that prides itself on being one of the most diverse and expansive companies in the world, a career with Apple might be your dream job. If you wish to play a part in revolutionizing how people use their computers and mobile devices, build groundbreaking technology for algorithmic search, machine learning, natural language processing and artificial intelligence, and work with the teams building the most scalable big-data systems in existence, this is the role for you.

**Key Qualifications**:

- 10 or more years of experience in a Site Reliability Engineering, observability or ML Ops focused role supporting internet services and distributed systems
- Proficiency in using Go, Python or other higher-level languages for automation, observability and infrastructure management
- Experience building and supporting telemetry, observability and logging solutions for incident, cost and performance management
- Experience with infrastructure or dashboards as code and provisioning tools for Kubernetes and cloud-based services
- Working knowledge of open source or commercial monitoring and observability frameworks and platforms such as ELK, Splunk, OpenCensus, Datadog
- Working knowledge of ML Ops systems and tools is advantageous
- Good interpersonal skills shown through previous projects or assignments

**Description**:

- Monitor production, staging and development environments for a myriad of services in an agile and dynamic organization.
- Employ metrics to drive data-driven solutions for reliability, performance and service insights.
- Design, implement, and extend automation tools for monitoring, logging, ML and data processing pipelines.
- Plan for future capacity needs and investigate new features and products.
- Apply strong problem-solving skills daily; a successful engineer takes the initiative to isolate issues and resolve root causes through investigative analysis.
- Write justifications, incident reports, best-practices documentation and solution specifications.

**Education**:
Bachelor's degree in Computer Science, Computer Engineering, or equivalent


