Senior Sre

20 hours ago


Singapore Oxford Knight Full time

Senior SRE (High Performance Computing) | Singapore or Hong Kong

**Salary**: up to 250-275k SGD base

**Summary**

High-frequency prop trading firm with offices worldwide looking for skilled Senior Site Reliability Engineer developer to join their High Performance Computing team, developing and supporting their large-scale compute and storage platform.

This platform is designed to solve demanding problems - both business and financial - through computer modelling, simulation and analysis. You will be responsible for the deployment, operation and support of HPC infrastructure (focusing on diverse and distributed on-prem & cloud storage), schedulers - e.g. HTCondor or SLURM, and the container orchestration platform (Kubernetes), as well as managing hardware and software vendor relationships.

The successful SRE will have excellent communication skills, and previous exposure to at least one cloud platform.

**Requirements**:

- Solid Linux admin experience - in a large-scale research environment infrastructure would be ideal (scientific, financial, data analytics)
- Experience with managing a medium to large-scale platform environments, e.g. Kubernetes or Mesos
- Hands-on experience with at least one programming language (preferably Python)
- Degree (or equivalent) in Computer Science or related field

**Benefits**
- Competitive salary + performance-based bonuses
- Generous benefits, including medical insurance and gym membership
- Collaborative and friendly environment with smart, highly engaged colleagues
- Relaxed, dress-down office culture, with breakfast, lunch and snacks provided

**Contact**
If this sounds like you or you would like to know more, please get in touch:
**Andy Stirling-Martin**
+44 (0)20 3137 9579

Job ID jCw9oDkWTJgE
- ABOUT COMPANY
- Oxford Knight
- London, United Kingdom
38 Employees HR & Recruitment

Welcome to Oxford Knight We are dedicated International recruiters. We assist leading technologists and finance professionals into high-end roles wo...


  • SRE Lead

    1 week ago


    Singapore Selby Jennings Full time

    Our client is a leading global investment firm and they are seeking an SRE lead to be based in their Singapore office. Strong focus on Enterprise and Reference Data Systems. Key Responsibilities of SRE Lead: Design and implement automated solutions for operational efficiency and reliability Troubleshoot and resolve production issues related to reference...

  • SRE Lead

    3 weeks ago


    Singapore Selby Jennings Full time

    Our client is a leading global investment firm and they are seeking an SRE lead to be based in their Singapore office. Strong focus on Enterprise and Reference Data Systems. Key Responsibilities of SRE Lead: Design and implement automated solutions for operational efficiency and reliability Troubleshoot and resolve production issues related to reference...

  • Senior Manager

    1 week ago


    Singapore Dropsuite Full time

    Senior Manager – Site Reliability Engineering (SRE) Join to apply for the Senior Manager – Site Reliability Engineering (SRE) role at Dropsuite Senior Manager – Site Reliability Engineering (SRE) 1 day ago Be among the first 25 applicants Join to apply for the Senior Manager – Site Reliability Engineering (SRE) role at Dropsuite Get...


  • Singapore TechBridge Market Full time

    If you are passionate about playing a key role in the success of a purpose-led organization that is building a meaningful future through innovation, technology, and collective knowledge, we want to hear from you! Our client is a well-established brand in the Technology industry and is now looking for a passionate and driven **Production Management/SRE **to...


  • Singapore Tencent Full time

    Overview Tencent Overseas Big Data Platform SRE Engineer/Senior SRE Responsibilities Oversee operation and maintenance of Tencent's overseas big data platforms to ensure platform and cluster stability. Assist users in resolving issues related to big data platform usage, including but not limited to platform operations, component errors, and performance...

  • Senior Sre

    7 days ago


    Singapore PROXIMA BETA PTE. LIMITED Full time

    Responsible for high-quality and efficient delivery/change of supporting SRE/Devops work. - Responsible for the ability to quickly troubleshoot system defects and debug online problems. - Responsible for the construction of DevOps platform, improving development efficiency through research on new technologies. - Establish automated, intelligent operation and...


  • Singapore HCLTech Full time

    Direct message the job poster from HCLTech Deputy Manager - Talent Acquisition Growth Markets, APME at HCLTech The following responsibilities and requirements describe the role of a Senior Site Reliability Engineer (SRE) with 10–15 years of experience. The candidate will focus on building, managing, and optimizing reliable, scalable, and secure systems...

  • Senior SRE

    4 days ago


    Singapore Oxford Knight Full time

    Salary: up to k SGD base Summary High-frequency prop trading firm with offices worldwide looking for skilled Senior Site Reliability Engineer developer to join their High Performance Computing team, developing and supporting their large-scale compute and storage platform. This platform is designed to solve demanding problems - both business and...

  • Senior SRE

    1 week ago


    Singapore Oxford Knight Full time

    Salary: 200k base + bonus Summary High-frequency prop trading firm with offices worldwide looking for skilled Senior Site Reliability Engineer developer to join their High Performance Computing team, developing and supporting their large-scale compute and storage platform. This platform is designed to solve demanding problems - both business and...

  • Senior SRE

    1 week ago


    Singapore Oxford Knight Full time

    Salary: 200k base + bonus Summary High-frequency prop trading firm with offices worldwide looking for skilled Senior Site Reliability Engineer developer to join their High Performance Computing team, developing and supporting their large-scale compute and storage platform. This platform is designed to solve demanding problems - both business and...