Reliability Software Engineer

1 week ago


Singapore Squarepoint Capital Full time

Squarepoint Capital is a global investment management firm that utilizes a diversified portfolio of systematic and quantitative strategies across financial markets to achieve high-quality, uncorrelated returns for our clients. We have deep expertise in trading, technology, and operations and attribute our success to rigorous scientific research. As a technology and data-driven firm, we design and build our own cutting-edge systems, from high-performance trading platforms to large-scale data analysis and compute farms. With offices around the globe, we emphasize true, global collaboration by aligning our investment, technology, and operations teams functionally around the world.

Role: Reliability Software Engineer

Team: Risk / Market Access

Department: Development

As a Reliability Software Engineer, you will play a critical role in ensuring the performance, stability, and availability of our software systems, as well as their day-to-day operations. As such, the team requires a high software development capacity, along with strong analytical skills.

You will primarily be developing reliability features directly in our applications, implementing observability capabilities, running benchmarks to measure performance, and building automation and tooling to support the operations of our systems. Operations are important to ensure business continuity, they include responding to level-2 support escalations, monitoring our infrastructure capacity, and tweaking system configuration to address user requests.

Position Overview:

  • System Reliability: Develop incremental stability, recovery, scalability, and performance improvements. Perform root cause analyses to understand the source of incidents. Suggest and implement remedial actions in response to incidents
  • Observability: Monitor, measure, and analyze the performance, availability, and stability of technology systems to identify areas of improvement and allow the team to take data-driven decisions
  • Performance Optimization: Optimize performance of production systems to address bottlenecks and improve system response times, resource utilization, and overall application performance
  • Automation and Tooling: Develop and maintain automation systems and tooling for operations, deployment, and incident management to reduce manual intervention and enhance system stability
  • Production Management: Provide level-2 support for incident response to ensure business uptime. Work closely with core developers and support teams to plan and prepare for scaling technology systems to accommodate user demands

Required Qualifications:

  • Education: Bachelor's degree in Computer Science or related subject
  • Experience: 4+ years proven experience in Software Engineering, Software Reliability, or similar role with hand-on experience in software development and providing L2 support
  • Experience of developing in Python or similar, and familiarity with version control systems such as git
  • Experience working in a Linux environment
  • Problem-Solving Skills: Strong analytical and problem-solving skills with a keen eye for detail and a proactive approach to resolving issues
  • Communication: Excellent communication and collaboration skills to work effectively with cross-functional teams
  • Adaptability: Ability to work in a fast-paced and dynamic environment, adapting to changing priorities and requirements
  • Automation and Tooling: Experience developing automation tools and implementing configuration management

Nice to have:

  • C++ or KDB/q development experience
  • Experience with Slurm, Airflow or middleware such as Kafka and AMPS


  • Singapore Squarepoint Capital Full time

    Job Role: Reliability Software EngineerTeam: Market AccessAs a Reliability Software Engineer at Squarepoint Capital, you will play a critical role in ensuring the performance, stability, and availability of our software systems, as well as their day-to-day operations. The team requires a high software development capacity, along with strong analytical...


  • Singapore Squarepoint Capital Full time

    Squarepoint Capital, a global investment management firm, seeks a skilled Software Systems Specialist to join its technology team.The ideal candidate will have a strong background in software engineering and a proven track record of delivering high-quality, reliable systems. As a key member of the team, they will be responsible for designing, building, and...


  • Singapore Squarepoint Capital Full time

    Squarepoint Capital is a global investment management firm that leverages a diversified portfolio of systematic and quantitative strategies across financial markets to achieve high-quality, uncorrelated returns for clients.We have deep expertise in trading, technology, and operations, and attribute our success to rigorous scientific research. As a technology...


  • Singapore BYTEDANCE PTE. LTD. Full time

    About the JobAt ByteDance, we are looking for a talented Site Reliability Engineer to join our team. In this role, you will be responsible for ensuring the reliability and normal operation of multiple core systems for big data and online computing, while paying attention to system capacity and stability.Key Responsibilities Ensure the reliability and normal...


  • Singapore BYTEPLUS PTE. LTD. Full time

    Role OverviewAt ByteDance, we're seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you'll be responsible for ensuring the reliability and normal operation of multiple core systems for big data and online computing. This includes building automated operation solutions for large-scale systems, cooperating with the...


  • Singapore Aptitude Asia Full time

    Job SummaryAptitude Asia seeks a skilled Site Reliability Engineer to ensure the high reliability, availability, and performance of applications throughout their lifecycle.Key ResponsibilitiesReliability and Performance: Ensure applications operate with high reliability, availability, and performance.Automation and Innovation: Automate repetitive tasks and...

  • Reliability Engineer

    1 month ago


    Singapore Cognizant Full time

    About the RoleThe Reliability Engineer plays a crucial role in ensuring the stability and reliability of the manufacturing plant's systems, infrastructure, and digital capabilities. This position requires a proactive approach to prevent downtime, optimize system performance, and rapidly deploy new features and capabilities.Key ResponsibilitiesSystem...


  • Singapore ADDVALUE INNOVATION PTE LTD Full time

    Roles & ResponsibilitiesReliability EngineerResponsibilities Work with product development teams to develop relaibility requirements, establish a reliability / test program and perform appropriate analyse to ensure that new products meet all the relaibility targets. Perform risk / reliabilty analysis (FMEA, FMECA, MTBF) for existing and new products Able...

  • Reliability Engineer

    1 month ago


    Singapore Cognizant Full time

    About the role The Reliability Engineer ensures stability of the manufacturing plant, systems health, lifecycle management, user satisfaction. Prioritizing digital capabilities and infrastructure's reliability, performance, and efficiency is a must. All employees involved in the development and maintenance of these services must work collaboratively to...


  • Singapore JOHN CRANE SINGAPORE PTE LTD Full time

    Roles & ResponsibilitiesPurpose of roleTo provide on-site support and manage reliability contract of mechanical seals for customers in Singapore.Roles and Responsibilities Assist and guide seal installation / removal on site On site pump /seal initial assessment and inspection Commissioning of seal system Perform 5-point checks on the equipment’s prior...


  • Singapore HORIZON SOFTWARE PTE. LTD. Full time

    Roles & ResponsibilitiesJob SummaryWe are seeking an experienced Senior DevOps Engineer to lead our DevOps initiatives, optimize deployment pipelines, and ensure the scalability and reliability of our systems. The ideal candidate will have a strong background in cloud architecture, automation, and be able to mentor junior team members.Key Responsibilities ...


  • Singapore Aptitude Asia Full time

    At Aptitude Asia, we're seeking a skilled Site Reliability Engineer to join our team. This role is crucial in ensuring the high reliability, availability, and performance of our applications throughout their lifecycle.Key Responsibilities:Develop and implement automation scripts to streamline repetitive tasks and address recurring issues.Collaborate with...


  • Singapore Oxford Knight Full time

    Job Title: Senior Site Reliability EngineerOxford Knight is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a Senior Site Reliability Engineer, you will be responsible for designing, developing, and maintaining our Linux trading infrastructure on a day-to-day basis.Key Responsibilities:Lead the design and development of major...


  • Singapore ITCAN PTE. LIMITED Full time

    Roles & ResponsibilitiesRoles & Responsibilities:The Site Reliability Engineer (SRE) combines software development and system engineering to build and run distributed solutions in a secured multi-tier heterogeneous environment to safeguard, provide and continuously improve the software and systems behind the organization’s cloud platform solutions.The Job:...


  • Singapore AVAGO TECHNOLOGIES INTERNATIONAL SALES PTE. LIMITED Full time

    Roles & ResponsibilitiesMinimum qualifications: Bachelor’s degree in electrical engineering, or an engineering field. 10 years of experience as a Semiconductor IC, Quality, Reliability or Product Engineer. Experience in a technical team and/or people management. Experience with quality control, test coverage, and IC qualification.Preferred...


  • Singapore NodeFlair Full time

    Senior Site Reliability EngineerWe are working with one of the leading pioneers in the Cryptocurrency space as part of their continued growth, NodeFlair has been engaged to search for a Senior Site Reliability Engineer to join their team.About the RoleOur client, a top player in cryptocurrency data monitoring, tracks over 10,000 tokens on 400+ exchanges with...


  • Singapore NodeFlair Full time

    Senior Site Reliability EngineerWe are working with one of the leading pioneers in the Cryptocurrency space as one of the largest data platforms, and as part of their continued growth, NodeFlair has been engaged to search for a Senior Site Reliability Engineer to join their Singapore/Remote team.About the RoleOur client, a top player in cryptocurrency data...


  • Singapore NodeFlair Full time

    Senior Site Reliability EngineerWe are working with NodeFlair, a leading pioneer in the Cryptocurrency space, to find a Senior Site Reliability Engineer to join their Singapore/Remote team.About the RoleOur client, a top player in cryptocurrency data monitoring, tracks over 10,000 tokens on 400+ exchanges with 300 million page views from 100+ countries.Key...


  • Singapore NodeFlair Full time

    Senior Site Reliability EngineerWe are working with NodeFlair, a leading pioneer in the Cryptocurrency space, to search for a Senior Site Reliability Engineer to join their Singapore/Remote team.About the RoleOur client, a top player in cryptocurrency data monitoring, tracks over 10,000 tokens on 400+ exchanges with 300 million page views from 100+...


  • Singapore NodeFlair Full time

    About the RoleWe are working with one of the leading pioneers in the Cryptocurrency space as one of the largest data platforms, and as part of their continued growth, NodeFlair has been engaged to search for a Senior Site Reliability Engineer to join their team.Key ResponsibilitiesCollaborate with the software development team to design and implement...