Reliable Infrastructure Specialist

4 days ago


Singapore beBeeSite Full time $150,000 - $200,000
Job Description:

We are seeking a skilled Site Reliability Engineer to join our team in ensuring the stability, scalability, and performance of a cutting-edge platform.

This role requires a strong commitment to on-call ownership and a passion for building resilient, observable, and self-healing infrastructure.

The ideal candidate will have extensive experience operating and maintaining low-latency, distributed systems in production environments.

Key Responsibilities include designing, implementing, and maintaining scalable infrastructure for a high-performance, low-latency trading platform.

Operate and enhance Kubernetes and Nomad-based environments to ensure system stability, scalability, and security.

Develop infrastructure automation and deployment pipelines using Terraform, Ansible, ArgoCD, and GitHub Actions.

Collaborate with engineering teams to streamline service onboarding, automate repetitive tasks, and improve deployment efficiency.

Enhance observability and reliability through improved logging, metrics, tracing, and alerting using the Grafana ecosystem.

Perform root cause analysis and postmortems for production incidents, driving continuous improvements in system resilience and incident response.

Work with security and compliance teams to ensure infrastructure meets regulatory and organizational standards.

Support multi-environment deployments (dev, staging, testnet, mainnet) with a focus on safe rollouts, rollbacks, and configuration management.

Contribute to capacity planning, cost optimization, and infrastructure scaling strategies to support platform growth.

Experience & Skills Requirements:

  • 5+ years of relevant experience as DevOps/ SRE Engineers.
  • Proven ability to participate in an on-call rotation, demonstrating ownership in incident response and a focus on long-term system stability.
  • Extensive experience operating and maintaining low-latency, distributed systems in production environments.
  • Proficiency with cloud-native platforms and container orchestration tools, including AWS, GCP, Kubernetes, and Nomad.
  • Strong knowledge of Linux/Unix internals and the TCP/IP networking stack.
  • Proficiency in one or more of: Bash, Go, or Python.
  • Expertise in root cause analysis, performance tuning, and system-level debugging in complex service architectures.
  • Experience building and managing end-to-end infrastructure, including infrastructure as code, CI/CD pipelines, and monitoring systems.
  • Familiarity with modern GitOps workflows and tools such as GitHub Actions, ArgoCD, Argo Workflows, and Argo Events.
  • Ability to own production systems end-to-end, from infrastructure as code to automated monitoring and deployment workflows.

Bonus:

Experience with the Aeron messaging system is a strong advantage.



  • Singapore beBeeInfrastructure Full time $80,000 - $150,000

    Job Title: Reliability Infrastructure SpecialistJob SummaryWe are seeking a skilled Reliability Infrastructure Specialist to manage the reliability of game-related platforms and infrastructure across both cloud and on-premise environments.Key Responsibilities:Deploy, change, and troubleshoot infrastructure for overseas games and relevant components and...


  • Singapore Amazon Asia-Pacific Resources Private Limited (Singapore) Full time

    Bachelor's or Master’s degree in Reliability Engineering, Physics, Electrical, Mechanical or Materials Engineering or related field - 6+ years of Reliability Engineering work experience in high reliability industry - 4+ years experience with failure analysis activities and root cause analysis - 4+ years experience with accelerated life testing, stress...


  • Singapore beBeeInfrastructure Full time $35,000 - $55,000

    Job Title: Depot Infrastructure SpecialistThe Depot Infrastructure Specialist plays a vital role in ensuring the optimal performance and reliability of depot infrastructure.


  • Singapore beBeeInfrastructure Full time $60,000 - $90,000

    Job Title: Cloud Infrastructure SpecialistWe are seeking a highly skilled Cloud Infrastructure Specialist to design and implement scalable, secure, and cost-effective cloud infrastructure. In this role, you will be responsible for managing AWS services and delivering reliable cloud solutions to support our business needs.


  • Singapore beBeeInfrastructure Full time

    IT Infrastructure SpecialistAre you a technical expert looking for a challenging role in IT infrastructure management? We are seeking a skilled individual to join our team as an IT Infrastructure Specialist.This is a great opportunity to work with cutting-edge technology and develop your skills in a fast-paced environment. As an IT Infrastructure Specialist,...


  • Singapore ByteDance Full time

    [About ByteDance] Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create...


  • Singapore beBeeReliability Full time $150,000

    **Job Title:** Infrastructure Reliability Engineer We are seeking an experienced Infrastructure Reliability Engineer to join our Production Engineering team. As an integral member of this team, you will play a pivotal role in shaping the future of our product delivery. The ideal candidate will have a strong background in designing and implementing...


  • Singapore beBeeBlockchain Full time $130,000 - $180,000

    Job TitleWe are seeking a skilled Infrastructure Specialist to join our team.The successful candidate will have a strong background in system administration, networking, and security. They will be responsible for designing, implementing, and maintaining secure and reliable infrastructure to support the growth of our business.Key responsibilities...


  • Singapore beBeeReliability Full time

    **System Reliability Specialist**Are you passionate about ensuring the smooth operation of complex systems? We are seeking a skilled and experienced System Reliability Specialist to join our team.About the RoleThis is an exciting opportunity for a motivated and organized individual to work on building and maintaining robust infrastructure and tools that...


  • Singapore beBeeEngineering Full time

    Lead Infrastructure Maintenance SpecialistJob Description:The Lead Infrastructure Maintenance Specialist will be responsible for enhancing the safety and reliability of the railway infrastructure system by leading workflow effectiveness and efficiency improvement of its inspection, maintenance, and other supplementary activities.The scope can include the...