
High Availability Infrastructure Engineer
2 days ago
High-Availability Systems Engineer
- Design, build, and maintain high-availability systems for customers.
We are seeking a highly skilled professional to join our infrastructure team, where you will design, build, and maintain high-availability systems that meet the needs of our customers.
Key Responsibilities:
- Develop and oversee performance-critical infrastructure for financial markets, ensuring maximum throughput, high resiliency, and minimal operational risk.
- Leverage deep Linux kernel expertise to fine-tune scheduling policies, interrupt routing, and NUMA resource allocation, ensuring predictable performance at scale.
- Build and maintain high-availability containerized environments using Kubernetes, Docker, and advanced orchestration tools with a strong focus on scalability and security.
- Lead automation initiatives with Ansible, Bash, and Python, eliminating manual intervention and improving system efficiency.
- Manage hybrid cloud infrastructure (AWS, Azure, GCP) with strict performance SLAs, security compliance, and cost-optimized deployments.
- Oversee infrastructure monitoring and observability using ELK Stack, Grafana, Site24x7, Splunk, and other enterprise-grade tools, ensuring proactive incident detection and resolution.
- Administer and troubleshoot enterprise storage and networking stacks, including RAID, NFS, SAN/NAS, TCP/IP networking, VMware/vCenter, and BIG-IP load balancers.
- Collaborate with development, DevOps, and security teams to design fault-tolerant systems and enforce infrastructure governance policies.
- Perform predictive capacity modeling, OS hardening, and patch compliance, along with benchmark-driven performance optimization for trading and real-time compute platforms.
- Provide expert-level outage resolution, coordinating cross-functional teams to deliver sustainable remediation and operational resilience.
Requirements:
- 8+ years of progressive experience in system administration, performance engineering, and reliability operations across enterprise and financial domains.
- Advanced proficiency in Linux internals with specialization in kernel performance tuning, NUMA-aware optimizations, and real-time workload handling.
- Proven hands-on experience with Kubernetes, Docker, and Ansible for large-scale automation and orchestration.
- Strong scripting and programming skills in Bash and Python, plus experience with perf and eBPF for system analysis.
- Demonstrated expertise in cloud operations across AWS, Azure, and GCP.
- Strong background in networking protocols (TCP/IP, FIX) and high-performance trading environments.
- Familiarity with storage systems (SAN, NAS, RAID) and database tuning (MySQL optimization).
- Experience implementing observability and monitoring solutions such as ELK, Grafana, Splunk, and Corvil.
-
High Availability Infrastructure Engineer
3 days ago
Singapore beBeeInfrastructure Full time $150,000 - $200,000
High-Availability Systems Engineer
Design, build, and maintain high-availability systems for customers. We are seeking a highly skilled professional to join our team. As a key member of our infrastructure team, you will be responsible for designing, building, and maintaining high-availability systems that meet the needs of our customers. Key...
-
High Availability Engineer
3 days ago
Singapore beBeeReliability Full time $180,000 - $200,000
Reliability Engineering Specialist
Job Overview: We are seeking an experienced Reliability Engineering Specialist to join our team. The ideal candidate will have a strong background in system administration, performance engineering, and reliability operations across enterprise and financial domains.
Key Responsibilities: Develop and maintain high-availability...
-
High Availability Engineer
2 weeks ago
Singapore beBeeHighAvailabilityEngineer Full time $90,000 - $120,000
Job Title: High Availability Engineer
Key Responsibilities:
- Ensure the stability, reliability, and efficient operation of services at all times.
- Responsible for core operational tasks such as resource provisioning and management, incident response, capacity management, monitoring, and reliability improvements.
- Review technical architecture design, assess...
-
High Availability Engineer
6 days ago
Singapore beBeeResilience Full time $80,000 - $120,000
Cloud Infrastructure Resiliency Specialist
Job Overview: Our team is seeking a skilled Cloud Infrastructure Resiliency Specialist to join our organization. In this role, you will be responsible for designing and implementing high availability (HA) and disaster recovery (DR) solutions in Microsoft Azure.
Key Responsibilities: Develop and implement HA and DR...
-
Singapore beBeeCloudEngineer Full time $80,000 - $120,000
Job Title: Cloud Infrastructure Engineer
As a key member of our Site Reliability Engineering team, you will play a pivotal role in ensuring the high availability and performance of our cloud-based applications. This involves deploying, monitoring, and supporting applications in a Kubernetes multi-tenant environment, with a focus on promoting overall...
-
Fostering High Availability
4 days ago
Singapore beBeeTechnology Full time $80,000 - $120,000
Role Overview: We are seeking a skilled Site Reliability Engineer to join our team. This role involves ensuring the smooth operation of our systems and infrastructure.
Key Responsibilities:
- Designing, implementing, and maintaining high-availability systems
- Collaborating with cross-functional teams to identify and resolve technical issues
- Developing and enforcing...
-
High-Availability Network Specialist
1 week ago
Singapore beBeeinfrastructure Full time $60,000 - $105,000
Network Infrastructure Specialist
We are seeking a skilled Network Infrastructure Specialist to support our enterprise network environment. This role will focus on ensuring high availability, security, and performance through effective maintenance, optimization, and upgrades of our network infrastructure.
-
Singapore beBeeInfrastructure Full time $80,000 - $120,000
Job Opportunity: System Infrastructure Specialist
We are seeking a skilled system infrastructure specialist to work on various tasks in an on-premises and cloud environment.
About the Role: The selected candidate will be part of the infrastructure team, contributing to mission-critical systems security, reliability, and scalability.
Key...
-
High Availability System Engineer
6 days ago
Singapore beBeeReliability Full time $120,000 - $180,000
Job Summary: A highly skilled system reliability engineer is required to ensure the stability, reliability, and efficient operation of global business operations.
Responsibilities:
- Develop and implement strategic plans for improving system reliability, ensuring high availability of services at all times.
- Lead core operational tasks such as resource provisioning...