Site Reliability Engineer

4 weeks ago


Singapore Vega Solutions Full time

Join to apply for the Site Reliability Engineer role at Vega Solutions

Join to apply for the Site Reliability Engineer role at Vega Solutions

Get AI-powered advice on this job and more exclusive features.

Tokka Labs | Singapore | Full-Time
Tokka Labs is a proprietary trading firm with a focus on close collaboration, rigorous research, and cutting-edge technology. We are market makers, searchers, and solvers for top protocols on the most popular blockchains in the world. We design and implement our own trading systems and strategies to provide liquidity in the most diverse and challenging environments. At the core of it all lies our unwavering commitment to pushing boundaries of decentralized finance and we are always on the lookout for like-minded individuals to join us on this journey. If you think you have what it takes, apply now

Tokka Labs | Singapore | Full-Time
Tokka Labs is a proprietary trading firm with a focus on close collaboration, rigorous research, and cutting-edge technology. We are market makers, searchers, and solvers for top protocols on the most popular blockchains in the world. We design and implement our own trading systems and strategies to provide liquidity in the most diverse and challenging environments. At the core of it all lies our unwavering commitment to pushing boundaries of decentralized finance and we are always on the lookout for like-minded individuals to join us on this journey. If you think you have what it takes, apply now
Position Summary
As a Site Reliability Engineer (SRE), you will play a crucial role in maintaining and enhancing the security, stability, scalability, and cost-effectiveness of our systems. You will leverage your expertise in tools like Terraform, Ansible, Kubernetes, and AWS, as well as your networking skills, to build and manage a robust infrastructure.
Key Responsibilities

  • System Monitoring and Incident Response:
○ Continuously monitor the performance, availability, and security of systems.
○ Quickly respond to incidents, conducting root cause analysis, and implementing solutions to prevent recurrence.
  • Infrastructure Automation:
○ Automate infrastructure deployment and management using Terraform, Ansible, and related tools.
○ Optimize cloud environments, particularly AWS, to ensure efficient resource use and cost control.
  • Kubernetes and Container Management:
○ Manage containerized applications using Kubernetes, ensuring high availability and scalability.
○ Develop and implement strategies for effective container orchestration and management.
  • Security and Compliance:
○ Implement and maintain security best practices across the infrastructure.
○ Conduct regular security audits and vulnerability assessments to protect against potential threats.
  • Network Management:
○ Design, implement, and manage network infrastructure to support system stability and performance.
○ Troubleshoot and resolve network-related issues, ensuring minimal downtime.
  • Capacity Planning and Performance Optimization:
○ Plan for future infrastructure needs, ensuring the system scales efficiently.
○ Continuously analyze system performance and apply improvements for better stability and cost efficiency.
○ Continuously looking for better infrastructure suppliers, and benchmark the strength and weakness.
○ Explore and operate blockchain technologies, includes: blockchain node, network optimisation, etc.
  • Collaboration and Knowledge Sharing:
○ Work closely with software development, DevOps, and IT teams to align infrastructure strategies with business needs.
○ Document processes, share knowledge with team members, and mentor junior engineers.
Required Qualifications
  • Education: Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience).
  • Experience:
○ 3+ years of experience in Site Reliability Engineering, DevOps, or a related role.
○ Proven experience with Terraform, Ansible, Kubernetes, and AWS.
Skills
○ Strong networking skills and experience with cloud networking.
  • Skills:
  • Demonstrated expertise in scripting and automation, with proficiency in Python, Bash, and related tools.
  • Extensive knowledge of Unix/Linux systems, including system administration and troubleshooting.
  • Strong analytical capabilities, with a proven track record in performance tuning and cost optimization.
  • Exceptional communication and interpersonal skills, with the ability to collaborate effectively across cross-functional teams.
  • Consistently meets deadlines and ensures timely completion of tasks through effective time management and attention to detail.
  • Proactive, accountable, and highly self-motivated, with a strong sense of ownership and ability to work independently with minimal supervision.
  • Continuously strives for improvement and seeks opportunities to enhance processes and outcomes.
Preferred Qualifications
  • Experience with multi-cloud environments.
  • Familiarity with database management and data security.
  • Knowledge of CI/CD pipelines and automation tools.
Seniority level
  • Seniority level Mid-Senior level
Employment type
  • Employment type Full-time
Job function
  • Job function Engineering and Information Technology
  • Industries Blockchain Services

Referrals increase your chances of interviewing at Vega Solutions by 2x

Sign in to set job alerts for "Site Reliability Engineer" roles. Site Reliability Engineer Intern - 2025 Start Production Engineer / Site Reliability Engineer Software Engineer Intern, Dev Infra - 2025 Start

Bedok, East Region, Singapore 10 hours ago

WeChat - Senior Site Reliability Engineer Information Technology - Cloud/DevOps Engineer Site Reliability Engineer-(Fresh-Grad)(A98145) Software Development Engineer in Test Intern , TikTok - 2025 Start Backend Software Engineer, Global LIVE Fund Safety Intern- 2025 Start Site Reliability Engineer (SRE) (GovTech) Site Reliability Engineer (EMEA, Japan, Singapore, Australia) Tencent Hunyuan LLM Site Reliability Engineer / Senior SRE

We're unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.

#J-18808-Ljbffr

  • Singapore Sea Limited Full time

    Engineering and Technology - Infrastructure, Singapore - Entry Level Our DevOps Engineering team plays an important role in developing and maintaining the internal systems and tools for the Infrastructure team. As a Site Reliability Engineer, you are responsible for improving the availability and reliability of our Infrastructure services. - Responsible for...


  • Singapore Hyphen Connect Full time

    Site Reliability Engineer (Crypto Trading) Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect Site Reliability Engineer (Crypto Trading) 2 days ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect We are hiring for one of our ecosystem projects in...


  • Singapore DHATCH CONSULTANCY PTE. LTD. Full time

    Site Reliability Engineer: **Preferred Qualifications** - 3+ years of experience in site reliability engineering, DevOps, or software engineering roles. - Proven skills in: - Monitoring & alerting tools (Grafana, New Relic) - CI/CD pipelines (Git, Jenkins, GitHub Actions, etc.) - Container orchestration (Docker, Kubernetes) - Infrastructure-as-code...


  • Singapore TRUEWATCH TECHNOLOGY INC PTE. LTD. Full time

    **Responsibility**: - Run production environment by monitoring availability and taking a holistic view of the system health. - Achieve site reliability automation, minimize system downtime, and reduce site reliability cost. - Manage risks and resolves issues that affect the release scope, schedule and quality. - Suggest architecture improvements, push for...


  • Singapore TEAMLEASE DIGITAL CONSULTING PTE. LTD. Full time

    As a Site Reliability Engineer, you will be filling a mission-critical role ensuring that our systems are healthy, monitored, automated, fault-tolerant and designed to scale. You will collaborate and work closely with engineering teams to continually improve our production services, facilitating fast delivery of new products, and reducing downtime. Key...


  • Singapore HCLTech Full time

    Get AI-powered advice on this job and more exclusive features. This role combines software and systems engineering to build run, and maintain high performant, distributed, fault tolerant and resilient financial systems. Site Reliability Engineers focus on ensuring a joyful customer journey. As a Site Reliability Engineer you will be filling a...


  • Singapore Vega Solutions Full time

    Join to apply for the Site Reliability Engineer role at Vega SolutionsJoin to apply for the Site Reliability Engineer role at Vega SolutionsGet AI-powered advice on this job and more exclusive features.Tokka Labs | Singapore | Full-TimeTokka Labs is a proprietary trading firm with a focus on close collaboration, rigorous research, and cutting-edge...


  • Singapore Tardis Group Full time

    Direct message the job poster from Tardis Group Recruiter at Tardis Group | Finding Top Talent in Tech & Quant About the Company A rapidly growing technology firm operating at the forefront of artificial intelligence and advanced software solutions. The company fosters a fast-paced, collaborative, and innovation-driven culture, uniting talent across...


  • Singapore HCLTech Full time

    Get AI-powered advice on this job and more exclusive features.This role combines software and systems engineering to build run, and maintain high performant, distributed, fault tolerant and resilient financial systems. Site Reliability Engineers focus on ensuring a joyful customer journey.As a Site Reliability Engineer you will be filling a mission-critical...


  • Singapore JJ Consulting Services Full time

    Our Client is a fast growing company in Singapore, who is seeking to recruit a Site Reliability Engineer. **Site Reliability Engineer** **Key Roles & Responsibilities** - Providing ancillary support of Enterprise-Grade Products and solutions at customer's sites - Ironing out deployment issues or challenges that our customers may face - Responsible for...