Staff Site Reliability Engineer, Platform

3 days ago


Singapur, Singapore GEMINI Full time

About the Role

Gemini's Platform organization is responsible for enabling the company to scale effectively and empower engineering teams to focus on building innovative financial products and experiences. As a Staff Site Reliability Engineer, you will play a key role in leading Gemini's engineering teams towards modern DevOps practices, developing and providing modern automation and operational tooling, and working cross-functionally to influence and shape development practices and culture.

Responsibilities

  • Provide primary operational support and engineering for various Gemini services
  • Improve reliability, quality, and time-to-market across all Gemini services and offerings
  • Guide engineering teams onto the various supported services provided by Platform
  • Run ongoing performance evaluations and improvements for Gemini systems
  • Architecture recommendations and engagement as part of the SDLC
  • Create production-ready scorecards to evaluate the health of systems pre-launch
  • Implement and teach monitoring, alerting, and automated resolution best practices
  • Define SLIs, SLOs with engineering teams
  • Educate and guide engineering teams on reliability and resiliency best practices, like statelessness, chaos testing, blue/green deployments, etc.
  • Design, build, and maintain operational tooling and automation that streamline processes and enhance system reliability

Qualifications

  • 7+ years using monitoring, alerting, and automation tooling to understand and remediate performance and health issues in systems at scale
  • Good knowledge of various cloud technology providers like AWS, GCP, or Azure
  • Expert in an infrastructure as code environment (Terraform), developing automated solutions to solve support and operational issues
  • Experience as a Technical Leader within a team, helping evaluate and make tech decisions for the team
  • Expert working with containerization such as Nomad, EKS (k8s), Docker, etc.
  • Expert working with Configuration Management such as Ansible, Chef, Puppet
  • Proficient writing scripts or CLI tools that help increase developer productivity in high-level languages like Python, Go, etc.
  • Expert analyzing system and application performance, identifying bottlenecks, and recommending architectural or systemic improvements
  • Experience working with engineering teams, teaching, training, and mentoring on how to implement best-practice technical solutions

What We Offer

  • Comprehensive health plans covered at 100% for employees and dependents
  • Long-term incentive in the form of a new hire equity grant
  • Paid Parental Leave
  • Up to 14 paid vacation days (in addition to public/bank holidays)


  • Singapur, Singapore GEMINI Full time

    Department : Platform Our Platform organization’s purpose is to enable Gemini to scale effectively and empower our engineering teams to focus on building innovative financial products and experiences for individuals around the world. Platform focuses around building a scalable and secure foundations platform, enabling Engineering to deploy, validate,...


  • Singapur, Singapore Tencent Full time

    About the RoleTencent Games is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the stability and performance of our cloud-based data platforms.ResponsibilitiesDesign and implement automated operation and maintenance tools to improve deployment efficiency and reduce...


  • Singapur, Singapore Tencent Full time

    Job Title: Site Reliability EngineerTencent Games is a leading global platform for game development, operations, and publishing, and the largest online game community in China. We are seeking a highly skilled Site Reliability Engineer to join our team.Responsibilities:Maintain big data suites of overseas cloud platforms, monitoring and resource management,...


  • Singapur, Singapore Tencent Full time

    About TencentTencent Games is a leading global platform for game development, operations, and publishing, and the largest online game community in China.Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for maintaining big data suites of overseas cloud platforms,...


  • Singapur, Singapore GEMINI Full time

    About the RoleWe are seeking a highly skilled Staff Site Reliability Engineer to join our Platform organization at Gemini. As a key member of our team, you will play a critical role in enabling our engineering teams to focus on building innovative financial products and experiences for individuals around the world.Key ResponsibilitiesProvide primary...


  • Singapur, Singapore Citadel Securities Full time

    Job Title: Site Reliability EngineerAbout the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Citadel Securities. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our distributed systems and applications.Responsibilities:Design and implement scalable and efficient...


  • Singapur, Singapore Shopee Full time

    Company Name: ShopeeTitle: Lead Site Reliability EngineerJob Overview:Join a dedicated Engineering and Technology team at Shopee, where innovation meets reliability.Engage in the development and upkeep of essential marketplace operations.Work across comprehensive platforms and solutions, focusing on system design and optimization.Experience a vibrant work...


  • Singapur, Singapore Shopee Full time

    Company Name: ShopeeTitle: Lead Site Reliability EngineerJob Overview:Join a dedicated Engineering and Technology team at Shopee, where innovation meets reliability.Engage in the enhancement and upkeep of essential marketplace operations.Develop and refine comprehensive platforms and solutions, focusing on system design and optimization.Experience personal...


  • Singapur, Singapore Shopee Full time

    Company Name: ShopeeTitle: Lead Site Reliability EngineerJob Overview:Join a dedicated Engineering and Technology team at Shopee, where innovation meets reliability.Engage in the enhancement and upkeep of essential marketplace operations.Work across comprehensive platforms and solutions, focusing on system design and optimization.Experience personal and...


  • Singapur, Singapore Citadel Securities Full time

    Job Title: Site Reliability EngineerAbout the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Citadel Securities. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our distributed systems and applications.Responsibilities:Design and implement scalable and efficient...


  • Singapur, Singapore Shopee Full time

    Company Name: ShopeeTitle: Lead Site Reliability EngineerJob Overview:Join a dedicated Engineering and Technology team at Shopee, where innovation meets passion.Engage in the development and upkeep of essential marketplace operations.Work across comprehensive platforms and solutions, focusing on system design and optimization.Experience personal and...


  • Singapur, Singapore Shopee Full time

    Company Name: ShopeeTitle: Lead Site Reliability EngineerJob Overview:An exceptional chance to become part of a dedicated Engineering and Technology team at Shopee.Engage in the enhancement and upkeep of essential marketplace operations.Contribute to comprehensive platform solutions, focusing on system design and optimization.Experience personal and...


  • Singapur, Singapore Citadel Securities Full time

    Job Title: Site Reliability EngineerAbout the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Citadel Securities. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our distributed systems and applications.Responsibilities:Design and implement scalable and reliable...


  • Singapur, Singapore Helius Full time

    Job Title: Site Reliability EngineerHelius is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design and implement infrastructure as code (IaC) using cloud services such...


  • Singapur, Singapore Helius Full time

    Job Title: Site Reliability EngineerHelius is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design and implement infrastructure as code (IaC) using cloud services such...

  • Staff Engineer

    3 weeks ago


    Singapur, Singapore Centre for Strategic Infocomm Technologies Full time

    About the RoleWe are seeking a highly skilled Staff Engineer to join our team at the Centre for Strategic Infocomm Technologies. As a key member of our engineering team, you will be responsible for designing, building, and implementing platforms and tools for messaging and collaboration to enable seamless communication and knowledge sharing across the...


  • Singapur, Singapore Shopee Full time

    About the RoleAs a Senior Site Reliability Engineer at Shopee, you will be responsible for managing the technical operations of our core marketplace businesses. This includes product lines such as shopee voucher management, shopee discount/coins management, shopee selling listing online, shopee intelligence and data, and more.Key ResponsibilitiesDesign and...


  • Singapur, Singapore Shopee Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our Engineering and Technology team at Shopee. As a key member of our team, you will be responsible for managing the technical operations of our core marketplace businesses, including product lines such as shopee voucher management, shopee discount/coins management, shopee...


  • Singapur, Singapore TikTok Full time

    About the team Our Compute Platform SRE team supports all Big Data services and products across the company. We are a newly established team and waiting for talents like you to shape the team's future together. We are responsible for the reliability of all the company's major data warehouse products, services, and query engines. We serve business needs...


  • Singapur, Singapore Citadel Securities Full time

    Job Title: Site Reliability EngineerJob Summary:Citadel Securities is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our distributed systems and applications.Responsibilities:Candidates with less than 3 years of experience should...