
Staff Site Reliability Engineer, Platform
2 weeks ago
Staff Site Reliability Engineer, Platform
**About the Company**
Gemini is a global crypto and Web3 platform founded by Tyler Winklevoss and Cameron Winklevoss in 2014. Gemini offers a wide range of crypto products and services for individuals and institutions in over 70 countries.
Crypto is about giving you greater choice, independence, and opportunity. We are here to help you on your journey. We build crypto products that are simple, elegant, and secure. Whether you are an individual or an institution, we help you buy, sell, and store your bitcoin and cryptocurrency.
At Gemini, our mission is to unlock the next era of financial, creative, and personal freedom.
**The Department: Platform**
Our Platform organization's purpose is to enable Gemini to scale effectively and empower our engineering teams to focus on building innovative financial products and experiences for individuals around the world. Platform focuses around building a scalable and secure foundations platform, enabling Engineering to deploy, validate, and operate their services in production, improve resiliency of the service and increase organizational efficiency by reducing operational toil and increase system efficiency through architectural evolution.
The Site Reliability Engineering team engages directly with our other engineering teams to onboard them onto our platform systems, reviewing and recommending design and architectural decisions, and guiding our engineering teams on how to implement the tooling provided by the larger Platform organization required to ensure systems can scale and react to changing conditions, with continuous improvement loops.
**The Role**:Staff Site Reliability Engineer**
You will be an integral part of leading Gemini's engineering teams towards modern DevOps practices, both by developing and providing modern automation and operational tooling, and working cross-functionally across Gemini's engineering teams to influence and shape our development practices and culture.
**Responsibilities**:
- Provide primary operational support and engineering for various Gemini services
- Improve reliability, quality and time-to-market across all Gemini services and offerings
- Guide engineering teams onto the various supported services provided by Platform
- Run on-going performance evaluations and improvements for Gemini systems
- Architecture recommendations and engagement as part of SDLC
- Create "Production-ready Scorecards" to evaluate the health of systems pre-launch
- Implement and teaching monitoring, alerting and automated resolution best practices
- Define SLIs, SLOs with Engineering teams
- Educate and guide Engineering teams on reliability and resiliency best practices, like statelessness, chaos testing, blue/green deployments, etc.
- Design, build, and maintain operational tooling and automation that streamline processes and enhance system reliability
**Qualifications**:
- 7+ years using monitoring, alerting, and automation tooling to understand and remediate performance and health issues in systems at scale
- Good knowledge for various cloud technology providers like AWS, GCP, or Azure
- Expert in an infrastructure as code environment (Terraform), developing automated solutions to solve support and operational issues
- Experience as a Technical Leader within a team, helping evaluating and making tech decisions for the team
- Expert working with containerization such as Nomad, EKS (k8s), Docker, etc.
- Expert working with Configuration Management such as Ansible, Chef, Puppet
- Proficient writing scripts or cli tools that help increase Developer Productivity in high-level languages like Python, Go, etc.
- Experience working with Engineering teams, teaching, training, and mentoring on how to implement best-practice technical solutions
**It Pays to Work Here**
We take a holistic approach to compensation at Gemini, which includes:
- Comprehensive health plans covered at 100% for employees and dependents
- Long-term incentive in the form of a new hire equity grant
- Paid Parental Leave
- Competitive paid time off
In Singapore, we have a hybrid work policy. Employees are expected to work from the office part of the week. We believe our hybrid approach increases productivity through more in-person collaboration where possible.
At Gemini, we strive to build diverse teams that reflect the people we want to empower through our products, and we are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or Veteran status. Equal Opportunity is the Law, and Gemini is proud to be an equal opportunity workplace. If you have a specific need that requires accommodation, please let a member of the People Team know.
Job ID 6194408
-
Site Reliability Engineer
1 week ago
Singapore Shopee Full timeDepartmentEngineering and Technology- LevelExperienced (Individual Contributor)- LocationSingaporeThe Engineering and Technology team is at the core of the Shopee platform development. The team is made up of a group of passionate engineers from all over the world, striving to build the best systems with the most suitable technologies. Our engineers do not...
-
Site Reliability Engineer
4 weeks ago
Singapore Hyphen Connect Full timeSite Reliability Engineer (Crypto Trading) Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect Site Reliability Engineer (Crypto Trading) 2 days ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect We are hiring for one of our ecosystem projects in...
-
Site Reliability Engineer
1 week ago
Singapore IDEMIA Full timeJoin to apply for the Site Reliability Engineer role at IDEMIA Join to apply for the Site Reliability Engineer role at IDEMIA Get AI-powered advice on this job and more exclusive features. PurposeThis role plays a critical part in ensuring reliability, scalability, and performance of our systems and services. You will work closely with development and...
-
Site Reliability Engineer
5 days ago
Singapore IDEMIA Full timeJoin to apply for the Site Reliability Engineer role at IDEMIA Join to apply for the Site Reliability Engineer role at IDEMIA Get AI-powered advice on this job and more exclusive features. PurposeThis role plays a critical part in ensuring reliability, scalability, and performance of our systems and services. You will work closely with development and...
-
Site Reliability Engineer
2 weeks ago
Singapore Hyphen Connect Full timeSite Reliability Engineer (Crypto Trading) Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect Site Reliability Engineer (Crypto Trading) 2 days ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect We are hiring for one of our ecosystem...
-
Site Reliability Engineer
5 days ago
Singapore IDEMIA Full timeJoin to apply for the Site Reliability Engineer role at IDEMIA Join to apply for the Site Reliability Engineer role at IDEMIA Get AI-powered advice on this job and more exclusive features. Purpose This role plays a critical part in ensuring reliability, scalability, and performance of our systems and services. You will work closely with development and...
-
Site Reliability Engineer, Traffic Platform
4 weeks ago
Singapore ByteDance Full timeSite Reliability Engineer, Traffic Platform - 2025 StartJoin to apply for the Site Reliability Engineer, Traffic Platform - 2025 Start role at ByteDanceSite Reliability Engineer, Traffic Platform - 2025 Start3 days ago Be among the first 25 applicantsJoin to apply for the Site Reliability Engineer, Traffic Platform - 2025 Start role at ByteDanceGet...
-
Singapore Airwallex Full time**About Airwallex** Airwallex is the only unified payments and financial platform for global businesses. Powered by our unique combination of proprietary infrastructure and software, we empower over 150,000 businesses worldwide - including Brex, Rippling, Navan, Qantas, SHEIN and many more - with fully integrated solutions to manage everything from business...
-
Site Reliability Engineer
4 weeks ago
Singapore Tardis Group Full timeDirect message the job poster from Tardis Group Recruiter at Tardis Group | Finding Top Talent in Tech & Quant About the Company A rapidly growing technology firm operating at the forefront of artificial intelligence and advanced software solutions. The company fosters a fast-paced, collaborative, and innovation-driven culture, uniting talent across...
-
Site Reliability Engineer
2 weeks ago
Singapore Tardis Group Full timeDirect message the job poster from Tardis Group Recruiter at Tardis Group | Finding Top Talent in Tech & Quant About the Company A rapidly growing technology firm operating at the forefront of artificial intelligence and advanced software solutions. The company fosters a fast-paced, collaborative, and innovation-driven culture, uniting talent across...