Senior Site Reliability Engineer, Platform
1 month ago
Department
: PlatformOur Platform organization’s purpose is to enable Gemini to scale effectively and empower our engineering teams to focus on building innovative financial products and experiences for individuals around the world. Within Platform, the Site Reliability Engineering team is responsible for partnering with Gemini’s other engineering teams to ensure all our systems are architected, engineered and deployed to be resilient, reliable and performant.
The Embedded SRE team is a part of Site Reliability Engineering with a focus on engaging directly with our other engineering teams to onboard them onto our platform systems, reviewing and recommending design and architectural decisions, and guiding our engineering teams on how to implement the tooling provided by the larger Platform organization required to ensure systems can scale and react to changing conditions, with continuous improvement loops.
The Role: Senior Site Reliability Engineer
You will be an integral part of leading Gemini’s engineering teams towards modern DevOps practices, both by developing and providing modern automation and operational tooling, and working cross-functionally across Gemini’s engineering teams to influence and shape our development practices and culture.
Responsibilities:
Provide primary operational support and engineering for various Gemini services Improve reliability, quality and time-to-market across all Gemini services and offerings Guide engineering teams onto the various supported services provided by Platform Run on-going performance evaluations and improvements for Gemini systems Architecture recommendations and engagement as part of SDLC Create “Production-ready Scorecards” to evaluate the health of systems pre-launch Implement and teaching monitoring, alerting and automated resolution best practices Define SLIs, SLOs with Engineering teams Educate and guide Engineering teams on reliability and resiliency best practices, like statelessness, chaos testing, blue/green deployments etc. Build operational tooling and automationsQualifications:
7+ years using monitoring, alerting, and automation tooling to understand and remediate performance and health issues in systems at scale Good knowledge for various cloud technology providers like AWS, GCP, or Azure Experience in a code-first environment, developing automated solutions to solve support and operational issues Experience as a Technical Leader within a team, helping evaluating and making tech decisions for the team Experience working with containerization such as Nomad, EKS (k8s), Docker, etc. Experience working with Configuration Management such as Ansible, Chef, Puppet Experience writing scripts or cli tools that help increase Developer Productivity in high-level languages like Python, Go, etc. Experience analyzing system and application performance, identifying bottlenecks, and recommending architectural or systemic improvements Experience working with Engineering teams, teaching, training, and mentoring on how to implement best-practice technical solutions Experience working in a code-drive, automation-first public cloud infrastructure (Terraform)It Pays to Work Here
We take a holistic approach to compensation at Gemini, which includes:
Health, Vision and Dental insurance covered at 100% for employees and dependents At least 12 weeks paid Parental Leave Up to 14 paid vacation days (in addition to public/bank holidays) Business Travel Medical Insurance-
Site Reliability Engineer
1 month ago
Singapur, Singapore Encora Inc. Full timeSite Reliability Engineer Location: Singapore Experience: 5 years Job Mode: Full-time Work Mode: On-site The Site Reliability Engineer/Software Engineer is a contract position responsible software and systems engineering to build and run large-scale, distributed, fault-tolerant systems. As a SRE you will help to ensure that our services are reliable,...
-
Site Reliability Engineer
4 weeks ago
Singapur, Singapore Encora Inc. Full timeSite Reliability Engineer Location: Singapore Experience: 5 years Job Mode: Full-time Work Mode: On-site The Site Reliability Engineer/Software Engineer is a contract position responsible software and systems engineering to build and run large-scale, distributed, fault-tolerant systems. As a SRE you will help to ensure that our services are reliable,...
-
Senior Site Reliability Engineer
2 months ago
Singapur, Singapore Sea Full timeOur Infrastructure team provides the end-to-end managed services and solutions for the Group's entire Internet infrastructure alongside running business applications. We excel in building the architecture, providing solutions and operations of data centre, connectivity, cloud, networking, system, storage and security. We are a proud provider of high-quality...
-
Senior Site Reliability Engineer
2 weeks ago
Singapur, Singapore Sea Full timeOur Infrastructure team provides the end-to-end managed services and solutions for the Group's entire Internet infrastructure alongside running business applications. We excel in building the architecture, providing solutions and operations of data centre, connectivity, cloud, networking, system, storage and security. We are a proud provider of high-quality...
-
Senior Site Reliability Engineer
4 weeks ago
Singapur, Singapore Sea Full timeOur Infrastructure team provides the end-to-end managed services and solutions for the Group's entire Internet infrastructure alongside running business applications. We excel in building the architecture, providing solutions and operations of data centre, connectivity, cloud, networking, system, storage and security. We are a proud provider of high-quality...
-
Site Reliability engineer
2 weeks ago
Singapur, Singapore Renesas Electronics Full timeJob DescriptionOverviewWe are seeking a skilled and experienced Site Reliability Engineer to join our team. In this role, you will be part of the AI & Cloud Engineering (ACE) Division and AI Workbench team. Our AI Workbench is a cloud-based environment to accelerate Automotive AI Software Development and Evaluation. The AI Workbench has 4 main functional...
-
Senior Site Reliability Engineer
3 weeks ago
Singapur, Singapore TIKTOK PTE. LTD. Full timeAbout TiktokTikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul, and Tokyo.Why Join UsCreation is the core of TikTok's purpose. Our platform is built to help imaginations thrive....
-
Site Reliability Engineer
2 months ago
Singapur, Singapore Sea Full timeOur Infrastructure team provides the end-to-end managed services and solutions for the Group's entire Internet infrastructure alongside running business applications. We excel in building the architecture, providing solutions and operations of data centre, connectivity, cloud, networking, system, storage and security. We are a proud provider of high-quality...
-
Senior Site Reliability Engineer
1 month ago
Singapur, Singapore Shopee Full timeSenior Site Reliability Engineer (Promotion) - Engineering Infra DepartmentEngineering and TechnologyLevelExperienced (Individual Contributor)LocationSingapore The Engineering and Technology team is at the core of the Shopee platform development. The team is made up of a group of passionate engineers from all over the world, striving to build the best...
-
Senior Site Reliability Engineer
4 weeks ago
Singapur, Singapore Shopee Full timeSenior Site Reliability Engineer (Promotion) - Engineering Infra DepartmentEngineering and TechnologyLevelExperienced (Individual Contributor)LocationSingapore The Engineering and Technology team is at the core of the Shopee platform development. The team is made up of a group of passionate engineers from all over the world, striving to build the best...
-
Site Reliability Engineer
4 weeks ago
Singapur, Singapore TikTok Full timeAbout the team Our Compute Platform SRE team supports all Big Data services and products across the company. We are a newly established team and waiting for talents like you to shape the team's future together. We are responsible for the reliability of all the company's major data warehouse products, services, and query engines. We serve business needs...
-
Singapur, Singapore United Overseas Bank Full timeAVP Site Reliability Engineer, Group Infrastructure Platform Services Posting Date: 21-May-2023 Location: Singapore, Singapore Company: United Overseas Bank Ltd About UOB United Overseas Bank Limited (UOB) is a leading bank in Asia with a global network of more than 500 branches and offices in 19 countries and territories in Asia Pacific,...
-
Singapur, Singapore United Overseas Bank Full timeAVP Site Reliability Engineer, Group Infrastructure Platform Services Posting Date: 21-May-2023 Location: Singapore, Singapore Company: United Overseas Bank Ltd About UOB United Overseas Bank Limited (UOB) is a leading bank in Asia with a global network of more than 500 branches and offices in 19 countries and territories in Asia Pacific,...
-
Site Reliability Specialist
2 months ago
Singapur, Singapore IHiS Full timePosition OverviewThe Reliability Lead will support the reliability principal with senior management in strategy discussion for application & system improvement, and will also manage the reliability team. He/She will ensure that the existing site reliability engineering (SREs) initiatives, such as monitoring availability, uplifting capability and automoation...
-
Site Reliability Specialist
2 weeks ago
Singapur, Singapore IHiS Full timePosition OverviewThe Reliability Lead will support the reliability principal with senior management in strategy discussion for application & system improvement, and will also manage the reliability team. He/She will ensure that the existing site reliability engineering (SREs) initiatives, such as monitoring availability, uplifting capability and automoation...
-
Site Reliability Specialist
4 weeks ago
Singapur, Singapore IHiS Full timePosition OverviewThe Reliability Lead will support the reliability principal with senior management in strategy discussion for application & system improvement, and will also manage the reliability team. He/She will ensure that the existing site reliability engineering (SREs) initiatives, such as monitoring availability, uplifting capability and automoation...
-
Site Reliability Engineer
1 month ago
Singapur, Singapore NTT DATA Full timeJob Description NTT is a leading global IT solutions and services organisation that brings together people, data and things to create a better and more sustainable future.In today’s ‘iNTTerconnected’ world, connections matter more now than ever. By bringing together talented people, world-class technology partners and emerging innovators, we help our...
-
Site Reliability Engineer
4 weeks ago
Singapur, Singapore NTT DATA Full timeJob Description NTT is a leading global IT solutions and services organisation that brings together people, data and things to create a better and more sustainable future.In today’s ‘iNTTerconnected’ world, connections matter more now than ever. By bringing together talented people, world-class technology partners and emerging innovators, we help our...
-
Site Reliability Engineer
2 weeks ago
Singapur, Singapore Sea Full timeAbout Sea Labs IndonesiaSea Labs is at the core of the Sea platforms development, supporting diverse business lines from e-commerce, supply chain, games, payment and finance, among many others. The strong growth and unique positioning of Sea's e-commerce business, Shopee, spurred the launch of Sea Labs Indonesia. Since its inception, the group of passionate...
-
Senior PaaS Site Reliability Engineer
4 weeks ago
Singapur, Singapore Tencent Full timeResponsibilities: About the Company Tencent is a leading global technology company focused on connecting people and developing innovative products and services that improve the quality of life of people around the world. Founded in 1998 and publicly traded on the Hong Kong Stock Exchange since 2004, Tencent offers a variety of products and services,...