Staff Platform/ SRE Engineer
2 weeks ago
About Grasshopper
Grasshopper is a quantitative trading technology provider based in Singapore, and is the holding company of Grasshopper Asset Management. Our state-of-the-art technology, built from the ground up in-house, puts us at the forefront of developments in electronic trading. An unbroken record of consistency and profitability is underpinned by firm values of curiosity, empowerment and flexibility.
About the role:
As a Staff Site Reliability Engineer on the Infrastructure Team, you will play a large role in advancing our research and batch computing capabilities. You will work closely with cross-functional teams to architect, develop and maintain scalable solutions on our Google Cloud and our on-premise Infrastructure.
As a key member of the Infrastructure Team, you'll
Design, implement, and maintain robust observability systems, including monitoring, logging, tracing and alerting, to ensure high availability, rapid incident detection, and deep system visibility across all services.Architect, develop and maintain scalable solutions on Google Cloud and on-premise infrastructureAdvancing and supporting our research clusterInvestigate infrastructure/application issues on a live production systemWorking together with developers to improve our development environment, including CI/CD, built tools, etc.Help drive an SRE mindset within the organisationWe'd love for you to have:
A background in either a high-frequency trading (HFT) or Research environment, preferably involving research and/or backtesting platforms.At least 8 years of solid background in Platform/SRE engineering.A good understanding of Kubernetes, encompassing both architectural design and operational management.Proficiency in GitOps principles, specifically with tools like Argo-CD and GitLab CI.Workflows (argo workflows) and batch/HPC type workloadsStrong grasp of cloud infrastructure, with practical experience in either AWS or GCP.Take initiative in investigating infrastructure/application issues on a live production system independentlyProficiency in programming languages such as Python or Go.Good interpersonal and collaboration skillsExcellent written and verbal communication skills with an eye for detail.Strong entrepreneurial spirit and the ability to adapt to changing requirements and technologies.A passion for learning and inventing new approaches to hard problemsIt will be highly useful if you have prior experience in the following technologies:
Previous exposure to K8s operators.Prior knowledge and experience in on-premises bare metal environments.Proficiency in containerisation technologies.Familiarity with configuration management tools such as Puppet, Chef, or Ansible.Understanding of security principles from both operational and implementation perspectives.Experience with Argo-CD and Argo Workflows for workflow automation.Knowledge of monitoring tools like Prometheus and the ELK stack (Elasticsearch, Logstash, Kibana).Familiarity with RedHat and CentOS-based Linux distributions.Prior contributions to open-source projects.Experience with CI tools like Gitlab CI and Jenkins for continuous integration and deployment.What we offer:
21 days annual leaveAn opportunity to learn from experienced professionals, fostering mentorship opportunities and personal growthComprehensive Insurance Package with extended coverage for dependentsWell stocked pantryAnnual Dental & Wellness budgetGym membershipEmployee bonus referralsCompetitive CompensationWhat you can expect working at Grasshopper:
At Grasshopper, you will be working in a diverse and dynamic environment with a flat hierarchy. With over 100 employees and 15 nationalities working in an open office, communication is essential to performance. To keep our edge as the "small giant" of trading technology, we give employees a high level of autonomy and encourage them to get creative, take risks, make mistakes and learn from them. The sprint is on
Grasshopper is an equal opportunity employer.
-
Staff Platform
5 days ago
Singapore Centre for Strategic Infocomm Technologies (CSIT) Full timeOverview Join to apply for the Staff Platform & SRE Engineer (Workplace Technology)role at Centre for Strategic Infocomm Technologies (CSIT) . You will be leading the design, development, integration, and operations of digital workplace platforms and end-user technologies. Drive platform architecture, software and security engineering practices, and site...
-
Staff Platform/sre Engineer
2 weeks ago
Singapore Grasshopper Pte Ltd Full time**What We Are Looking For**: As a Staff Engineer on the Infrastructure Team, you will play a large role in advancing our research and batch computing capabilities. You will work closely with cross-functional teams to architect, develop and maintain scalable solutions on our Google Cloud and our on-premise Infrastructure. **Responsibilities**: **As a Staff...
-
Staff Platform
5 days ago
Singapore Centre for Strategic Infocomm Technologies (CSIT) Full timeYou will be leading the design, development, integration, and optimizing enterprise-grade communication and collaboration platforms. Drive platform architecture, software and security engineering practices, and site reliability engineering (SRE) to ensure secure, scalable, and optimized systems. Champion modern engineering methodologies and contribute to...
-
Engineer, SRE
2 weeks ago
Singapore Rakuten Viki Full timeJoin to apply for the Engineer, SRE role at Rakuten Viki Rakuten International oversees 7 businesses with over 4,000 employees globally. The brand is recognized for leadership and innovation in e-commerce, digital content, advertising, entertainment and communications, bringing the joy of discovery and access to more than 1 billion members across the world....
-
Singapore Centre for Strategic Infocomm Technologies (CSIT) Full timeCSIT develops products to advance the national security interests of Singapore. We use our products in a wide range of operations, including but not limited to Counter-terrorism and Computer Network Defence. Join the awesome CSIT family and use cutting-edge technologies to protect the nation.- The Engineer will be responsible for ensuring the scalability,...
-
Staff Software Engineer
5 days ago
Singapore NodeFlair Full time**Job Summary**: **Salary** S$9,500 - S$12,500 / Monthly **Job Type** **Seniority** Senior **Years of Experience** At least 8 years **Tech Stacks** Hazelcast TCP Docker Elastic Java Grafana Linux Splunk NoSQL Kubernetes kafka SQL JSON Redis **Company Description** Visa is a world leader in digital payments, facilitating more than 215 billion payments...
-
DevOps / Sre Engineer
1 week ago
Singapore NodeFlair Full time**Job Summary**: **Salary** S$6,500 - S$8,000 / Monthly **Job Type** **Seniority** Mid **Years of Experience** At least 5 years **Tech Stacks** Container Powershell GitLab AWS Terraform Jenkins play GitLab CI CI ELK Git Azure Grafana Prometheus Splunk Kubernetes Ansible Python **Company Overview**: We are a leading technology fintech company at the...
-
Senior Observability
2 days ago
Singapore GIC Full timeA leading global investment firm in Singapore seeks an experienced AVP / VP, Observability & SRE Engineering to develop enterprise observability and service reliability strategies. The ideal candidate will have expertise in Datadog and AWS, with a deep understanding of SRE principles. This role encompasses the management of observability platforms and...
-
Site Reliability Engineer, Traffic Platform
2 weeks ago
Singapore ByteDance Full timeSite Reliability Engineer, Traffic Platform - Traffic SRE - 2025 Start Join to apply for the Site Reliability Engineer, Traffic Platform - Traffic SRE - 2025 Start role at ByteDance Site Reliability Engineer, Traffic Platform - Traffic SRE - 2025 Start 2 days ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer, Traffic...
-
Data Platform Engineer
5 days ago
Singapore TENTEN PARTNERS PTE. LTD Full timeWe are partnering with a well-established financial institution who is seeking a highly skilled Data Platform Engineer (SRE) to join their Technology team. This role is critical in ensuring the reliability, scalability, and performance of enterprise data platforms that power investment management operations. What You'll Do Drive Site Reliability Engineering...