Site Reliability Engineer

6 days ago


Singapore HELLO PLANET PTE. LTD. Full time

We are a global dating app created to give everyone a chance at love. The sense of belonging and connectedness we get from relationships helps us survive and thrive, and we’re working to make it a little easier for people to find that. We’re inspired by the stories we hear from employees, friends, and family who have used our app to transform their lives, and you, too, can make a difference by joining us.

We are looking for a talented Senior Site Reliability Engineer to help design the future of dating. This individual will bring extensive experience in running large-scale data sources in the cloud and will be responsible for modernizing our data source handling and maintaining our core infrastructure and services on AWS.

This role will be based in Singapore and report directly to the CTO.

**Responsibilities**:

- Architect, develop, and maintain our core infrastructure and services on AWS, focusing on high availability, performance, and scalability.
- Specific AWS services of interest include EC2, RDS, S3, ElastiCache, CloudWatch, Redshift, OpenSearch, and VPC.
- Implement and manage continuous deployment processes to achieve seamless deployment of services with minimal downtime.
- Develop and maintain automated tools for infrastructure provisioning, configuration, and deployment.
- Work closely with development teams to integrate infrastructure builds and operational best practices into the software development lifecycle.
- Conduct root cause analysis for production errors and implement strategies to prevent future occurrences.
- Manage and optimize network configurations to ensure secure and efficient data flow and access.
- Administer and maintain databases, ensuring their reliability, performance, and security.
- Lead capacity planning efforts to ensure that our infrastructure scales in line with demand while optimizing costs and maintaining performance.
- Modernize data source handling (Redshift, Postgres, RDS, etc.).
- Manage Kubernetes workloads.

**Qualifications**:

- Bachelor's degree in Computer Science, Engineering, or a related field.
- 5+ years of industry experience.
- Proven experience as an SRE or DevOps Engineer, or in a similar role, in a cloud-based environment.
- Strong expertise in AWS services and tools.
- Experience with database administration, including performance tuning, backup and recovery processes, and security management.
- Proficiency in scripting languages (e.g., Python, Bash) and automation tools (e.g., Terraform).
- Excellent problem-solving skills and the ability to work independently or as part of a team.
- **Strong Written and Verbal Communication**: Fluent in English (both written and verbal); proficiency in Chinese is a must.
- Significant experience in capacity planning and cost management within cloud environments.
- Experience with Kubernetes.
- Familiarity with Terraform for general systems maintenance.
- Experience with data sources like Redshift, Postgres (Citus, Patroni), and RDS.

**Preferred Qualifications**:

- AWS SysOps Administrator Associate or AWS Solutions Architect Professional (SAP) certification.
- Experience with Spotinst for cost optimization.
- Familiarity with additional scripting languages such as Go or JavaScript.

If you're passionate about tackling big challenges and have the skills to help us shape the future of online dating, we want to hear from you.


