Site Reliability Engineer
2 weeks ago
**What the role is**
- The mission of Housing & Development Board (HDB) is to provide affordable, quality housing and a great living environment where communities thrive. To achieve its mission, HDB aims to be data-driven to the core and adopt evidence-based decision making in developing better housing policies service, improving service delivery and optimising operations.
**What you will be working on**
- You will be part of the Information Services Group that leads the development and implementation of enterprise-wide ICT solutions for HDB, working closely with in-housed or outsourced development teams to create and maintain scalable and highly reliable systems. Your goal is to ensure the smooth delivery of digital services to delight our customers.
You will work in cross-functional teams. Be results oriented with strong ability to collaborate with and engage stakeholders. You should also possess good problem-solving skills and an analytical mind; have excellent communication skills, both verbal and written; and be resilient to work in a fast-paced environment.
- Define and implement systems’ metrics and perform monitoring activities
- Define and implement automations, visualisations and alerts on systems’ health
- Oversee staging releases to Production, ensuring stability and maintaining quality & efficiency in large scale cloud environments
- Respond to and troubleshoot incidents, providing post-mortem analysis/ areas of improvement
- Collaborate with stakeholders and management, to identify and implement improvements towards efficient daily operations
**What we are looking for**
- Strong background in computer science, computer engineering, information technology or related field
- Minimum 3 years of experience in a SRE (Site Reliability Engineer) role, Infrastructure Engineering or Application support with DevOps
- Minimum 3 years of experience in one or more programming languages - Python/ Java and configuration management/ IAC tools - Ansible/ Terraform
- Strong experience in a Continuous Integration/Continuous Delivery (CI/CD) with hands-on working knowledge in Jira, Confluence, Gitlab
- Experience in container technologies using Docker or Kubernetes
- Experience in AWS cloud architectures and infrastructure management (Terraform / CloudFormation)
- Experience in design and implementation of observability platforms
- Experience driving major production incidents and organised incident retrospective Meetings
- Good understanding of and experience in review and providing recommendation covering the AWS well architected framework
- Understanding of IT Service Management and Operations for Cloud
- Familiarity with the Singapore Government Tech Stack is preferred but not a must
Good to have:
- Team Player; we work together as a team
- Independent and take ownership of work responsibilities
All applicants will be notified on whether they are shortlisted for the position within 4 weeks of the closing date of this job posting.
-
Site Reliability Engineer
3 days ago
Singapore PERSOLKELLY Full timeWe have partnered with a renowned global leader in information and communications technology (ICT) infrastructure and smart devices. They are providing full-stack, all-scenario solution for products and services carriers, enterprises, governments, and individual consumers worldwide. Our client is looking for enthusiastic Site Reliability Engineer to...
-
Site Reliability Engineer
7 days ago
Singapore TRUEWATCH TECHNOLOGY INC PTE. LTD. Full timeSite Reliability Engineer**Roles and Responsibilities**The Site Reliability Engineer plays a crucial role in ensuring the availability, reliability, and performance of our production environment.Monitor system health and take a holistic view to ensure optimal operation. Implement site reliability automation to minimize downtime and reduce costs. Manage...
-
Site Reliability Engineer
1 week ago
Singapore TRUEWATCH TECHNOLOGY INC PTE. LTD. Full timeRoles & ResponsibilitiesResponsibility:Run production environment by monitoring availability and taking a holistic view of the system health. Achieve site reliability automation, minimize system downtime, and reduce site reliability cost. Manage risks and resolves issues that affect the release scope, schedule and quality. Suggest architecture...
-
Site Reliability Engineer
1 week ago
Singapore TRUEWATCH TECHNOLOGY INC PTE. LTD. Full timeRoles & ResponsibilitiesResponsibility: Run production environment by monitoring availability and taking a holistic view of the system health. Achieve site reliability automation, minimize system downtime, and reduce site reliability cost. Manage risks and resolves issues that affect the release scope, schedule and quality. Suggest architecture...
-
Senior Site Reliability Engineer
1 week ago
Singapore GK CONSULTING PTE. LTD. Full timeSenior Site Reliability EngineerWe are seeking an experienced Senior Site Reliability Engineer to ensure the reliability, availability, and performance of our cloud-based internet services. The ideal candidate will be responsible for owning the reliability, availability, and user experience for assigned cloud services.
-
Site Reliability Engineer
3 days ago
Singapore beBee Careers Full timeSite Reliability Engineer - Front End Operations\We are seeking a skilled Site Reliability Engineer to join our team, focusing on front end operations. This role involves building and maintaining scalable systems, ensuring high availability, and implementing automation to optimize performance.\The successful candidate will have experience in containerized...
-
Site Reliability Engineer, SealSuite
2 days ago
Singapore ByteDance Full timeResponsibilitiesAbout the TeamOur team is dedicated to elevating the level of cybersecurity to fully support Bytedance as well as our clients' digital journey. We aim high at building the next-generation cybersecurity. Rooted from years of practical experience in the enterprise security domain within ByteDance, the team now runs as a business. We provide a...
-
Site Reliability Engineer, SealSuite
1 day ago
Singapore ByteDance Full timeResponsibilities About the Team Our team is dedicated to elevating the level of cybersecurity to fully support Bytedance as well as our clients' digital journey. We aim high at building the next-generation cybersecurity. Rooted from years of practical experience in the enterprise security domain within ByteDance, the team now runs as a business. We provide...
-
Site Reliability Engineer
5 days ago
Singapore Aptitude Asia Full timeSite Reliability Engineer (SRE) - Top-tier Hedge Fund Our client, a top-tier hedge fund, is looking to hire a talented Site Reliability Engineer to join their growing SRE team in Singapore. Job Responsibilities: Ensure high reliability, availability, and performance of applications throughout their lifecycle. Automate repetitive tasks and systematically...
-
Site Reliability Engineer
1 week ago
Singapore JJ Consulting Services Full timeOur Client is a fast growing company in Singapore, who is seeking to recruit a Site Reliability Engineer. **Site Reliability Engineer** **Key Roles & Responsibilities** - Providing ancillary support of Enterprise-Grade Products and solutions at customer's sites - Ironing out deployment issues or challenges that our customers may face - Responsible for...