GEL – Site Reliability Engineer

3 weeks ago


Singapore TOSS-EX PTE. LTD. Full time
Roles & Responsibilities

Roles & Responsibilities

Job Purpose

The Site Reliability Engineer (SRE) combines software development and system engineering to build and run distributed solutions in a secured multi-tier heterogeneous environment to safeguard, provide and continuously improve the software and systems behind the organization’s cloud platform solutions.

The Job

· With a vigilant eye on their availability, latency, performance and capacity. Ultimately, you will view software as the primary tool to optimizing systems, building infrastructure and removing mundane work through automation.

· As part of Cloud Engineering Team, the SRE Engineer engages in and improves the full lifecycle of cloud platform solutions from design, deployment, operation and refinement with accuracy and in compliance with organization policies and security requirements.

· The SRE Engineer treats operations as a software problem and therefore will code to automate repetitive tasks and optimize cloud operations.

· Support services before go-live through activities like system design consulting, developing software platforms and launch reviews. Maintain post-live cloud operations by measuring and monitoring availability, latency and overall system health with any prompt and remediative actions.

· Scale sustainably through mechanisms like automation and evolve services/solutions, leveraging IaaS, CaaS and PaaS by pushing for changes that improves reliability and velocity.

· Deploy product updates as required while implementing integrations when they arise. Specifying, documenting and developing new product features, and writing automating scripts.

· Work with open-source technologies, CI/CD, SCM tools as necessary, and source control such as Bitbucket, implement organization containers (e.g. Docker and Kubernetes). Stay current with industry trends and propose new ways for business improvements.

· Takes accountability in considering business and regulatory compliance risks and takes appropriate steps to mitigate the risks.

· Maintains awareness of industry trends on regulatory compliance, emerging threats and technologies in order to understand the risk and better safeguard the company.

· Highlights any potential concerns /risks and proactively shares best risk management practices.

Our Requirements

· Strong hands on experience with using and designing VMware solution such as NSX-T, vRealize Suite, vSphere/vCenter is mandatory.

· Strong working experience on patch management for operating systems and middleware is mandatory. Eg, Windows, RedHat, Websphere, Weblogic, MSSQL etc.

· Hands on experience with creation and maintenance of VMware server templating/blueprints such as RedHat, Windows server templates.

· Hands on experience with infrastructure-as-code, orchestration, configuration management and provisioning tools is mandatory.

· Systematic problem-solving approach, coupled with effective communications skills and a sense of ownership and drive.

· Strong experience in a Continuous Integration/Continuous Delivery (CI/CD) environment with strong appreciation of change/version control process and methodologies.

· Worked with DevOps and Automation tools (E.g. Selenium, SOAPUI, Bamboo, Jenkins, Ansible, Marvin, Github, Bitbucket, Nexus, Jira, Confluence etc).

· Must code, debug and optimize code and automate mundane tasks.

· Experience in scripting languages such as Bash, Batch, Powershell, YAML etc.

· Experience implementing distributed solutions in a secured multi-tier heterogeneous environment is mandatory.

· High level of integrity, takes accountability of work and good attitude over teamwork.

· Takes initiative to improve current state of things and adaptable to embrace new changes.


Tell employers what skills you have

Kubernetes
VMware
Automation Tools
Scripting
Reliability
System Design
JIRA
Configuration Management
Operating Systems
Windows
Docker
Ansible
Orchestration
Software Development

  • Singapore TOSS-EX PTE. LTD. Full time

    Roles & ResponsibilitiesRoles & ResponsibilitiesJob PurposeThe Site Reliability Engineer (SRE) combines software development and system engineering to build and run distributed solutions in a secured multi-tier heterogeneous environment to safeguard, provide and continuously improve the software and systems behind the organization's cloud platform...


  • Singapore OCBC Full time

    Job Description:We are seeking a Site Reliability Engineer Leader to join our team at OCBC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our infrastructure. This role requires strong expertise in automating releases, continuous integration/delivery systems, and relevant infrastructure...


  • Singapore COMBUILDER PTE LTD Full time

    Roles & ResponsibilitiesWe are seeking talented and driven professionals to join our Site Reliability Engineering (SRE) team. This role involves helping organizations enhance the availability, performance, and resilience of their applications and services through the deployment and administration of Observability Platforms.Key ResponsibilitiesDeploy and...


  • Singapore FUNFLY PTE. LTD. Full time

    Roles & ResponsibilitiesPosition OverviewAs a site reliability engineer, you will be responsible for ensuring the smooth operation of game services by maintaining, monitoring, and responding to faults daily. They will develop automation tools to enhance operational efficiency and manage game servers for optimal performance. The role includes collaborating...


  • Singapore GK CONSULTING PTE. LTD. Full time

    Roles & ResponsibilitiesWe're seeking an experienced Senior Site Reliability Engineer to ensure the reliability, availability, and performance of our cloud-based internet services.Key Responsibilities1. Own reliability, availability, and user experience for assigned cloud services2. Develop and implement service governance initiatives to increase reliability...


  • Singapore TRINITY CONSULTING SERVICES PTE. LTD. Full time

    Roles & Responsibilities· Must have minimum 5 years' experience.· Strong technical knowledge and experience in supporting enterprise-level applications.· Proficiency in troubleshooting application issues, performing log analysis, and using monitoring tools.· Experience with databases and SQL query language.· Familiarity with software development life...


  • Singapore FLOWDESK ASIA PTE. LTD. Full time

    Roles & ResponsibilitiesAbout the jobAre you passionate about maintaining robust and high-performing infrastructures? Do you thrive in managing complex network environments and ensuring system reliability?Join our infrastructure team and help us elevate operational excellence to new heights.As a Site Reliability Engineer at Flowdesk, you will be at the heart...


  • Singapore HELLO PLANET PTE. LTD. Full time

    Roles & ResponsibilitiesWe are a global dating app created to give everyone a chance at love. The sense of belonging and connectedness we get from relationships helps us survive and thrive, and we're working to make it a little easier for people to find that. We're inspired by the stories we hear from employees, friends, and family who have used our app to...


  • Singapore PATSNAP PTE. LTD. Full time

    Roles & ResponsibilitiesAbout the RoleWe are looking for a skilled and experienced DevOps Engineer / Site ReliabilityEngineer (SRE) to ensure the high availability, stability, and performance of ourbusiness platform. This role will be responsible for designing and implementing scalableand maintainable DevOps architecture and automation systems to...


  • Singapore Oxford Knight Full time

    RequirementsOxford Knight seeks a highly motivated and experienced Senior Site Reliability Engineer with a strong background in Linux administration, cloud computing, and programming languages (preferably Python). The ideal candidate should have a degree in Computer Science or a related field and excellent communication skills.Key Skills and Qualifications5+...


  • Singapore TIKTOK PTE. LTD. Full time

    Roles & ResponsibilitiesTikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo.Why Join UsAt TikTok, our people are humble, intelligent, compassionate and creative. We create...


  • Singapore Gravitas Recruitment Group Full time

    Our client, a leading investor in financial markets, are looking for an autonomous, critical thinking, Site Reliability Engineer to join their team in Singapore. The ideal candidate must have a strong academic background, having graduated from a top university with a bachelor's degree in computer science. This degree should have been applied...


  • Singapore SCIENTE INTERNATIONAL PTE. LTD. Full time

    Roles & ResponsibilitiesJob Summary:We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. As an SRE, you will play a critical role in enhancing system reliability, performance, and scalability while ensuring the seamless functioning of our production environments. This is an opportunity to work in a fast-paced, dynamic environment...


  • Singapore GXS Bank Full time

    About the Team: Our team treats infrastructure and operations as software engineering problems. We are responsible for building and progressing software platforms that enable the provisioning and management of all Digibank services in safe, reliable, and scalable ways. We consistently challenge the status quo and use new technologies to build platforms...


  • Singapore SOURCEO PTE. LTD. Full time

    Roles & ResponsibilitiesRequired Expertise and ExperienceAt least 3 years of experience in SRE, DevOps, or a related engineering role. Proficiency in Infrastructure as Code (IaC) using Terraform to manage complex infrastructure. Hands-on experience with log analytics and observability tools, including ELK (Elasticsearch, Logstash, Kibana) and the Grafana...


  • Singapore BYTEDANCE PTE. LTD. Full time

    Roles & ResponsibilitiesAbout Doubao (Seed)Founded in 2023, the ByteDance Doubao (Seed) Team, is dedicated to pioneering advanced AI foundation models. Our goal is to lead in cutting-edge research and drive technological and societal advancements.With a strong commitment to AI, our research areas span deep learning, reinforcement learning, Language, Vision,...

  • Associate VP

    4 days ago


    Singapore DBS Bank Limited Full time

    Business Function Group Technology enables and empowers the bank with an efficient, nimble and resilient infrastructure through a strategic focus on productivity, quality & control, technology, people capability and innovation. In Group Technology, we manage the majority of the Bank's operational processes and inspire to delight our business partners...

  • Reliability Engineer

    4 weeks ago


    Singapore UNITED MICROELECTRONICS CORPORATION (SINGAPORE BRANCH) Full time

    Roles & ResponsibilitiesJob Summary:We are seeking a motivated and detail-oriented Fab Reliability Engineer to join our dynamic team. The ideal candidate will play a crucial role in ensuring the reliability and performance of our manufacturing processes. This position involves process qualification, conformance, process change management, new process...

  • Associate VP

    5 days ago


    Singapore DBS Bank Limited Full time

    Business Function Group Technology enables and empowers the bank with an efficient, nimble and resilient infrastructure through a strategic focus on productivity, quality & control, technology, people capability and innovation. In Group Technology, we manage the majority of the Bank's operational processes and inspire to delight our business partners...


  • Singapore Oxford Knight Full time

    Salary: up to 250-275k SGD base Summary High-frequency prop trading firm with offices worldwide looking for skilled Senior Site Reliability Engineer developer to join their High Performance Computing team, developing and supporting their large-scale compute and storage platform. This platform is designed to solve demanding problems - both business and...