Site Reliability Engineer

5 days ago


Singapore NTT DATA SINGAPORE PTE. LTD. Full time
Roles & Responsibilities

EMAIL ID : Interested candidates may also send their resume via email to mike.ramos@nttdata.com

Only shortlisted candidates would be contacted for interview.


Role: Site Reliability Engineer - 12 months Renewable contract

Experience: Minimum of 5 years

Location : Changi Business Park


Summary:


We are seeking a highly motivated and experienced Site Reliability Engineer (SRE) to join our growing Observability team. The ideal candidate will have a strong background in building and maintaining robust observability environments, including monitoring, logging, and tracing systems. This role will focus on the design, implementation, and support of our observability infrastructure, ensuring the seamless onboarding of applications and providing critical support during incidents.



Responsibilities:

  • Observability Environment Management: Design, build, and maintain our observability infrastructure, including monitoring tools, logging platforms, and distributed tracing systems (e.g., Prometheus, Grafana, Elasticsearch, etc.). This includes capacity planning, performance tuning, and ensuring high availability.
  • Application Onboarding: Work with development teams to onboard applications to our observability platform, providing guidance on instrumentation best practices and ensuring data quality. This includes creating and maintaining documentation and training materials.
  • Incident Support: Provide timely and effective support during incidents, leveraging observability data to diagnose and resolve issues quickly. This includes contributing to post-incident reviews and implementing preventative measures.
  • Automation: Automate repetitive tasks and processes related to observability, improving efficiency and reducing manual effort. This may involve scripting, developing tools, or integrating with CI/CD pipelines.
  • Alerting and Monitoring: Develop and maintain effective alerting strategies, ensuring appropriate escalation procedures and minimizing noise. This includes creating dashboards and reports to visualize system health and performance.



Qualifications:

  • Bachelor’s degree in computer science or a related field, or equivalent experience.
  • 5+ years of experience as an SRE or in a similar role with a focus on observability.
  • Strong understanding of distributed systems and microservices architectures.
  • Experience with any monitoring, logging, and tracing tools (e.g., Prometheus, Grafana, Jaeger, Elasticsearch, Fluentd, Datadog, Dynatrace, etc.).
  • Proficiency in scripting languages such as Python, Go, or Bash.
  • Strong problem-solving and analytical skills.
  • Excellent communication and collaboration skills.



Bonus Points:

  • Experience with cloud platforms.
  • Experience with infrastructure-as-code tools (e.g., Terraform, Ansible)

Tell employers what skills you have

Kubernetes
Analytical Skills
Pipelines
High Availability
Scripting
Data Quality
Environment Management
Microservices
Reliability
Logging
Distributed Systems
Python
Dynatrace
Performance Tuning
Ansible
Instrumentation

  • Singapore HW Search & Selection Ltd Full time

    Site Reliability Engineer A new opportunity has arisen for a Site Reliability Engineer for a prestigious investment management firm in Singapore. You will be responsible for providing production support for the trading infrastructure.Your main responsibilities will include:Linux trading infrastructure supportProviding Level II supportUtilizing Python to...


  • Singapore HW Search & Selection Ltd Full time

    Site Reliability Engineer A new opportunity has arisen for a Site Reliability Engineer for a prestigious investment management firm in Singapore. You will be responsible for providing production support for the trading infrastructure. Your main responsibilities will include: Linux trading infrastructure support Providing Level II support Utilizing Python to...


  • Singapore Qlik Full time

    What makes us Qlik? A Gartner Magic Quadrant Leader for 14 years in a row, Qlik transforms complex data landscapes into actionable insights, driving strategic business outcomes. Serving over 40,000 global customers, our portfolio leverages pervasive data quality and advanced AI/ML capabilities that lead to better decisions, faster. We excel in integration...


  • Singapore Bright Vision Technologies Full time

    Bright Vision Technologies has an immediate Full-time opportunity for Site Reliability Engineer (SRE)Job Role:Site Reliability Engineer (SRE)Job Type: Full TimeCandidates Looking for Visa sponsorship and willing to relocate to USA are encouraged to apply.About Bright Vision Technologies: Bright Vision Technologies is a fast-growing technology company...


  • Singapore EXASOFT PTE. LTD. Full time

    Roles & ResponsibilitiesPOSITION OVERVIEW : Software Development AnalystResponsibilities and Requirements: Sound knowledge of operating Systems (like LINUX). Understanding all stages of software Development. Supporting incident escalation and troubleshooting. Documenting processes and related knowledge. Evaluating incidents after resolution. ...


  • Singapore EXASOFT PTE. LTD. Full time

    Roles & ResponsibilitiesPOSITION OVERVIEW : Software Development AnalystResponsibilities and Requirements: Sound knowledge of operating Systems (like LINUX). Understanding all stages of software Development. Supporting incident escalation and troubleshooting. Documenting processes and related knowledge. Evaluating incidents after resolution. ...


  • Singapore Aptitude Asia Limited Full time

    Our client, a top-tier hedge fund, is looking to hire a talented Site Reliability Engineer to join their growing SRE team in Singapore. Job Responsibilities: Ensure high reliability, availability, and performance of applications throughout their lifecycle. Automate repetitive tasks and systematically address recurring issues. Generate innovative ideas for...


  • Singapore HEXACON CONSTRUCTION PTE LTD Full time

    Job DescriptionAs a key member of the HEXACON CONSTRUCTION PTE LTD team, we are seeking a highly skilled and experienced Site Reliability Engineer to join our facilities operations department.The ideal candidate will have a strong background in maintenance and reliability engineering, with a proven track record of leading and guiding sub-contractors to...


  • Singapore CLIMATE IMPACT X PTE. LTD. Full time

    Roles & ResponsibilitiesWe are seeking a motivated Site Reliability Engineer (SRE) to join our team. The ideal candidate will ensure the reliability, performance, and scalability of CIX’s technology stack while supporting critical infrastructure needs globally. With a diverse client base across multiple jurisdictions, you are also required to cover London...


  • Singapore CLIMATE IMPACT X PTE. LTD. Full time

    Roles & ResponsibilitiesWe are seeking a motivated Site Reliability Engineer (SRE) to join our team. The ideal candidate will ensure the reliability, performance, and scalability of CIX’s technology stack while supporting critical infrastructure needs globally. With a diverse client base across multiple jurisdictions, you are also required to cover London...


  • Singapore Qlik Full time

    What makes us Qlik?A Gartner Magic Quadrant Leader for 14 years in a row, Qlik transforms complex data landscapes into actionable insights, driving strategic business outcomes. Serving over 40,000 global customers, our portfolio leverages pervasive data quality and advanced AI/ML capabilities that lead to better decisions, faster. We excel in integration...


  • Singapore GXS BANK PTE. LTD. Full time

    Roles & ResponsibilitiesJob Description & RequirementsGet to know the Role: As a Site Reliability Engineer (SRE) you will help build a meaningful engineering discipline, combining software and systems to develop creative engineering solutions to operations problems. Much of our support and software development focuses on optimizing existing systems,...


  • Singapore GXS BANK PTE. LTD. Full time

    Roles & ResponsibilitiesJob Description & RequirementsGet to know the Role: As a Site Reliability Engineer (SRE) you will help build a meaningful engineering discipline, combining software and systems to develop creative engineering solutions to operations problems. Much of our support and software development focuses on optimizing existing systems,...

  • Process engineer

    4 weeks ago


    Singapore The Chemical Engineer Full time

    Why Patients Need You Whether you are involved in the design and development of manufacturing processes for products or supporting maintenance and reliability, engineering is vital to making sure customers and patients have the medicines they need, when they need them. Working with our innovative engineering team, you'll help bring medicines to the...


  • Singapore Qlik Full time

    Director of Regional Site Reliability EngineeringQlik is seeking an experienced leader to oversee the development and scaling of our regional Site Reliability Engineering (SRE) organization in APAC. This role will be instrumental in ensuring the availability, scalability, and reliability of our services.About QlikWe are a global company that transforms...


  • Singapore Chemical Engineering Site Full time

    Job Title: MSAT Process Data Scientist Intern Location: EVolutive Facility (EVF) at 5 Tuas South Street 2, Singapore 639328Eligibility: Credit bearing internship with 12 months duration preferably (6 months minimum)Others: Company transport provision at designated MRT Station About the job Sanofi Manufacturing and Supply Organization is preparing its future...


  • Singapore DEUTSCHE BANK AKTIENGESELLSCHAFT Full time

    About the RoleWe are seeking an experienced Site Reliability Engineer to join our team at Deutsche Bank AKTIENGESELLSCHAFT. As a Site Reliability Engineer, you will play a critical role in ensuring the availability, performance, and security of our cloud-based infrastructure.


  • Singapore Luxoft Full time

    Project Description With award-winning mobile banking apps and trading systems, our technology platforms help Bank deliver best-in-class products to clients. Naturally, we make sure that the phones work, emails are delivered and PCs run - but we also develop innovative collaboration platforms and workspaces that help our people share their knowledge, their...


  • Singapore This is an IT support group Full time

    Bright Vision Technologies has an immediate Full-time opportunity for Site Reliability Engineer (SRE). Job Role: Site Reliability Engineer (SRE)Job Type: Full Time Candidates looking for visa sponsorship and willing to relocate to the USA are encouraged to apply. About Bright Vision Technologies: Bright Vision Technologies is a fast-growing technology...


  • Singapore This is an IT support group Full time

    Singapore, Singapore Relocation friendly DevOps BCM Industry 02/12/2024Req. VR-109808Project Description With award-winning mobile banking apps and trading systems, our technology platforms help Bank deliver best-in-class products to clients. Naturally, we make sure that the phones work, emails are delivered and PCs run - but we also develop innovative...