Site Reliability Engineer

5 days ago


Singapore TP-LINK CORPORATION PTE. LTD. Full time

Responsibilities:
Serve as technical SME for implementing and operating Microservices on Kubernetes cloud-based platforms.
Collaborate with the Cloud Technical Development and DevOps teams to deploy services to the Multi-Cloud Platform.
Performing Load Tests and Chaos Tests to ensure the scalability and reliability of microservices.
Build Observability for Microservices and cloud platforms like AWS, OCI, Azure, and GCP.
Write and Execute the Disaster recovery plans in collaboration with the Development and DevOps team.
Analyze and resolve production risks caused by insufficient resources, such as node groups, CPU, memory, HPA scheduling, JVM pre-warming, etc.
Write and maintain scripts for automation using languages like Python, Go, or Bash.
Define and maintain the KPIs (SLA/SLO/SLI) for all cloud microservices with development teams to better understand the business.
Create and maintain technical documentation, including architecture diagrams, design documents, and standard operating procedures.
Guarantee adherence to security and compliance standards, including ISO27001, SOC2, and GDPR.
Lead incident response efforts to troubleshoot and resolve production issues quickly.
Perform post-incident analysis to identify root causes and potential workarounds/solutions.
Assist with product/technology selection, including implementation of POCs
Be fluid and open to change and evolving processes and tools
Help to mentor and train less senior members of the team
Ability to be part of On-call rotation and provide support after work hours and on weekends.
Other duties as assigned
Requirements:
Bachelor's degree in Computer Science, Information Technology, or a related field.
1+ year of experience as a Site Reliability Engineer.
Proficiency in programming and scripting languages like Java, Python, Bash, or PowerShell.
Hands-on experience in SRE, DevOps, cloud operations, and cloud security best practices.
Strong knowledge of security technologies, including Identity and access management, Network security, Application security, and Data protection.
Strong problem-solving and analytical skills, with the ability to work independently and as part of a team.
Experience in developing and maintaining technical documentation and implementing compliance requirements
Additional Skills (Preferred):
Expert-level cloud certifications include AWS Solutions Architect, Professional, Azure Solutions
Architect Expert, and GCP Professional Cloud Architect.
Experience with container orchestration technologies (e.g., Kubernetes).
#J-18808-Ljbffr



  • Singapore IDEMIA Full time

    Join to apply for the Site Reliability Engineer role at IDEMIA Join to apply for the Site Reliability Engineer role at IDEMIA Get AI-powered advice on this job and more exclusive features. PurposeThis role plays a critical part in ensuring reliability, scalability, and performance of our systems and services. You will work closely with development and...


  • Singapore IDEMIA Full time

    Join to apply for the Site Reliability Engineer role at IDEMIA Join to apply for the Site Reliability Engineer role at IDEMIA Get AI-powered advice on this job and more exclusive features. PurposeThis role plays a critical part in ensuring reliability, scalability, and performance of our systems and services. You will work closely with development and...


  • Singapore IDEMIA Full time

    Join to apply for the Site Reliability Engineer role at IDEMIA Join to apply for the Site Reliability Engineer role at IDEMIA Get AI-powered advice on this job and more exclusive features. Purpose This role plays a critical part in ensuring reliability, scalability, and performance of our systems and services. You will work closely with development and...


  • Singapore beBeeSiteReliability Full time $90,000 - $120,000

    Unlock Your Full Potential in Site Reliability EngineeringAbout the RoleThis is an exciting opportunity to work with a global banking institution, leveraging your skills in production management and site reliability engineering to drive business growth.Develop and implement proactive, predictive models for shift production management using SRE...


  • Singapore beBeeSiteReliability Full time

    Unlock Your Full Potential in Site Reliability Engineering About the Role This is an exciting opportunity to work with a global banking institution, leveraging your skills in production management and site reliability engineering to drive business growth. Develop and implement proactive, predictive models for shift production management using SRE...


  • Singapore Hyphen Connect Full time

    Site Reliability Engineer (Crypto Trading) Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect Site Reliability Engineer (Crypto Trading) 2 days ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect We are hiring for one of our ecosystem projects in...


  • Singapore DHATCH CONSULTANCY PTE. LTD. Full time

    Site Reliability Engineer: **Preferred Qualifications** - 3+ years of experience in site reliability engineering, DevOps, or software engineering roles. - Proven skills in: - Monitoring & alerting tools (Grafana, New Relic) - CI/CD pipelines (Git, Jenkins, GitHub Actions, etc.) - Container orchestration (Docker, Kubernetes) - Infrastructure-as-code...


  • Singapore Hyphen Connect Full time

    Site Reliability Engineer (Crypto Trading) Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect Site Reliability Engineer (Crypto Trading) 2 days ago Be among the first 25 applicants Join to apply for the Site Reliability Engineer (Crypto Trading) role at Hyphen Connect We are hiring for one of our ecosystem...


  • Singapore HCLTech Full time

    Get AI-powered advice on this job and more exclusive features. This role combines software and systems engineering to build run, and maintain high performant, distributed, fault tolerant and resilient financial systems. Site Reliability Engineers focus on ensuring a joyful customer journey. As a Site Reliability Engineer you will be filling a...


  • Singapore HCLTech Full time

    Get AI-powered advice on this job and more exclusive features. This role combines software and systems engineering to build run, and maintain high performant, distributed, fault tolerant and resilient financial systems. Site Reliability Engineers focus on ensuring a joyful customer journey. As a Site Reliability Engineer you will be filling a...