Site Reliability Engineer
2 days ago
Explore your next opportunity at a Fortune Global 500 organization. Envision innovative possibilities, experience our rewarding culture, and work with talented teams that help you become better every day. We know what it takes to lead UPS into tomorrow—people with a unique combination of skill + passion. If you have the qualities and drive to lead yourself or teams, there are roles ready to cultivate your skills and take you to the next level.
Job Description:
Job Summary:
We are seeking a skilled and proactive Site Reliability Engineer (SRE) with 5–8 years of experience and deep expertise in Google Cloud Platform (GCP). The ideal candidate will be responsible for the reliability, availability, and performance of cloud-based applications and infrastructure. You will collaborate with development, operations, and security teams to build and maintain scalable, secure, and highly available systems.
Key Responsibilities:
- Design, develop, and maintain reliable, scalable, and highly available systems on GCP.
- Build and manage CI/CD pipelines, infrastructure as code (IaC), and monitoring solutions.
- Proactively monitor and manage system performance, uptime, and capacity using observability tools.
- Troubleshoot and resolve infrastructure and application-level issues in real time.
- Implement and maintain disaster recovery, failover mechanisms, and backup strategies.
- Automate repetitive tasks and processes to improve efficiency and reduce toil.
- Participate in on-call rotations, incident management, and root cause analysis (RCA).
- Ensure compliance with security standards, privacy regulations, and governance policies.
- Collaborate with cross-functional teams to support DevOps and SRE best practices.
- Drive improvements in SLAs, SLOs, and error budgets through data-driven insights.
Required Qualifications:
- 5–8 years of relevant experience as an SRE, DevOps Engineer, or Cloud Infrastructure Engineer.
- Strong hands-on experience with Google Cloud Platform (GCP) – Compute Engine, GKE, Cloud Functions, Cloud Storage, IAM, BigQuery, etc.
- Proficiency in Infrastructure as Code tools like Terraform, Deployment Manager, or CloudFormation.
- Experience with Kubernetes, Docker, and container orchestration.
- Proficiency in scripting languages like Python, Shell, or Go.
- Deep understanding of monitoring and logging tools such as Prometheus, Grafana, Stackdriver, or Datadog.
- Knowledge of CI/CD tools such as Jenkins, GitLab CI, or Cloud Build.
- Experience with incident response, postmortem analysis, and site reliability principles.
- Strong problem-solving and communication skills.
Preferred Qualifications:
- GCP certifications (e.g., Professional Cloud DevOps Engineer, Cloud Architect).
- Exposure to multi-cloud environments or hybrid cloud infrastructure.
- Familiarity with Agile and ITIL frameworks.
- Experience working in regulated environments with compliance standards (e.g., ISO, SOC 2).
Employee Type:
Permanent
UPS is committed to providing a workplace free of discrimination, harassment, and retaliation.
-
Site Reliability Engineer
3 weeks ago
Singapore IDEMIA Full time Purpose: This role plays a critical part in ensuring reliability, scalability, and performance of our systems and services. You will work closely with development and...
-
Site Reliability Engineer
2 weeks ago
Singapore DHATCH CONSULTANCY PTE. LTD. Full time Site Reliability Engineer. Preferred Qualifications: - 3+ years of experience in site reliability engineering, DevOps, or software engineering roles. - Proven skills in: - Monitoring & alerting tools (Grafana, New Relic) - CI/CD pipelines (Git, Jenkins, GitHub Actions, etc.) - Container orchestration (Docker, Kubernetes) - Infrastructure-as-code...
-
Site Reliability Engineer
2 days ago
Singapore TRUEWATCH TECHNOLOGY INC PTE. LTD. Full time Responsibility: - Run production environment by monitoring availability and taking a holistic view of the system health. - Achieve site reliability automation, minimize system downtime, and reduce site reliability cost. - Manage risks and resolve issues that affect the release scope, schedule and quality. - Suggest architecture improvements, push for...
-
Site Reliability Engineer
4 days ago
Singapore JJ Consulting Services Full time Our client is a fast-growing company in Singapore that is seeking to recruit a Site Reliability Engineer. Key Roles & Responsibilities: - Providing ancillary support of Enterprise-Grade Products and solutions at customer's sites - Ironing out deployment issues or challenges that our customers may face - Responsible for...
-
Site Reliability Engineer
5 days ago
Singapore Beijing Foreign Enterprise Management Consultants Co.,Ltd. Full time On behalf of Huawei, a world-renowned information and communication technology company, we are seeking passionate and talented individuals to join our team as Site Reliability Engineer. Overview: On behalf of Huawei, a world-renowned information and communication...
-
Site Reliability Engineer Assistant
3 weeks ago
Singapore Manpower Singapore Full time Site Reliability Engineer Assistant (DevOps). Responsible for the operation and maintenance of online game marketing services, to ensure the continuous...
-
Site Reliability Engineer
4 weeks ago
Singapore Shopify Full time Site Reliability Engineer (EMEA, Japan, Singapore, Australia). Overview: We are not here to play zero-sum games. Shopify Engineering is focused on building the best product for our Merchants. You will enable entrepreneurship and create new value for the...
-
Site Reliability Engineer
2 weeks ago
Singapore The Edge Asia Full time Our client is a US hedge fund and their Technology group is constantly improving the company’s IT infrastructure, positioning them at the forefront of a rapidly evolving technology landscape. They are a team of experts experimenting, discovering new ways to harness the power of open-source solutions, and embracing enterprise agile methodology. Their...
-
Site Reliability Engineer
2 days ago
Singapore People Profilers Full time Job Description: Responsibilities: - Support services before they go live through activities such as system design consulting and launch reviews. - Develop and maintain tools, re-designing capacity planning infrastructure for greater scalability. - Troubleshooting, diagnosing and fixing software issues. - Suggesting architecture improvements, pushing...
-
Site Reliability Engineer
2 weeks ago
Singapore ABAXX SINGAPORE PTE. LTD. Full time Site Reliability Engineer - Networking. We are seeking a competent candidate to join our Infrastructure Team, whose mission is building and operating a MAS-regulated marketplace and clearing house. This role is ideal for someone with a strong foundation in AWS services, infrastructure as code, and cloud security, who is passionate about building scalable, secure,...