
Principal Site Reliability Engineer
3 days ago
Job Description
At Oracle Cloud Infrastructure (OCI), we build the more intelligent future of cloud. OCI Sovereign Cloud is a team of smart, motivated, and diverse people who are focused on bringing the world's most important work to OCI. We build and operate our government, classified, and sovereign cloud regions to be reliable and high-performing, just like our public cloud. Our customers and their mission are the center of what we do. We strive to improve our knowledge of the challenges our customers face, which we use to enhance our cloud capabilities, and we work together to deliver their mission.
As a Site Reliability Engineer, you will be responsible for the operation of production environments, including systems and databases, supporting critical business operations for Singapore's governmental sovereign cloud environment. You will be focused on automation and optimization of operations for multiple production environments. You will recommend new and novel solutions to improve availability, performance, and supportability. This is an opportunity to combine deep technical knowledge with administration/analysis knowledge of Oracle's Cloud Infrastructure to provide escalation support for a wide range of complex production environment problems related to immense growth, scaling, leveraging the cloud, extremely high performance, and high availability requirements. As a Site Reliability Engineer, you will also guide junior engineers to solve complex problems, take part in large-scale incident bridges, and help to build and optimize processes and procedures.
Career Level - IC4
Responsibilities
- Develop automation and optimizations focused on operational excellence.
- Deep-dive into, root-cause, and solve systemic issues.
- Enhance Operations quality outcomes through scalable automations.
- Install, monitor, maintain, support, and optimize all production server hardware and software.
- Provide escalated technical support for complex technical issues which may include leading problem management cases and providing management status.
- Coordinate escalated support cases and lead appropriate internal technical resources and/or third-party vendors to resolution and coordinate a storage infrastructure of Oracle system and database appliances.
- Responsible for Oracle production environments; assist with server operating system and application upgrades, bug fixes, and patching; and work on standardization projects for both hardware and software under the Oracle technology stack while providing consistent system uptime as expected in a Cloud environment.
- Lead communications with key partners in solving complex technical problems.
- Provide technical guidance and leadership to junior members to enable them to grow in their careers.
- This team will provide support and administration on a 24/7 basis and will require rotation across day and night shifts.
- This role is open to Singaporeans and PRs only.
- This role will involve the successful applicant working on government projects which may require security clearance being obtained and maintained as a condition of employment. Candidates applying for this role must be willing to provide necessary personal details for the application and maintenance of necessary security clearance.
- Experience with Linux System Administration, Networking, Storage, Compute, and Virtualization
- An understanding of, and experience working with, technologies such as Kubernetes, Terraform, Ansible, Chef, and Puppet.
- Experience participating in or running incident bridges of significant scale
- Customer focus, with a passion for delighting customers
- Experience in SRE, cloud technical support, cloud operations, NOC or similar
- Demonstrated ability to quickly learn new technical disciplines and then train others
About Us
As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector—and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing or by calling in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
Seniority level
Mid-Senior level
Employment type
Full-time
Job function
Engineering and Information Technology
Industries
IT Services and IT Consulting
Principal Site Reliability Engineer
2 weeks ago
Singapore GXS BANK PTE. LTD. Full time
We are living in exciting times. Technology is reshaping how we live and we want to redefine how financial services are offered, which is why Singtel and Grab are coming together. Singtel is Asia’s leading communications group connecting millions of consumers and enterprises to essential digital services while Grab is the leading technology company in...
-
Senior Principal Reliability Engineer
2 days ago
Singapore NXP Semiconductors Full time
We are looking for a Reliability Engineer in preparation for the formation of the joint venture of NXP and VIS, known as VSMC. This posting is for a Senior...
-
Site Reliability Engineer
1 week ago
Singapore Sea Limited Full time
Engineering and Technology - Infrastructure, Singapore - Entry Level
Our DevOps Engineering team plays an important role in developing and maintaining the internal systems and tools for the Infrastructure team. As a Site Reliability Engineer, you are responsible for improving the availability and reliability of our Infrastructure services. - Responsible for...
-
Site Reliability Engineer
1 week ago
Singapore Hyphen Connect Full time
Site Reliability Engineer (Crypto Trading)
We are hiring for one of our ecosystem projects in...
-
Site Reliability Engineer
4 weeks ago
Singapore Vega Solutions Full time
Tokka Labs | Singapore | Full-Time
Tokka Labs is a proprietary trading firm with a focus on close collaboration, rigorous research, and cutting-edge...
-
Site Reliability Engineer
2 weeks ago
Singapore DHATCH CONSULTANCY PTE. LTD. Full time
Site Reliability Engineer: **Preferred Qualifications**
- 3+ years of experience in site reliability engineering, DevOps, or software engineering roles.
- Proven skills in:
  - Monitoring & alerting tools (Grafana, New Relic)
  - CI/CD pipelines (Git, Jenkins, GitHub Actions, etc.)
  - Container orchestration (Docker, Kubernetes)
  - Infrastructure-as-code...
-
Site Reliability Engineer
24 hours ago
Singapore TRUEWATCH TECHNOLOGY INC PTE. LTD. Full time
**Responsibility**:
- Run the production environment by monitoring availability and taking a holistic view of system health.
- Achieve site reliability automation, minimize system downtime, and reduce site reliability cost.
- Manage risks and resolve issues that affect the release scope, schedule, and quality.
- Suggest architecture improvements, push for...
-
Site Reliability Engineer
3 days ago
Singapore TEAMLEASE DIGITAL CONSULTING PTE. LTD. Full time
As a Site Reliability Engineer, you will be filling a mission-critical role ensuring that our systems are healthy, monitored, automated, fault-tolerant and designed to scale. You will collaborate and work closely with engineering teams to continually improve our production services, facilitating fast delivery of new products, and reducing downtime. Key...
-
Site Reliability Engineer
1 week ago
Singapore HCLTech Full time
This role combines software and systems engineering to build, run, and maintain high-performance, distributed, fault-tolerant, and resilient financial systems. Site Reliability Engineers focus on ensuring a joyful customer journey. As a Site Reliability Engineer you will be filling a...
-
Site Reliability Engineer
1 week ago
Singapore Vega Solutions Full time
Tokka Labs | Singapore | Full-Time
Tokka Labs is a proprietary trading firm with a focus on close collaboration, rigorous research, and cutting-edge...