Site Reliability Developer 3

2 weeks ago


Singapore Oracle Full time

Job Description

As a Senior Network Reliability Engineer on the OCI Network Availability team, you will play a crucial role in ensuring the high availability and performance of Oracle Cloud's global network infrastructure. This role involves applying engineering methodologies to measure, monitor, and automate the reliability of OCI's network, supporting millions of users across a vast, distributed environment.

You will be part of a fast-paced, innovative team responsible for swiftly responding to network disruptions, identifying root causes, and collaborating with internal and external stakeholders to restore services. Your work will also focus on automating daily operations, improving workflow efficiency, and optimizing network performance. With OCI's expansive global footprint, you will manage hundreds of thousands of network devices across a mix of dedicated backbone infrastructure, Clos networks, and the internet.

Responsibilities

Support and Operate OCI's Global Network: Design, deploy, and manage large-scale network solutions that power Oracle Cloud Infrastructure (OCI), ensuring reliability and performance at a global scale.
Collaborate and Drive Change: Use best practices and tools to develop and execute network changes safely. Work closely with cross-functional teams to continuously improve network performance.
Incident Response and Troubleshooting: Lead break-fix support for network events, provide escalation for complex issues, and perform post-event root cause analysis to prevent future disruptions.
Automation and Efficiency: Create and maintain scripts to automate routine network tasks, working with business units and teams to streamline operations and increase productivity.
Mentorship and Knowledge Sharing: Guide and mentor junior engineers, fostering a culture of collaboration, continuous learning, and technical excellence.
Network Monitoring and Performance Analysis: Collaborate with network monitoring teams to gather telemetry data, build dashboards, and set up alert rules to track network health and performance.
Vendor Collaboration: Work with network vendors and technical



  • Singapore Oracle Full time $120,000 - $180,000 per year

    Description Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. As a Senior Network Reliability Engineer on the OCI Network Availability team, you will play a crucial role in ensuring the high availability and performance of Oracle Cloud's global network infrastructure. This role involves...


  • Singapore Oracle Full time $120,000 - $180,000 per year

    Description As a Senior Network Reliability Engineer on the OCI Network Availability team, you will play a crucial role in ensuring the high availability and performance of Oracle Cloud's global network infrastructure. This role involves applying engineering methodologies to measure, monitor, and automate the reliability of OCI's network, supporting millions...


  • Singapore Oracle Full time $120,000 - $180,000 per year

    Description Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. As a Senior Network Reliability Engineer on the OCI Network Availability team, you will play a crucial role in ensuring the high availability and performance of Oracle Cloud's global network infrastructure. This role involves...


  • Singapore Ll Oefentherapie Full time

    As a Senior Network Reliability Engineer on the OCI Network Availability team, you will play a crucial role in ensuring the high availability and performance of Oracle Cloud's global network infrastructure. This role involves applying engineering methodologies to measure, monitor, and automate the reliability of OCI's network, supporting millions of users...


  • Singapore M2R System Technology Pte. Ltd. Full time

    **Responsibilities**: - Run the production environment by monitoring availability and taking a holistic view of system health - Achieve site reliability automation, minimize system downtime, and reduce site reliability cost - Manage risks and resolve issues that affect the release scope, schedule and quality - Suggest architecture improvements, push for...


  • Singapore Oracle Full time $120,000 - $180,000 per year

    Description At Oracle Cloud Infrastructure (OCI), we're building the future of cloud technology for enterprises. As a team of innovative, diverse creators and engineers, we operate with the agility of a startup, but the scale and customer-first mindset of the leading enterprise software company in the world. We thrive on equity, inclusion, and respect for...


  • Singapore Rapsys Technologies Full time

    **Experience**: 4+ Years **Location**: Changi, Singapore **Roles and Responsibilities**: 2. Set up and operate the server infrastructure and software (Linux, Elasticsearch, Logstash, Grafana, Kibana, Kafka, Nginx) based on bank’s security standards and industry’s security standards. 3. Perform continuous improvement for the platform covering areas...


  • Singapore Pan Asia Group Resources Full time

    **Key Responsibilities**: - Drive the Site Reliability Engineering agenda to improve availability, reliability, and performance of services - Drive optimise-operate initiatives, for example, reduction of operational toil - Work with the enterprise team in deploying SRE enablers/initiatives. - Strong background in machine learning and deep learning algorithms. -...


  • Singapore Retentia technology private limited Full time

    **3+ years of experience in Site Reliability Engineering, DevOps**, or a related field. - **Strong knowledge of cloud platforms (AWS, GCP, Azure) and containerization technologies (Docker, Kubernetes).** - Experience with automation and configuration management tools (e.g., **Terraform, Ansible, Chef, or Puppet).** - Proficiency in at least **one programming...


  • Singapore eTeam Full time

    Description Site Reliability Engineer (SRE) We are looking for a seasoned Site Reliability Engineer (SRE) with 5–10 years of experience to join our Platform Engineering team. This role is ideal for someone who thrives in a fast‑paced environment, is passionate about reliability, and enjoys solving complex challenges. You will play a key role in building...