Site Reliability Developer 4
7 days ago
1 week ago Be among the first 25 applicants Job Description Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Job Description Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. As a Principal Engineer within NRE, you will be responsible for ensuring the reliability, scalability, and security of OCI's network infrastructure. You will apply engineering principles to measure and automate the network's reliability, aligning it with Oracle's service-level objectives. This role will involve resolving complex network issues, collaborating across teams, and driving automation efforts that enhance the overall operational efficiency of the OCI network. You'll work with a team dedicated to proactively preventing network disruptions, performing root-cause analysis, and delivering innovative solutions that ensure the smooth operation of a global network environment. Responsibilities Lead Network Reliability Efforts : Develop, automate, and optimize network services that ensure high availability and performance across OCI's global infrastructure. Network Lifecycle Management : Drive key programs to manage and maintain the network lifecycle, defining objectives and coordinating delivery milestones to meet organizational goals. Troubleshoot and Resolve Complex Network Issues : Serve as the technical expert for network events, providing Tier 2 support and leading efforts to quickly restore services. Drive Automation : Develop scripts and automation tools to improve operational efficiency, reduce manual interventions, and support a rapidly evolving network environment. Collaborate Across Teams : Work closely with cross-functional teams—including engineering, product, and vendor partners—to design, implement, and optimize network solutions that meet the needs of both the business and end-users. Mentor and Lead : Provide technical leadership and mentorship to junior engineers, helping them develop their skills and grow within the organization. Innovate and Influence : Contribute to the roadmap for new network technologies, tools, and methodologies that enhance OCI's network performance and reliability. What You'll Need to Succeed: Technical Expertise : Extensive experience in network engineering, with a strong background in protocols like MPLS, BGP, OSPF, IS-IS, TCP/IP, IPv4, IPv6, DNS , and DHCP . Experience with VxLAN , EVPN , and SDN technologies is a plus. Automation Skills : Proficiency in scripting or programming, ideally with Python , to develop solutions that automate network operations and troubleshooting. Deep Understanding of Networking : Strong knowledge of networking protocols, monitoring tools, telemetry solutions, and network modeling techniques (e.g., YANG, OpenConfig, NETCONF ). Experience in Cloud or ISP Environments : Proven track record in large-scale cloud or ISP network environments, ideally supporting complex, multi-cloud infrastructures. Problem-Solving Mindset : Excellent analytical and troubleshooting skills, with a focus on proactive identification and resolution of network issues. Collaboration and Leadership : Ability to work effectively in a fast-paced, cross-functional team environment. Experience leading technical teams or projects is highly desirable. Educational Background : Bachelor's degree in Computer Science, Engineering, or a related field. A Master's degree is preferred. Preferred Experience: Experience with network modeling and automation frameworks for large-scale networks. Familiarity with cloud-native network architectures and modern network management tools. Experience with network monitoring , telemetry systems, and telemetry-based decision-making . What We Offer: Impact at Scale : Work on projects that support millions of users and some of the largest organizations in the world. Global Reach : Collaborate with engineers, leaders, and vendors across the globe to build and operate Oracle Cloud's network. Innovation and Growth : Opportunity to work with cutting-edge technologies and drive innovation in a fast-evolving field. Supportive Culture : A culture of collaboration, continuous learning, and growth, where your contributions matter. Additional Information: This role requires participation in an on-call rotation to provide 24/7 support for critical network events and incidents. You will work in a high-impact, high-visibility role with opportunities for technical leadership and career advancement. Qualifications Career Level - IC4About Us As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector—and continue to thrive after 40+ years of change by operating with integrity. We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all. Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs. We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing or by calling in the United States. Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law. Seniority level Seniority level Mid-Senior level Employment type Employment type Full-time Job function Job function Engineering and Information Technology Industries IT Services and IT Consulting Referrals increase your chances of interviewing at Oracle by 2x Get notified about new Site Developer jobs in Singapore . Network Engineer (SD-WAN & Managed Services)Security/Embedded Systems Engineer (TEE)- Remote, Worldwide | Edinburgh, On Site Principal Network Development Engineer - Network Reliability Engineering Linux Cryptography and Security Engineer Smart Contract Security Engineer (Security Audit)Software Engineer (Python/Linux/Packaging)Software Engineer, Data Infrastructure & Acquisition - Asia We're unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI. #J-18808-Ljbffr
-
Site Reliability Developer 4
1 week ago
Singapore Ll Oefentherapie Full timeOverview Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale distributed systems....
-
Site Reliability Engineer
2 weeks ago
Singapore Rapsys Technologies Full time**Roles and Responsibilities**: 2. Set up and operate the server infrastructure and software (Linux, Elasticsearch, Logstash, Grafana, Kibana, Kafka, Nginx) based on bank’s security standards and industry’s security standards. 3. Perform continuous improvement for the platform covering areas such as: capacity planning, observability, monitoring,...
-
Site Reliability Developer 3
5 days ago
Singapore Oracle Full timeOverview Join to apply for the Site Reliability Developer 3role at Oracle . Job Description As a Senior Network Reliability Engineer on the OCI Network Availability team, you will play a crucial role in ensuring the high availability and performance of Oracle Cloud's global network infrastructure. This role involves applying engineering methodologies to...
-
Site Reliability Engineer
3 days ago
Singapore Qlik Full time**What makes us Qlik?** A Gartner® Magic Quadrant Leader for 14 years in a row, Qlik transforms complex data landscapes into actionable insights, driving strategic business outcomes. Serving over 40,000 global customers, our portfolio leverages pervasive data quality and advanced AI/ML capabilities that lead to better decisions, faster. We excel in...
-
Site Reliability Engineer
5 days ago
Singapore TRUEWATCH TECHNOLOGY INC PTE. LTD. Full time**Responsibility**: - Run production environment by monitoring availability and taking a holistic view of the system health. - Achieve site reliability automation, minimize system downtime, and reduce site reliability cost. - Manage risks and resolves issues that affect the release scope, schedule and quality. - Suggest architecture improvements, push for...
-
Site Reliability Engineer
6 days ago
Singapore JJ Consulting Services Full timeOur Client is a fast growing company in Singapore, who is seeking to recruit a Site Reliability Engineer. **Site Reliability Engineer** **Key Roles & Responsibilities** - Providing ancillary support of Enterprise-Grade Products and solutions at customer's sites - Ironing out deployment issues or challenges that our customers may face - Responsible for...
-
Site Reliability Engineer
5 days ago
Singapore ASPIRE GLOBAL NETWORK PTE. LTD. Full timeDo you have an interest in Cryptocurrencies and want to join a global company that is doing incredible things with data? Would you like to have the flexibility to work remotely from wherever you want? A global cryptocurrency data company with offices in Singapore are looking for a Site Reliability Engineer to join their growing Singapore presence. The...
-
Site Reliability Engineer
2 weeks ago
Singapore ETEAM WORKFORCE PTE. LTD. Full timePosition: Site Reliability Engineer (SRE) Work Mode - Onsite/Hybrid Timing - 9am to 6 pm Duration – 1 Year (Highly extendable) Salary: 6018 SGD Work Location: Robinson Road, Singapore About the Role We are looking for a seasoned Site Reliability Engineer (SRE) with 5+ years of experience to join our Platform Engineering team. This role is ideal for someone...
-
Site Reliability Engineer
1 week ago
Singapore Point72 Full timeJoin to apply for the Site Reliability Engineer role at Point72 About the role As part of Point72’s Technology Team, you will focus on developing and maintaining complex, distributed, real-time systems that support our Global Macro business. Your responsibilities will include optimizing operations through automation, building foundational SRE components,...
-
Site Reliability Engineer
2 weeks ago
Singapore Crystal Equation Corporation Full timeWe are seeking a skilled Site Reliability Engineer (SRE) to join our team. SRE will be responsible for keeping all internal user-facing applications and other production systems running smoothly. This hybrid role involves a combination of both development and operations skills to build and manage systems that are both efficient and reliable. The Enterprise...