Compute Grid Site Reliability Engineer- AVP
1 week ago
To apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them.
Accountabilities
- Availability, performance, and scalability of systems and services through proactive monitoring, maintenance, and capacity planning.
- Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring.
- Development of tools and scripts to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience.
- Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning.
- Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and work closely with other teams to ensure smooth and efficient operations.
- Stay informed of industry technology trends and innovations, and actively contribute to the organization's technology communities to foster a culture of technical excellence and growth.
- To advise and influence decision making, contribute to policy development and take responsibility for operational effectiveness. Collaborate closely with other functions/ business divisions.
- Lead a team performing complex tasks, using well developed professional knowledge and skills to deliver on work that impacts the whole business function. Set objectives and coach employees in pursuit of those objectives, appraisal of performance relative to objectives and determination of reward outcomes
- If the position has leadership responsibilities, People Leaders are expected to demonstrate a clear set of leadership behaviours to create an environment for colleagues to thrive and deliver to a consistently excellent standard. The four LEAD behaviours are: L - Listen and be authentic, E - Energise and inspire, A - Align across the enterprise, D - Develop others.
- OR for an individual contributor, they will lead collaborative assignments and guide team members through structured assignments, identify the need for the inclusion of other areas of specialisation to complete assignments. They will identify new directions for assignments and/ or projects, identifying a combination of cross functional methodologies or practices to meet required outcomes.
- Consult on complex issues; providing advice to People Leaders to support the resolution of escalated issues.
- Identify ways to mitigate risk and developing new policies/procedures in support of the control and governance agenda.
- Take ownership for managing risk and strengthening controls in relation to the work done.
- Perform work that is closely related to that of other areas, which requires understanding of how areas coordinate and contribute to the achievement of the objectives of the organisation sub-function.
- Collaborate with other areas of work, for business aligned support areas to keep up to speed with business activity and the business strategy.
- Engage in complex analysis of data from multiple sources of information, internal and external sources such as procedures and practises (in other areas, teams, companies, solve problems creatively and effectively.
- Communicate complex information. 'Complex' information could include sensitive information or information that is difficult to communicate because of its content or its audience.
- Influence or convince stakeholders to achieve outcomes.
Join us in the role as Compute Grid Site Reliability Engineer- AVP in Singapore. The Compute Grid team is responsible for building and maintaining the bank's distributed super-computer which runs the bank's compute intensive workloads. The system harnesses CPU capacity sourced from on-prem and public cloud. The team's mission statement is: "To provide a stable platform for the distributed execution of computation tasks at the lowest possible price". In this role, you will work to continuously improve the Compute Grid service, operating within the team's EngOps framework (a mix of SRE & DevOps), taking part in support, operations, engineering, and development work on rotation.
To be successful in the role, you must have
Essential Skills/Basic Qualifications
- Strong verbal and written communication skills.
- Strong technical aptitude and can-do attitude along with good problem-solving skills.
- Experience in Windows/Unix Systems Administration
- PowerShell and Python scripting
- Experience with High Performance Computing software such as IBM Symphony and Tibco/DataSynapse GridServer.
- Experience with Microsoft Azure & AWS.
- Experience using Splunk.
- Experience in DevOps tooling(Git, Chef, Jenkins, Terrafor
-
Compute Grid Site Reliability Engineer
1 week ago
Singapore BARCLAYS EXECUTION SERVICES LIMITED Singapore Branch Full timeJob OverviewBARCLAYS EXECUTION SERVICES LIMITED Singapore Branch is seeking a skilled Compute Grid Site Reliability Engineer to join our team. As a key member of the Compute Grid team, you will be responsible for building and maintaining the bank's distributed super-computer which runs the bank's compute intensive workloads.
-
Compute Grid System Reliability Specialist
2 weeks ago
Singapore BARCLAYS EXECUTION SERVICES LIMITED Singapore Branch Full timeJob DescriptionAs a Compute Grid Site Reliability Engineer - AVP at Barclays Execution Services Limited Singapore Branch, you will play a pivotal role in building and maintaining the bank's distributed super-computer which runs its compute-intensive workloads. This system harnesses CPU capacity sourced from on-prem and public cloud platforms.**Key...
-
Compute Grid Site Reliability Expert
2 weeks ago
Singapore BARCLAYS EXECUTION SERVICES LIMITED Singapore Branch Full timeJob Description:We are seeking a skilled Compute Grid Site Reliability Engineer to join our team in Singapore. As a key member of the Compute Grid team, you will be responsible for building and maintaining the bank's distributed super-computer, which runs compute-intensive workloads. Your mission is to provide a stable platform for distributed execution of...
-
Site Reliability Engineer
2 weeks ago
Singapore BARCLAYS EXECUTION SERVICES LIMITED Singapore Branch Full timeJob DescriptionWe are seeking a highly skilled Site Reliability Engineer - Compute Grid to join our team at Barclays Execution Services Limited Singapore Branch. As a key member of the Compute Grid team, you will play a crucial role in building and maintaining the bank's distributed super-computer which runs the bank's compute intensive workloads.**Company...
-
Compute Grid Site Reliability Engineer- Analyst
2 weeks ago
Singapore BARCLAYS EXECUTION SERVICES LIMITED Singapore Branch Full timeRoles & ResponsibilitiesPurpose of the roleTo apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them.AccountabilitiesAvailability, performance, and scalability of systems and services through proactive...
-
Singapore BARCLAYS EXECUTION SERVICES LIMITED Singapore Branch Full timePurpose of the roleTo apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them.AccountabilitiesAvailability, performance, and scalability of systems and services through proactive monitoring, maintenance, and...
-
Grid Reliability and Performance Consultant
2 weeks ago
Singapore MICHAEL PAGE INTERNATIONAL PTE LTD Full timeAbout the OpportunityWe are seeking a Grid Reliability and Performance Consultant to join our team. As a key member of our team, you will play a vital role in delivering exceptional technical service and support for our products. This involves providing guidance and expertise during new product installations, representing the company at on-site installation...
-
Compute Grid Operations Lead
1 week ago
Singapore BARCLAYS EXECUTION SERVICES LIMITED Singapore Branch Full timeCompany OverviewBARCLAYS EXECUTION SERVICES LIMITED Singapore Branch is a dynamic and innovative financial institution that offers a wide range of services to its clients. We are committed to staying ahead of the curve and embracing new technologies to improve our offerings.Job SummaryWe are seeking a highly motivated and experienced Distributed Systems...
-
Compute Grid Site Operations Expert
2 weeks ago
Singapore BARCLAYS EXECUTION SERVICES LIMITED Singapore Branch Full timeWe are looking for a seasoned professional to fill the role of Compute Grid Site Operations Expert. As a key member of the BARCLAYS EXECUTION SERVICES LIMITED Singapore Branch team, you will play a critical role in ensuring the smooth operation of our distributed super-computer.Responsibilities:Briefly, your main duties will include:Building and maintaining...
-
Cloud Infrastructure SRE Lead
2 weeks ago
Singapore BARCLAYS EXECUTION SERVICES LIMITED Singapore Branch Full timeRole OverviewThe Compute Grid Site Reliability Engineer - AVP plays a critical role in ensuring the reliability, availability, and scalability of the bank's distributed super-computer. This system is used for compute-intensive workloads and is built on a mix of on-prem and public cloud infrastructure. In this role, you will work closely with cross-functional...
-
Reliability and Scalability Specialist
2 weeks ago
Singapore BARCLAYS EXECUTION SERVICES LIMITED Singapore Branch Full timeRole Overview:We are seeking a highly skilled Compute Grid Site Reliability Engineer to join our team in Singapore. The successful candidate will be responsible for building and maintaining the bank's distributed super-computer, which runs compute-intensive workloads. You will work within the team's EngOps framework (a mix of SRE & DevOps) to continuously...
-
Site Reliability Engineer
2 weeks ago
Singapore Sea Limited Full timeEngineering and Technology - Infrastructure, Singapore - Entry Level Our DevOps Engineering team plays an important role in developing and maintaining the internal systems and tools for the Infrastructure team. As a Site Reliability Engineer, you are responsible for improving the availability and reliability of our Infrastructure services. - Responsible for...
-
Singapore BARCLAYS EXECUTION SERVICES LIMITED Singapore Branch Full timeAbout the Role:The Compute Grid team is responsible for building and maintaining the bank's distributed super-computer, which runs compute-intensive workloads. The system harnesses CPU capacity sourced from on-prem and public cloud. We are looking for a talented Compute Grid Site Reliability Engineer to join our team in Singapore.Key Responsibilities:Design...
-
Compute Grid Infrastructure Specialist
2 weeks ago
Singapore BARCLAYS EXECUTION SERVICES LIMITED Singapore Branch Full timeJob SummaryAs a Compute Grid Infrastructure Specialist, you will be responsible for designing, implementing, and managing the bank's distributed super-computer which runs the bank's compute intensive workloads. This is an exciting opportunity to join a dynamic team and contribute to the success of Barclays Execution Services Limited Singapore...
-
Site Reliability Specialist
2 weeks ago
Singapore BARCLAYS EXECUTION SERVICES LIMITED Singapore Branch Full timeWe are seeking a talented individual to fill the role of Site Reliability Specialist - Compute Infrastructure. In this position, you will be responsible for ensuring the reliability, availability, and scalability of the bank's distributed super-computer.About the Role:As a Site Reliability Specialist, you will work closely with the Engineering and...
-
Site Coordinator
3 days ago
Singapore URBAN GRID ENG PTE. LTD. Full timeUrban Grid Eng Pte. Ltd. is seeking a highly organized and proactive individual to join our team as a Site Coordinator. As a Site Coordinator, you will play a crucial role in ensuring the smooth operations and coordination of our site activities. This position requires strong leadership skills, excellent communication abilities, and the ability to multitask...
-
Site Reliability Engineer
7 days ago
Singapore JJ Consulting Services Full timeOur Client is a fast growing company in Singapore, who is seeking to recruit a Site Reliability Engineer. **Site Reliability Engineer** **Key Roles & Responsibilities** - Providing ancillary support of Enterprise-Grade Products and solutions at customer's sites - Ironing out deployment issues or challenges that our customers may face - Responsible for...
-
Senior Site Reliability Engineer
1 week ago
Singapore AKAMAI TECHNOLOGIES APJ PTE. LTD. Full timeAs a Senior Site Reliability Engineer, you will influence a wide array of teams. You will be responsible for the performance and reliability of Akamai’s delivery products by working with the Product, Engineering and Support teams to diagnose, mitigate and solve outages. You will have to solve some of the most complex problems in distributed systems at...
-
Senior Site Reliability Engineer
2 weeks ago
Singapore Sea Limited Full timeEngineering and Technology - Infrastructure, Singapore - Experienced (Individual Contributor) Our DevOps Engineering team plays an important role in developing and maintaining the internal systems and tools for the Infrastructure team. As a Senior Site Reliability Operation Engineer, you are responsible for improving the availability and reliability of our...
-
Site Reliability Engineer
7 days ago
Singapore THALES SOLUTIONS ASIA PTE. LTD. Full timeRoles & ResponsibilitiesDigital Competence Center (DCC)Thales IFE has decided to create a leading technology center in Singapore for its IFE Digital Engineering. It will leverage on unique digital skillset from Singapore and neighbouring countries on Cloud engineering. Thanks to a multi-year strategic plan, Thales is locating at WeWork@Suntec, a center that...