Site Reliability Engineer

1 month ago


Singapore, Singapore CITADEL ENTERPRISE (SINGAPORE) PTE. LIMITED Full time
Roles & Responsibilities

Citadel’s Site Reliability Engineers (SREs) bring reliability engineering practices to the financial trading field, applying innovation and cutting-edge technology to reduce complexity and improve performance. SREs are responsible for taking applications to production, providing early support for applications in development, and ensuring crisp application function throughout their lifetime. SREs have a deep understanding of how applications function and are able to change them to reach production quality.

Individuals in SRE will work closely with application development teams in PTE on automation and application refactoring. In some instances, SRE and app dev engineers will move between teams to allow maximum cross-pollination.

Depending on the situation, the SRE team may be designing and deploying a new generation of production infrastructure.

This role will primarily involve working with a cohesive team of engineers, developers, and trade support teams to build world-class systems and the necessary tools to maintain and constantly improve them. The position calls for someone willing to innovate, automate, and continuously use measurements and statistics to improve. Team members have backgrounds in various areas, including software development, networking, UNIX internals, and large-scale systems administration.


Responsibilities:

· Manage and provide technical and non-technical guidance and support for the growth of the SREs on the team and engineers across other teams

· Understand SRE principles, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts, with a keen eye for opportunities to eliminate toil through code and process improvements

· Ensure the reliability, availability, and performance of applications

· Own the automation of repetitive tasks and resolution of systematic issues

· Identify and deliver engineering solutions for issues based on root cause analysis

· Own incident management and resolution

· Lead by evangelizing the SRE mindset to other teams

· Provide support and ensure applications are production-ready

· Work across geo-distributed teams


Qualifications:

· Strong background in computer science fundamentals, data structures, algorithms, distributed systems

· Fully proficient in at least one modern structured programming language (Python or Java)

· Experience in building and leading engineering teams, ideally SRE or Production Engineering

· Comfortable with a range of current software development tools and practices (testing, source control, build systems, CI/CD, etc.)

· A basic working understanding of TCP/IP networking, LAN, and WAN, as well as Linux internals

· Experience with building and managing highly reliable large-scale systems

· Excellent written and verbal communication skills

· Strong entrepreneurial spirit

· A passion for learning, adapting to changing requirements and technology, and inventing new approaches to complex problems


Education:

Bachelor’s or Master’s degree in Computer Engineering / Computer Science or an allied field.


Skills:

Data Structures
Application Development
Root Cause Analysis
Unix
Administration
Reliability
Distributed Systems
Reliability Engineering
Networking
Python
Technical Consultation
Java
Linux
Software Development
Incident Management
