Site Reliability Engineer
1 week ago
About the Role
We are looking for a skilled and experienced DevOps Engineer / Site Reliability
Engineer (SRE) to ensure the high availability, stability, and performance of our
business platform. This role will be responsible for designing and implementing scalable
and maintainable DevOps architecture and automation systems to enhance
operational efficiency. As a senior member, you will lead efforts in optimizing our
operational standards, managing risk assessments, and fostering collaboration with
our China-based operations team. If you are passionate about high-performance
systems, security, and automation, we welcome you to join our team.
Key Responsibilities
- Ensure high availability, stability, and performance of business platforms, developing optimization strategies and refining operational standards and procedures.
- Lead the design and implementation of scalable, maintainable DevOps architecture and automation systems to streamline and enhance operational processes.
- Oversee security risk assessments, and lead the creation and implementation of security strategies to maintain system security.
- Evaluate and review the system architecture, process logic, performance, and stability, working closely with SRE and developer teams in China to address challenges effectively.
- Act as the primary incident commander for production environment issues, leading team efforts in troubleshooting and resolution, and ensuring timely response and resolution.
- Stay updated on the latest trends in technology advancements, organizing team learning sessions to foster continuous improvement.
Desired Qualifications
- Bachelor's degree in Computer Science or a related field, with at least 4 years of experience in internet system operations or SRE roles.
- In-depth understanding of internet technology architecture, including expertise in microservices, Kubernetes, Docker, monitoring and alerting systems, CI/CD, logging systems, distributed caching, and database systems.
- Extensive experience in distributed systems and high-concurrency operations, with strong skills in fault diagnosis and system optimization.
- Proficient in cloud platform operations (e.g., AWS, Azure), with knowledge of MySQL, PostgreSQL, Redis, and familiarity with big data technologies and hybrid cloud architectures preferred.
- Skilled in at least one programming language such as Python, Go, or Java, with relevant development experience.
- Strong organizational and coordination skills, with the ability to guide team members in solving complex issues.
- Fluent in Mandarin to facilitate effective communication within a multilingual team environment.
Why Join Us
- Work with innovative DevOps and cloud technologies to drive impactful solutions.
- Be part of a collaborative, growth-oriented environment that emphasizes continuous learning.
- Engage in diverse DevOps areas, including system automation, security, and performance tuning, for a comprehensive experience.
Troubleshooting
Kubernetes
Azure
Big Data
Technology Architecture
High Availability
MySQL
Reliability
Logging
Multilingual
Distributed Systems
Python
Performance Tuning
Docker
Java
System Architecture
-
Site Reliability Engineer
7 days ago
Singapore This is an IT support group Full timeAs a key member of our team, you will play a crucial role in ensuring the reliability and efficiency of our plant's utilities and facilities.About the RoleThe Site Reliability Engineer (SRE) ensures that all utilities and facilities within the plant are functioning optimally. This includes managing utilities such as water, electricity, HVAC, compressed air,...
-
Site Reliability Engineer Leader
13 hours ago
Singapore OCBC Full timeJob Description:We are seeking a Site Reliability Engineer Leader to join our team at OCBC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our infrastructure. This role requires strong expertise in automating releases, continuous integration/delivery systems, and relevant infrastructure...
-
Site Reliability Engineer
2 weeks ago
Singapore COMBUILDER PTE LTD Full timeRoles & ResponsibilitiesWe are seeking talented and driven professionals to join our Site Reliability Engineering (SRE) team. This role involves helping organizations enhance the availability, performance, and resilience of their applications and services through the deployment and administration of Observability Platforms.Key ResponsibilitiesDeploy and...
-
Site Reliability Engineer
2 weeks ago
Singapore FUNFLY PTE. LTD. Full timeRoles & ResponsibilitiesPosition OverviewAs a site reliability engineer, you will be responsible for ensuring the smooth operation of game services by maintaining, monitoring, and responding to faults daily. They will develop automation tools to enhance operational efficiency and manage game servers for optimal performance. The role includes collaborating...
-
Senior Site Reliability Engineer
2 weeks ago
Singapore GK CONSULTING PTE. LTD. Full timeRoles & ResponsibilitiesWe're seeking an experienced Senior Site Reliability Engineer to ensure the reliability, availability, and performance of our cloud-based internet services.Key Responsibilities1. Own reliability, availability, and user experience for assigned cloud services2. Develop and implement service governance initiatives to increase reliability...
-
Site Reliability Engineer
3 weeks ago
Singapore TRINITY CONSULTING SERVICES PTE. LTD. Full timeRoles & Responsibilities· Must have minimum 5 years' experience.· Strong technical knowledge and experience in supporting enterprise-level applications.· Proficiency in troubleshooting application issues, performing log analysis, and using monitoring tools.· Experience with databases and SQL query language.· Familiarity with software development life...
-
Site Reliability Engineer
1 week ago
Singapore FLOWDESK ASIA PTE. LTD. Full timeRoles & ResponsibilitiesAbout the jobAre you passionate about maintaining robust and high-performing infrastructures? Do you thrive in managing complex network environments and ensuring system reliability?Join our infrastructure team and help us elevate operational excellence to new heights.As a Site Reliability Engineer at Flowdesk, you will be at the heart...
-
Site Reliability Engineer
1 week ago
Singapore HELLO PLANET PTE. LTD. Full timeRoles & ResponsibilitiesWe are a global dating app created to give everyone a chance at love. The sense of belonging and connectedness we get from relationships helps us survive and thrive, and we're working to make it a little easier for people to find that. We're inspired by the stories we hear from employees, friends, and family who have used our app to...
-
Site Reliability Manager
4 days ago
Singapore Oxford Knight Full timeRequirementsOxford Knight seeks a highly motivated and experienced Senior Site Reliability Engineer with a strong background in Linux administration, cloud computing, and programming languages (preferably Python). The ideal candidate should have a degree in Computer Science or a related field and excellent communication skills.Key Skills and Qualifications5+...
-
Site Reliability Engineer
4 days ago
Singapore TIKTOK PTE. LTD. Full timeRoles & ResponsibilitiesTikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo.Why Join UsAt TikTok, our people are humble, intelligent, compassionate and creative. We create...
-
Site Reliability Engineer
5 days ago
Singapore Gravitas Recruitment Group Full timeOur client, a leading investor in financial markets, are looking for an autonomous, critical thinking, Site Reliability Engineer to join their team in Singapore. The ideal candidate must have a strong academic background, having graduated from a top university with a bachelor's degree in computer science. This degree should have been applied...
-
Site Reliability Engineer
4 weeks ago
Singapore SCIENTE INTERNATIONAL PTE. LTD. Full timeRoles & ResponsibilitiesJob Summary:We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. As an SRE, you will play a critical role in enhancing system reliability, performance, and scalability while ensuring the seamless functioning of our production environments. This is an opportunity to work in a fast-paced, dynamic environment...
-
GEL – Site Reliability Engineer
3 weeks ago
Singapore TOSS-EX PTE. LTD. Full timeRoles & ResponsibilitiesRoles & ResponsibilitiesJob PurposeThe Site Reliability Engineer (SRE) combines software development and system engineering to build and run distributed solutions in a secured multi-tier heterogeneous environment to safeguard, provide and continuously improve the software and systems behind the organization’s cloud platform...
-
GEL – Site Reliability Engineer
3 weeks ago
Singapore TOSS-EX PTE. LTD. Full timeRoles & ResponsibilitiesRoles & ResponsibilitiesJob PurposeThe Site Reliability Engineer (SRE) combines software development and system engineering to build and run distributed solutions in a secured multi-tier heterogeneous environment to safeguard, provide and continuously improve the software and systems behind the organization's cloud platform...
-
Senior Site Reliability Engineer
5 days ago
Singapore GXS Bank Full timeAbout the Team: Our team treats infrastructure and operations as software engineering problems. We are responsible for building and progressing software platforms that enable the provisioning and management of all Digibank services in safe, reliable, and scalable ways. We consistently challenge the status quo and use new technologies to build platforms...
-
Site Reliability Engineer
2 weeks ago
Singapore SOURCEO PTE. LTD. Full timeRoles & ResponsibilitiesRequired Expertise and ExperienceAt least 3 years of experience in SRE, DevOps, or a related engineering role. Proficiency in Infrastructure as Code (IaC) using Terraform to manage complex infrastructure. Hands-on experience with log analytics and observability tools, including ELK (Elasticsearch, Logstash, Kibana) and the Grafana...
-
Site Reliability Engineer
4 weeks ago
Singapore BYTEDANCE PTE. LTD. Full timeRoles & ResponsibilitiesAbout Doubao (Seed)Founded in 2023, the ByteDance Doubao (Seed) Team, is dedicated to pioneering advanced AI foundation models. Our goal is to lead in cutting-edge research and drive technological and societal advancements.With a strong commitment to AI, our research areas span deep learning, reinforcement learning, Language, Vision,...
-
Associate VP
3 days ago
Singapore DBS Bank Limited Full timeBusiness Function Group Technology enables and empowers the bank with an efficient, nimble and resilient infrastructure through a strategic focus on productivity, quality & control, technology, people capability and innovation. In Group Technology, we manage the majority of the Bank's operational processes and inspire to delight our business partners...
-
Reliability Engineer
4 weeks ago
Singapore UNITED MICROELECTRONICS CORPORATION (SINGAPORE BRANCH) Full timeRoles & ResponsibilitiesJob Summary:We are seeking a motivated and detail-oriented Fab Reliability Engineer to join our dynamic team. The ideal candidate will play a crucial role in ensuring the reliability and performance of our manufacturing processes. This position involves process qualification, conformance, process change management, new process...
-
Associate VP
4 days ago
Singapore DBS Bank Limited Full timeBusiness Function Group Technology enables and empowers the bank with an efficient, nimble and resilient infrastructure through a strategic focus on productivity, quality & control, technology, people capability and innovation. In Group Technology, we manage the majority of the Bank's operational processes and inspire to delight our business partners...