
SRE Manager
4 days ago
Responsibilities
About TikTok
TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul, and Tokyo.
Why Join Us
Creation is the core of TikTok's purpose. Our platform is built to help imaginations thrive. This is doubly true of the teams that make TikTok possible.
Together, we inspire creativity and bring joy - a mission we all believe in and aim towards achieving every day.
To us, every challenge, no matter how difficult, is an opportunity: to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always.
At TikTok, we create together and grow together. That's how we drive impact - for ourselves, our company, and the communities we serve.
Join us.
About the team
TikTok and its affiliates are developing the next-generation high-performance analytical database, with a mission to enable efficient, real-time, data-driven decision-making on PB-scale data sets. The initial product was forked from ClickHouse and has since undergone a major re-architecture. The product now not only improves on the efficiency of ClickHouse but also fits into elastic cloud-native infrastructure with better scalability and resource utilization. After years of refinement in internal EB-scale scenarios, we are now ready to serve our business partners via various cloud vendors.
Our software engineers in the product infrastructure role combine software and systems engineering disciplines to run high-performance, large-scale distributed infrastructure. This means you will be deeply involved in the development lifecycle of critical software services, collaborating closely with product engineers to combine software code and systems knowledge to ensure that cloud-native OLAP engines are reliable, fault-tolerant, efficiently scalable and cost-effective. You will also leverage your software engineering expertise to develop software platforms and tools that optimise the operational and engineering efficiency of complex systems at scale, with a particular focus on improving the systems' observability, performance and maintainability.
**In this role, you will**:
- Build and manage the Global SRE team, including recruitment, training of new talent, system operation, maintenance and coordination, and team culture building.
- Improve cooperation mechanisms across teams, time zones and regions, and provide SRE solutions that fit actual business scenarios.
- Own SRE team planning and project management, guide basic SRE work to be more effective, and improve overall SRE efficiency.
- Develop process specifications and plans for compliant access, configuration, disaster recovery and fault handling on the critical paths of overseas SRE services.
- Continuously improve the core SRE capabilities of the OLAP engine in efficiency, cost, quality, security, and related areas.
- Develop automation, data visualization and automated monitoring processes to facilitate the optimization of the cloud-native OLAP engine infrastructure.
- Drive the design and engineering of tools, as well as platform solutions, to optimize product engineering and operational efficiency.
- Manage on-call processes to respond to performance and reliability issues, and establish best practices for coordinating escalation to resolve issues and minimize downtime.
**Qualifications**:
- Bachelor's degree or above in Computer Science or a related technical discipline, and good English communication skills.
- Familiar with SRE-related processes and with industry trends in SRE technology, with a strong ability to build an SRE system; 6+ years of SRE experience, with big-data or OLAP engine SRE experience preferred.
- Familiar with SRE technologies, including Kubernetes, Terraform, Ansible, Bash scripting, etc.
- Familiar with the cloud computing technologies of Amazon Web Services, Google Cloud Platform and other providers.
- Expertise in operating, deploying, and troubleshooting large-scale distributed systems, including high availability and quality assurance, with a strong focus on stability and performance.
- Possesses a strong sense of responsibility, a proactive team spirit, and a strong ability to comprehensively analyze and solve problems.
TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.
-
Engineer, SRE
1 week ago
Singapore Rakuten Viki Full time
Rakuten International oversees 7 businesses with over 4,000 employees globally. The brand is recognized for leadership and innovation in e-commerce, digital content, advertising, entertainment and communications, bringing the joy of discovery and access to more than 1 billion members across the world....
-
VP, Platform SRE Engineer, SRE
6 days ago
Singapore DBS Bank Full time
Job Objective: DBS Bank is looking for a Platform SRE Engineer with experience working on enterprise level data engineering, analytics, and observability applications. The SRE engineer would be responsible for ensuring high availability of the platform services and perform continuous improvements to increase the platform’s efficiency and resiliency. The SRE...
-
Engineer, SRE
3 days ago
Singapore Rakuten International Full time $80,000 - $120,000 per year
Job Description: Rakuten International oversees 7 businesses with over 4,000 employees globally. The brand is recognized for its leadership and innovation in e-commerce, digital content, advertising, entertainment and communications, bringing the joy of discovery and access to more than 1 billion members across the world. Our teams deliver on the company's...
-
VP, Technology Risk Manager, SRE&Governance, Group Technology
Singapore DBS Bank Full time
-
SRE Lead
2 weeks ago
Singapore pinely Full time
We are looking for an SRE Lead to join the Pinely team! The main role will be to manage an in-house infrastructure team, internal compute cluster, and development services. Responsibilities: Operational management of trading activities, proactive trading monitoring; Incident management, rapid escalation, and mitigation; On-call duty; Debugging and issue...
-
SVP, Problem
1 week ago
Singapore DBS Bank Limited Full time
Overview
The Role: This position is for an SRE Problem and Knowledge Management Lead within the enabling group, Site Reliability Engineering and Governance (SRE & Governance) department. This role is expected to strategically lead the conduct of incident retrospective/ problem management operations and in other SRE activities in general which pertains to...
-
VP, Problem
2 days ago
Singapore DBS Bank Limited Full time
The Role: This position is for an SRE Problem and Knowledge Management Team Lead within the enabling group, Site Reliability Engineering and Governance (SRE & Governance) department. This role is expected to strategically lead the conduct of incident retrospective/ problem management operations and in other SRE activities in general which pertains to...
-
Senior System Software Engineer
2 weeks ago
Singapore EXASOFT PTE. LTD. Full time
SRE/PRE (requires 10-15 years of experience and full SRE experience; SRE practices not only on cloud, multi-cloud and hybrid cloud, but also on data center sites)
Responsibilities and Requirements: Experience must be at least 9 years. Should have engineering skills; typically people hired for R&D. Experience in using Infrastructure as Code (IaC) tools. Strong...
-
Associate Engineer, SRE
1 week ago
Singapore Rakuten Viki Full time
Job Description: Rakuten International oversees 7 businesses with over 4,000 employees globally. The brand is recognized for its leadership and innovation in e-commerce, digital content, advertising, entertainment and...