Site Reliability Engineer
5 days ago
TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo.
Why Join Us
At TikTok, our people are humble, intelligent, compassionate and creative. We create to inspire - for you, for us, and for more than 1 billion users on our platform. We lead with curiosity and aim for the highest, never shying away from taking calculated risks and embracing ambiguity as it comes. Here, the opportunities are limitless for those who dare to pursue bold ideas that exist just beyond the boundary of possibility. Join us and make impact happen with a career at TikTok.
This position is with TikTok's Stability Assurance Team. The team is responsible for ensuring that the services provided by TikTok are highly reliable with low-latency. Reliability assurance is complex and systematic for any massive application system and the team focuses on optimizing the application architecture from end to end; driven by data analysis, with automatic and intelligent failure recovery.
Job Responsibilities:
1. Ensure the online stability of the core system such as TikTok/Live, quickly respond to online accidents and build mechanisms and platforms to improve processing efficiency.
2. Participate in the construction of operation and maintenance tools and platforms, and promote the automation of operation and maintenance.
3. Find system weaknesses and improve projects on the ground through continuous and comprehensive data operations (including availability indicators, historical accidents, resource utilization, etc.),
4. Accumulate best practices in operation and maintenance, provide guidance for business architecture design and component selection, and output operation and maintenance technical documents;
5. Promote the improvement of service reliability, scalability and performance optimization to ensure system SLA.
Qualifications:
- Bachelor's Degree or above, Major in Computer Science;
- Solid basic knowledge of computer software; understand the relevant principles of Linux operating system, storage, network IO, etc.;
- Familiar with one or more programming languages, such as Python/Go/Java/PHP/C/C++;
- Have the ability to solve problems systematically, good communication skills, and a strong sense of responsibility
TikTok is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At TikTok, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.
Tell employers what skills you haveApplication Architecture
MASSIVE
Scalability
Kubernetes
Budget Management
Reliability
Administration Management
Good Communication Skills
Distributed Systems
Reliability Engineering
Python
Infrastructure Architecture
RedHat
Architecture Design
Technical Consultation
Technical Engineering
Docker
Java
Linux
Failure Analysis
-
Site Reliability Engineer Leader
1 day ago
Singapore OCBC Full timeJob Description:We are seeking a Site Reliability Engineer Leader to join our team at OCBC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our infrastructure. This role requires strong expertise in automating releases, continuous integration/delivery systems, and relevant infrastructure...
-
Site Reliability Engineer
2 weeks ago
Singapore COMBUILDER PTE LTD Full timeRoles & ResponsibilitiesWe are seeking talented and driven professionals to join our Site Reliability Engineering (SRE) team. This role involves helping organizations enhance the availability, performance, and resilience of their applications and services through the deployment and administration of Observability Platforms.Key ResponsibilitiesDeploy and...
-
Site Reliability Engineer
2 weeks ago
Singapore FUNFLY PTE. LTD. Full timeRoles & ResponsibilitiesPosition OverviewAs a site reliability engineer, you will be responsible for ensuring the smooth operation of game services by maintaining, monitoring, and responding to faults daily. They will develop automation tools to enhance operational efficiency and manage game servers for optimal performance. The role includes collaborating...
-
Senior Site Reliability Engineer
2 weeks ago
Singapore GK CONSULTING PTE. LTD. Full timeRoles & ResponsibilitiesWe're seeking an experienced Senior Site Reliability Engineer to ensure the reliability, availability, and performance of our cloud-based internet services.Key Responsibilities1. Own reliability, availability, and user experience for assigned cloud services2. Develop and implement service governance initiatives to increase reliability...
-
Site Reliability Engineer
3 weeks ago
Singapore TRINITY CONSULTING SERVICES PTE. LTD. Full timeRoles & Responsibilities· Must have minimum 5 years' experience.· Strong technical knowledge and experience in supporting enterprise-level applications.· Proficiency in troubleshooting application issues, performing log analysis, and using monitoring tools.· Experience with databases and SQL query language.· Familiarity with software development life...
-
Site Reliability Engineer
2 weeks ago
Singapore FLOWDESK ASIA PTE. LTD. Full timeRoles & ResponsibilitiesAbout the jobAre you passionate about maintaining robust and high-performing infrastructures? Do you thrive in managing complex network environments and ensuring system reliability?Join our infrastructure team and help us elevate operational excellence to new heights.As a Site Reliability Engineer at Flowdesk, you will be at the heart...
-
Site Reliability Engineer
2 weeks ago
Singapore HELLO PLANET PTE. LTD. Full timeRoles & ResponsibilitiesWe are a global dating app created to give everyone a chance at love. The sense of belonging and connectedness we get from relationships helps us survive and thrive, and we're working to make it a little easier for people to find that. We're inspired by the stories we hear from employees, friends, and family who have used our app to...
-
Site Reliability Engineer
2 weeks ago
Singapore PATSNAP PTE. LTD. Full timeRoles & ResponsibilitiesAbout the RoleWe are looking for a skilled and experienced DevOps Engineer / Site ReliabilityEngineer (SRE) to ensure the high availability, stability, and performance of ourbusiness platform. This role will be responsible for designing and implementing scalableand maintainable DevOps architecture and automation systems to...
-
Site Reliability Manager
4 days ago
Singapore Oxford Knight Full timeRequirementsOxford Knight seeks a highly motivated and experienced Senior Site Reliability Engineer with a strong background in Linux administration, cloud computing, and programming languages (preferably Python). The ideal candidate should have a degree in Computer Science or a related field and excellent communication skills.Key Skills and Qualifications5+...
-
Site Reliability Engineer
6 days ago
Singapore Gravitas Recruitment Group Full timeOur client, a leading investor in financial markets, are looking for an autonomous, critical thinking, Site Reliability Engineer to join their team in Singapore. The ideal candidate must have a strong academic background, having graduated from a top university with a bachelor's degree in computer science. This degree should have been applied...
-
Site Reliability Engineer
4 weeks ago
Singapore SCIENTE INTERNATIONAL PTE. LTD. Full timeRoles & ResponsibilitiesJob Summary:We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. As an SRE, you will play a critical role in enhancing system reliability, performance, and scalability while ensuring the seamless functioning of our production environments. This is an opportunity to work in a fast-paced, dynamic environment...
-
GEL – Site Reliability Engineer
3 weeks ago
Singapore TOSS-EX PTE. LTD. Full timeRoles & ResponsibilitiesRoles & ResponsibilitiesJob PurposeThe Site Reliability Engineer (SRE) combines software development and system engineering to build and run distributed solutions in a secured multi-tier heterogeneous environment to safeguard, provide and continuously improve the software and systems behind the organization's cloud platform...
-
GEL – Site Reliability Engineer
3 weeks ago
Singapore TOSS-EX PTE. LTD. Full timeRoles & ResponsibilitiesRoles & ResponsibilitiesJob PurposeThe Site Reliability Engineer (SRE) combines software development and system engineering to build and run distributed solutions in a secured multi-tier heterogeneous environment to safeguard, provide and continuously improve the software and systems behind the organization’s cloud platform...
-
Senior Site Reliability Engineer
6 days ago
Singapore GXS Bank Full timeAbout the Team: Our team treats infrastructure and operations as software engineering problems. We are responsible for building and progressing software platforms that enable the provisioning and management of all Digibank services in safe, reliable, and scalable ways. We consistently challenge the status quo and use new technologies to build platforms...
-
Site Reliability Engineer
2 weeks ago
Singapore SOURCEO PTE. LTD. Full timeRoles & ResponsibilitiesRequired Expertise and ExperienceAt least 3 years of experience in SRE, DevOps, or a related engineering role. Proficiency in Infrastructure as Code (IaC) using Terraform to manage complex infrastructure. Hands-on experience with log analytics and observability tools, including ELK (Elasticsearch, Logstash, Kibana) and the Grafana...
-
Site Reliability Engineer
4 weeks ago
Singapore BYTEDANCE PTE. LTD. Full timeRoles & ResponsibilitiesAbout Doubao (Seed)Founded in 2023, the ByteDance Doubao (Seed) Team, is dedicated to pioneering advanced AI foundation models. Our goal is to lead in cutting-edge research and drive technological and societal advancements.With a strong commitment to AI, our research areas span deep learning, reinforcement learning, Language, Vision,...
-
Associate VP
4 days ago
Singapore DBS Bank Limited Full timeBusiness Function Group Technology enables and empowers the bank with an efficient, nimble and resilient infrastructure through a strategic focus on productivity, quality & control, technology, people capability and innovation. In Group Technology, we manage the majority of the Bank's operational processes and inspire to delight our business partners...
-
Reliability Engineer
4 weeks ago
Singapore UNITED MICROELECTRONICS CORPORATION (SINGAPORE BRANCH) Full timeRoles & ResponsibilitiesJob Summary:We are seeking a motivated and detail-oriented Fab Reliability Engineer to join our dynamic team. The ideal candidate will play a crucial role in ensuring the reliability and performance of our manufacturing processes. This position involves process qualification, conformance, process change management, new process...
-
Associate VP
5 days ago
Singapore DBS Bank Limited Full timeBusiness Function Group Technology enables and empowers the bank with an efficient, nimble and resilient infrastructure through a strategic focus on productivity, quality & control, technology, people capability and innovation. In Group Technology, we manage the majority of the Bank's operational processes and inspire to delight our business partners...
-
Senior Site Reliability Engineer
6 days ago
Singapore Oxford Knight Full timeSalary: up to 250-275k SGD base Summary High-frequency prop trading firm with offices worldwide looking for skilled Senior Site Reliability Engineer developer to join their High Performance Computing team, developing and supporting their large-scale compute and storage platform. This platform is designed to solve demanding problems - both business and...