混元大模型运维工程师/ Hunyuan LLM Site Reliability Engineer

4 days ago


Singapur, Singapore Tencent Full time

Overview

Lead HR/Talent Partner for TiMi Global

Job description
  1. Responsible for the operation and maintenance of overseas model services at Hunyuan, ensuring stable, reliable, and efficient service operations
  2. Responsible for capacity management and planning, resource cost optimization, ensuring reasonable online service capacity and improving resource efficiency
  3. Responsible for continuous integration and delivery, efficient and automated operational optimization, enhancing service stability and research and development efficiency
  4. Participate in the design of online systems and various service architectures, providing professional solutions for stability and architecture improvement
  5. Analyze and deeply explore the shortcomings of existing systems, data-driven to find weak points, and promote system optimization implementation and improvement
  6. Pay attention to industry front-end technology trends, explore technologies and directions for automation and intelligence in the operation and maintenance of complex business systems
Job requirements
  1. Bachelor\'s degree or above, with 2 years or more experience in internet operations and maintenance
  2. Familiar with Linux operating system, with solid system management and network knowledge
  3. Familiar with deploying, configuring, and tuning components such as Nginx, Redis, MySQL
  4. Proficient in monitoring systems such as Zabbix, Prometheus, Grafana, real-time grasping the running status of overseas systems
  5. Proficient in at least one programming language (such as Python, Go, Shell, etc.), with experience in developing automated operational tools to meet the needs of complex and variable overseas operations and maintenance
  6. Familiar with mainstream public cloud operations and maintenance management overseas (such as AWS, Azure, etc.), with experience in containerization and microservices architecture, able to cope with the characteristics and differences of local cloud services
  7. Strong sense of work responsibility, good communication skills, learning ability, and team spirit
  8. Proficient in English and Chinese listening, speaking, reading, and writing, timely writing updated workflow and technical documents as required. Bilingualism required to work with both international stakeholders and China HQ based teammates
Seniority level
  • Associate
Employment type
  • Full-time
Industries
  • Software Development and Technology, Information and Media
#J-18808-Ljbffr

  • Singapur, Singapore IMAGE FRAME INVESTMENT (UK) LIMITED Full time

    Hunyuan LLM Site Reliability Engineer page is loadedHunyuan LLM Site Reliability Engineer Apply remote type Onsite locations Singapore-CapitaSky Malaysia-Kuala Lumpur time type Full time posted on Posted 12 Days Ago job requisition id R Business Unit Technology Engineering Group (TEG) is responsible for supporting the company and its business groups on...


  • Singapur, Singapore Tencent Full time

    Join to apply for the Hunyuan LLM Site Reliability Engineer role at Tencent . **Business Unit**Technology Engineering Group (TEG) is responsible for supporting the company and its business groups on technology and operational platforms, as well as the construction and operation of R&D management and data centers. TEG provides users with a full range of...


  • Singapur, Singapore ByteDance Full time

    Site Reliability Engineer Graduate (Network Automation) - 2026 start (BS/MS) Join to apply for the Site Reliability Engineer Graduate (Network Automation) - 2026 start (BS/MS) role at ByteDance Site Reliability Engineer Graduate (Network Automation) - 2026 start (BS/MS) 2 days ago Be among the first 25 applicants Join to apply for the Site Reliability...


  • Singapur, Singapore WeChat International Pte. Ltd. Full time

    Site Reliability Engineer page is loadedSite Reliability Engineer Apply remote type Onsite locations Singapore-CapitaSky time type Full time posted on Posted 30+ Days Ago job requisition id R Business Unit Technology Engineering Group (TEG) is responsible for supporting the company and its business groups on technology and operational platforms, as well as...


  • Singapur, Singapore IDEMIA Full time

    Join to apply for the Site Reliability Engineer role at IDEMIA Join to apply for the Site Reliability Engineer role at IDEMIA Get AI-powered advice on this job and more exclusive features. PurposeThis role plays a critical part in ensuring reliability, scalability, and performance of our systems and services. You will work closely with development and...

  • LLM Engineer

    4 days ago


    Singapur, Singapore Sonar Full time

    Join to apply for the LLM Engineer role at Sonar 1 month ago Be among the first 25 applicants Why Should I Apply At Sonar, we’re a group of brilliant, motivated, and driven professionals working hard to help organizations build responsible, secure, high-quality code quickly and systematically. We build solutions that don’t just solve symptoms of...


  • Singapur, Singapore ByteDance Full time

    Site Reliability Engineer, Machine Learning Systems - Singapore Site Reliability Engineer, Machine Learning Systems - Singapore 4 days ago Be among the first 25 applicants Direct message the job poster from ByteDance Hiring for Gen AI talents Globally! | ByteDance Seed Foundation & LLM Global Data | covering & ResponsibilitiesThe ByteDance Large Model Team...


  • Singapur, Singapore Beijing Foreign Enterprise Management Consultants Co.,Ltd. Full time

    Direct message the job poster from Beijing Foreign Enterprise Management Consultants Co.,Ltd. On behalf of Huawei, a world-renowned information and communication technology company, we are seeking passionate and talented individuals to join our team as Site Reliability Engineer Overview On behalf of Huawei, a world-renowned information and communication...


  • Singapur, Singapore Point72 Full time

    Join to apply for the Site Reliability Engineer role at Point72 About the role As part of Point72’s Technology Team, you will focus on developing and maintaining complex, distributed, real-time systems that support our Global Macro business. Your responsibilities will include optimizing operations through automation, building foundational SRE...

  • Site Reliability

    4 days ago


    Singapur, Singapore Canonical Full time

    Join to apply for the Site Reliability / Gitops Engineer role at Canonical 1 day ago Be among the first 25 applicants Join to apply for the Site Reliability / Gitops Engineer role at Canonical Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely...