Algorithm Engineer, Large Language Model

2 days ago


Singapore Refine Group Full time

The Engineering and Technology team is at the core of the Shopee platform development. The team is made up of a group of passionate engineers from all over the world, striving to build the best systems with the most suitable technologies. Our engineers do not merely solve problems at hand; We build foundations for a long-lasting future. We don't limit ourselves on what we can or can't do; we take matters into our own hands even if it means drilling down to the bottom layer of the computing platform. Shopee's hyper-growing business scale has transformed most "innocent" problems into huge technical challenges, and there is no better place to experience it first-hand if you love technologies as much as we do.

About the Team:

Shopee will be prioritizing applicants who have a current right to work in Singapore , and do not require Shopee sponsorship of a visa .

Kindly note that you can only be considered in one recruitment process at a time within Sea Group and will be considered for jobs in the order that you have applied.

The Large Language Model team (LLM) is committed to building the leading multi-lingual large model in Southeast Asia, building a complete Model-as-a-Infrastructure product support system, and supporting the intelligent upgrade and transformation of the company's business.

Job Description:

  • The application exploration and implementation of large language models in the fields of e-commerce, gaming, and payments include building multi-language agents with technologies such as function calling, tool usage, RAG, and code interpreters. These technologies are utilized and applied in business areas such as customer service, shopping guidance, video, and search recommendations.
  • The research and implementation of pre-training and alignment algorithms include ultra-large-scale multilingual pre-training technology, Mixture-of-Experts model training, Instruction Pretraining, SFT, and RLHF. From a multilingual perspective, these efforts aim to reduce the hallucination problem of models, enhance safety capabilities, and improve long-text comprehension and Q&A performance.
  • Building a service framework and platform for large models, as well as accelerating inference, involves creating online service architectures, data, and evaluation platforms. This also includes exploring and implementing inference acceleration strategies such as Medusa, Speculative Decoding, multi-LoRA inference, and Pruning.
  • Establishing a comprehensive evaluation system for Southeast Asian multilingual large models involves providing standardized model evaluation capabilities and creating comprehensive evaluation datasets. This system will drive and refine improvements in large language models through model evaluation, addressing issues encountered in practical business scenarios and during technical iteration and optimization processes.
  • The data mining and optimization of large model algorithms involve constructing data systems for multilingual pre-training, instruction fine-tuning, and human preference behaviors. This includes establishing a comprehensive engineering system for model fine-tuning and dataset preparation to effectively improve the quality and delivery capability of datasets.

Requirements:

  • Bachelor's degree or above in Computer Science or related fields.
  • Excellent coding skills, data structure and basic algorithm skills, proficiency in Python/Pytorch coding, and harness the Hands-on ability.
  • Familiar with NLP and CV related algorithms and technologies, and those who are familiar with large model training and RL algorithms.
  • Familiar with the basic principles and training methods of industry-leading LLM (such as GPT, LLaMA), or familiar with the basic principles and training methods of mainstream multi-modal large models (such as Flamingo, LLaVA), and have research experience in text generation or dialogue systems, etc.
  • Excellent problem analysis and solving skills, able to deeply solve problems in large model training and application.
  • Good communication and collaboration skills, able to explore new technologies with the team and promote technological progress.
#J-18808-Ljbffr

  • Singapore beBeeAlgorithm Full time $80,000 - $120,000

    Job Title:Large Language Model Algorithm EngineerWe're seeking a skilled Large Language Model Algorithm Engineer to join our team. In this role, you will design and develop algorithms related to AI assistants, including but not limited to LLM post-training/SFT/MoE, user preference alignment with RL(such as DPO/GPRO).As a member of our team, you will be...


  • Singapore Shopee Full time

    DepartmentEngineering and Technology- LevelExperienced (Individual Contributor)- LocationSingaporeThe Engineering and Technology team is at the core of the Shopee platform development. The team is made up of a group of passionate engineers from all over the world, striving to build the best systems with the most suitable technologies. Our engineers do not...


  • Singapore ByteDance Full time

    Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content. Why Join...


  • Singapore BYTEPLUS PTE. LTD. Full time

    Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content. **Why Join...


  • Singapore Binance Full time

    Position : Large Language Model (LLM) Algorithm Engineer Company : Binance Job Type : Full-Time, Hybrid About the Job Binance is seeking top-tier innovators to join their lean, elite LLM Algorithm & Data Science Team . This team is focused on developing next-generation AI solutions in finance , blockchain , and beyond. If you're passionate about...


  • Singapore beBeeAlgorithm Full time $180,000 - $300,000

    Large Language Model EngineerJoin a company that specializes in large language models and apply for this role. Advanced post-training of large language models (e.g., SFT, RLHF/RLAIF, continual pretraining) is required. Aligning models for reliable JSON-schema function calls and external tool usage is essential. Design, deploy, and operate Model Context...


  • Singapore Shopee Full time

    Department Engineering and Technology - LevelExperienced (Individual Contributor) - LocationSingapore The Engineering and Technology team is at the core of the Shopee platform development. The team is made up of a group of passionate engineers from all over the world, striving to build the best systems with the most suitable technologies. Our engineers do...

  • Algorithm Engineer

    2 weeks ago


    Singapore ALPHA X TECHNOLOGY PTE. LTD. Full time

    Roles & ResponsibilitiesAlpha X is an innovative high-tech manufacturing technology company pioneering the integration of advanced automation, transportation and AI-driven solutions to revolutionize traditional manufacturing processes. We harness the power of artificial intelligence, machine learning, and robotics to optimize production efficiency, enhance...


  • Singapore beBeeLanguage Full time $120,000 - $180,000

    Large Language Model EngineerAbout the RoleThis is a challenging opportunity for an experienced engineer to join our team in building and deploying large language models. The successful candidate will be responsible for designing, implementing and optimizing model architectures, as well as ensuring seamless integration into production environments.Key...


  • Singapore beBeeOptimization Full time

    Large Model Optimization EngineerWe are seeking a skilled Large Model Optimization Engineer to join our team. In this role, you will be responsible for developing and optimizing large models for inference and training.Your primary focus will be on designing and optimizing self-developed NPU software and hardware systems to achieve high-performance...