Research Engineer, Multimodal Model

3 months ago


Singapore BYTEDANCE PTE. LTD. Full time
Roles & Responsibilities

Established in 2023, the ByteDance Doubao (Seed) Team is dedicated to building industry-leading AI foundation models. We aim to do world-leading research and foster both technological and social progress.


With a long-term vision and a strong commitment to the AI field, the Team conducts research in a range of areas including natural language processing (NLP), computer vision (CV), and speech recognition and generation. It has labs and researcher roles in China, Singapore, and the US.


Leveraging substantial data and computing resources and through continued investment in these domains, our team has built a proprietary general-purpose model with multimodal capabilities. In the market, Doubao models power over 50 ByteDance apps and business lines, including Doubao, Coze, and Dreamina, and have been launched to external enterprise clients through Volcano Engine. The Doubao app is the most used AIGC app in China.


Why Join Us

Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we work toward every day. To us, every challenge, no matter how ambiguous, is an opportunity: to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve. Join us.


About the team

Welcome to the GAI-Vision team, where we lead the way in developing foundational models for multi-modal visual understanding and generation. Our mission is to solve the challenge of visual intelligence in AI. We conduct cutting-edge research in areas such as vision and language, large-scale vision models, and generative foundation models. Our team of experienced research scientists and engineers is dedicated to pushing the boundaries of foundation model research and implementing our innovations across diverse application scenarios. We foster a feedback-driven environment to continuously enhance our foundation technologies. Come join us in shaping the future of AI and transforming the product experience for users worldwide.


Responsibilities

- Explore large-scale and ultra-large-scale visual models and perform system optimization, including data construction, instruction fine-tuning, preference alignment, and model optimization.

- Conduct cutting-edge research and development in computer vision, natural language processing, machine learning, and general artificial intelligence, especially in multi-modality and vision and language.

- Publish our latest research results, and help to build our brand in the research community.

- Explore vision/multi-modality application models, and contribute to the development of new technologies and products leveraging artificial intelligence.


Qualifications

Minimum Qualifications

- Research and practical experience in one or more areas of computer vision, including multi-modal understanding, vision-language models (e.g., video captioning, VQA, text-to-video retrieval, and related topics), large-scale training, RLHF, multimodal generation (e.g., text-to-image, image, video, and 3D generation and editing), diffusion models, GANs, and transformers for generation tasks.

- Experience with vision-language models and with applying them to various downstream tasks.

- Coding skills in C/C++ and Python.

- Ability to collaborate effectively with team members.

- Ability to work independently.


Preferred Qualifications

- Experience working with and building large-scale datasets to scale up foundation models.

- Impactful publications in leading AI conferences (e.g., CVPR, ECCV, ICCV, NeurIPS, ICLR, SIGGRAPH, SIGGRAPH Asia) and journals (e.g., TPAMI, JMLR).

- Winning results in international academic competitions.

- Proficiency in at least one differentiable programming framework, such as PyTorch, TensorFlow, or JAX.


ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

