Audio Synthesis Research Scientist, Speech and Audio

5 days ago


Singapore BYTEDANCE PTE. LTD. Full time

**About ByteDance**

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

**Why Join Us**

Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we aim towards achieving every day. To us, every challenge, no matter how ambiguous, is an opportunity; to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve. Join us.

**About The Team**

Speech & Audio (SA) team provides industry-leading ML algorithm capabilities and complete voice product solutions for all business lines, including B2B as well. We empower businesses in the fields of speech/song synthesis, audio understanding, ML education, etc., and solve basic problems such as audio and video content understanding and creation, and human-computer voice interaction. The research team of audio generation group is committed to promoting the cutting-edge development of generative models in the fields of speech and virtual human, following up the leading research work in the academic industry, improving the academic influence and building technical barriers of SA team in related fields.

**Responsibilities**
- Basic generative models
- High-quality and expressive text-to-speech models
- Speech-driven talking face generation and human pose generation
- Speech-to-speech translation and simultaneous translation
- Singing voice synthesis
- Audio generation conditioned on text, image and video
- Support the production of scalable and optimised AI/machine learning (ML) models
- Focus on building algorithms for the extraction, transformation and loading of large volumes of Realtime, unstructured data to deploy AI/ML solutions from theoretical data science models
- Run experiments to test the performance of deployed models, and identifies and resolves bugs that arise in the process
- Work with the relevant software platforms in which the models are deployed

**Qualifications**
- Ph.D student or Master student by research from top universities, majoring in EE or CS in the related areas (CV/NLP/Speech). Speech-related background is optional
- Having publications/submissions in relevant conferences/journals
- Familiar with the recent advance in the field of deep learning and generative models
- Self-driven, innovative, collaborative, with good communication and presentation skills
- Proficiency in one or more of the community open source tools such as Pytorch, Tensorflow, Jax

**Preferred Qualifications**
- Having publications/submissions in top-tier conferences/journals (e.g., ICML、ICLR、NeurIPS、CVPR、ICCV and TPAMI), or winning the best paper award/best paper nomination/other honors in relevant international conferences/journals

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and enrich life. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.



  • Singapore TikTok Full time

    Responsibilities TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo. Why Join Us At TikTok, our people are humble, intelligent, compassionate and creative. We create to...


  • Singapore SPECTRUM AUDIO VISUAL PTE. LTD. Full time

    Key Responsibilities: System configuration, Testing & Commissioning for Pro-Audio solution for large Venue and corporate meeting room that need integration with other Audio Visual and Videoconferencing solution. Conduct end user trainings to end users on how to use the Solution(s) implemented. Work closely with in house AV Designers and Engineers from...

  • Audio Ai Engineer

    5 days ago


    Singapore Zoom Full time

    **What you can expect** As an Audio AI Engineer, you will research and develop algorithms for accent conversion, voice conversion, speech synthesis, and speech recognition on low-latency streaming architectures. You’ll prototype and refine end-to-end audio models that enhance intelligibility and naturalness while maintaining speaker identity. Working...


  • Singapore Razer Full time

    Joining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work , offering you the opportunity to make an impact globally while working across a global team located across 5 continents. Razer is also a great place to work, providing you the unique, gamer-centric #LifeAtRazer experience that will...


  • Singapore Spectrum Audio Visual Pte Ltd Full time $40,000 - $60,000 per year

    Working Hour: Monday to Friday, 8.30am-5.30pm1 Year Renewable Contract RoleKey Responsibilities:Work closely with manufacturers and Engineers to implement and service video conferencing/ MS team solutionsConduct end user trainings to end users on how to use the Solution(s) implemented.Perform system checkingAssist in after sales servicesSystem configuration,...


  • Singapore Zoom Full time

    A leading video communications platform in Singapore is looking for an Audio AI Engineer to develop algorithms for accent conversion, voice conversion, speech synthesis, and speech recognition. This role requires a PhD or equivalent experience and proficiency in deep learning frameworks. You will work closely with global teams to optimize model performance...


  • Singapore Zoom Full time

    A global communications company in Singapore seeks an Audio AI Engineer to research and develop algorithms for voice conversion and speech synthesis. The role involves collaborating with product teams and optimizing model performance in real-time communication systems. Ideal candidates will have a PhD in a relevant field and proficiency in deep learning...


  • Singapore TikTok Full time

    TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo. Our team provides industry-leading ML algorithm capabilities and complete voice product solutions for all business...

  • Audio AI Engineer

    2 weeks ago


    Singapore Zoom Full time $80,000 - $150,000 per year

    What you can expectAs an Audio AI Engineer, you will research and develop algorithms for accent conversion, voice conversion, speech synthesis, and speech recognition on low-latency streaming architectures. You'll prototype and refine end-to-end audio models that enhance intelligibility and naturalness while maintaining speaker identity. Working closely with...

  • Software Engineer

    6 days ago


    Singapore NodeFlair Full time

    **Job Summary**: **Salary** S$9,786 - S$19,494 / Monthly EST **Job Type** Permanent **Seniority** Mid **Years of Experience** At least 3 years **Tech Stacks** C++ Go Java Linux NoSQL SQL C Python - Our team provides industry-leading ML algorithm capabilities and complete voice product solutions for all business lines, including toB business as well. We...