Vision-Language Model

3 weeks ago


Singapore RAKUTEN ASIA PTE. LTD. Full time
Roles & Responsibilities

Situated in the heart of Singapore's Central Business District, Rakuten Asia Pte. Ltd. is Rakuten's Asia Regional headquarters. Established in August 2012 as part of Rakuten's global expansion strategy, Rakuten Asia comprises various businesses that provide essential value-added services to Rakuten's global ecosystem. Through advertisement product development, product strategy, and data management, among others, Rakuten Asia is strengthening Rakuten Group's core competencies to take the lead in an increasingly digitalized world.


The Visual Intelligence Engineering Department (VIED), part of the AI and Data Division, delivers scalable computer vision technology to drive measurable business outcomes. Leveraging Rakuten's unique data and expertise, VIED develops innovative solutions in areas like content moderation, OCR, face verification, and Creative AI, with a focus on engineering excellence, reusability, and state-of-the-art CV algorihtm.

VIED's strategy includes advancing Creative AI for personalized, engaging content and building scalable systems for seamless testing and deployment of computer vision models. Committed to collaboration, standardized processes, and continuous improvement, VIED seeks top talent in computer vision and machine learning to create impactful, scalable solutions aligned with Rakuten's goals.


About the Role

We are seeking a driven Vision-Language Model (VLM) Engineer to spearhead the development of a next-generation VLM solution. You will focus on fine-tuning an existing VLM for various compliance-related tasks, unifying multiple models into one cohesive system. This position involves close collaboration with a distributed team, including members who have previously experimented with VLM fine-tuning and are eager to support your work.


Key Responsibilities

•Build upon existing VLM architectures and fine-tune them for compliance tasks

•Unify previously separate compliance models into a single deployment-ready solution.

•Conduct experiments to evaluate and improve the VLM’s accuracy, drawing insights from past attempts (e.g., fine-tuning Florence-2).

•Investigate methods for leveraging spatial and geometric understanding in next-generation VLMs.

•Present findings, progress updates, and recommendations to the broader team.

•Explore cutting-edge techniques (e.g., text prompting for image border detection, bounding box generation, and dimension measurement).


Qualifications

•Advanced degree in Computer Science, Machine Learning, or a related field.

•Demonstrable experience in training or fine-tuning large-scale vision-language or multimodal models.

•Proficiency in Python and popular ML frameworks (e.g., PyTorch, TensorFlow)

• Experience with transformer-based architectures, image processing, and NLP techniques.

•Familiarity with MLOps best practices (containerization, continuous integration, deployment).

•Strong communication abilities, with the capacity to convey technical concepts to both technical and non-technical stakeholders.

•Collaborative mindset and a willingness to partner with cross-functional, globally distributed teams.



Tell employers what skills you have

TensorFlow
Machine Learning
Image Processing
Data Management
Drawing
Computer Vision
Strategy
PyTorch
Compliance
Python
Containerization
Continuous Integration
Product Development
OCR
Computer Vision Technology
  • Vision-Language Model

    3 weeks ago


    Singapore RAKUTEN ASIA PTE. LTD. Full time

    Roles & ResponsibilitiesSituated in the heart of Singapore's Central Business District, Rakuten Asia Pte. Ltd. is Rakuten's Asia Regional headquarters. Established in August 2012 as part of Rakuten's global expansion strategy, Rakuten Asia comprises various businesses that provide essential value-added services to Rakuten's global ecosystem. Through...


  • Singapore NTU (Nanyang Technology University- Main Office-HR) Full time

    The National Centre for Research in Digital Trust (DTC) at Nanyang Technological University is seeking a researcher to join our team and contribute to the development of trust technologies.About the RoleWe aim to explore the interpretation of vision-language models, which process and understand both text and images.This requires analyzing how visual tokens...


  • Singapore NTU (Nanyang Technology University- Main Office-HR) Full time

    We are seeking a highly skilled Research Engineer I to join our team and contribute to our mission at NTU's National Centre for Research in Digital Trust (DTC).About the PositionThe successful candidate will play a key role in researching and developing trust technologies, with a focus on mechanistic interpretability of vision-language models.This is an...

  • AI Researcher

    3 weeks ago


    Singapore PERSOLKELLY SINGAPORE PTE. LTD. Full time

    Roles & ResponsibilitiesJob Responsibilities:• for research and development on the algorithm for Large Vision-Language Generation Models in the multimedia field, make breakthroughs in key algorithm technologies, such as controllable image generation, fine-grained generation content control, text-to-image generation, text-to-video generation, image editing,...


  • Singapore BYTEPLUS PTE. LTD. Full time

    Roles & ResponsibilitiesFounded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok, Helo, and Resso, as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create...


  • Singapore NTU (Nanyang Technology University- Main Office-HR) Full time

    About the RoleAs a key member of our research team, you will be responsible for designing and conducting research projects focused on Computer Vision. This includes developing and fine-tuning advanced AI models to analyze complex visual data from various sources and utilizing Large Language Models and advanced deep learning techniques to enhance image and...


  • Singapore BUNKA LANGUAGE SCHOOL PTE LTD Full time

    Roles & ResponsibilitiesAs a Japanese Language Teacher, you are a role model in terms of the way you speak Japanese.You must be able to connect with your students and understand their difficulties in learning the Language.You have to be able to make classes enjoyable yet knowledgeable.You must have native levels of understanding of the Japanese language and...


  • Singapore BUNKA LANGUAGE SCHOOL PTE LTD Full time

    Roles & ResponsibilitiesAs a Japanese Language Teacher, you are a role model in terms of the way you speak Japanese.You must be able to connect with your students and understand their difficulties in learning the Language.You have to be able to make classes enjoyable yet knowledgeable.You must have native levels of understanding of the Japanese language and...


  • Singapore NTU (Nanyang Technology University- Main Office-HR) Full time

    Nanyang Technological University's National Centre for Research in Digital Trust (DTC) seeks a researcher to join our team and contribute to the development of trust technologies.About the RoleWe aim to explore the interpretation of vision-language models, which process and understand both text and images.This requires analyzing how visual tokens are...


  • Singapore AILYTICS PTE. LTD. Full time

    Roles & ResponsibilitiesHere at Ailytics, we're building AI solutions to envision a safer world. By combining computer vision and predictive analytics, we enable organizations to proactively identify risks, optimize processes, and ultimately save lives. Our platforms are currently deployed all over the world, covering more than 300 million square metersAs a...


  • Singapore NEUROTREE CONSULTANCY PTE. LTD. Full time

    Roles & ResponsibilitiesComputer Vision Engineer Intern (AI & Movement Analytics)About UsWe're building an innovative screening app that analyzes motor abilities using video-based movement analysis. Our goal is to develop a computer vision-powered screening tool that supports real-time data processing, third-party integrations, and AI-driven analytics. Our...


  • Singapore AILYTICS PTE. LTD. Full time

    Roles & ResponsibilitiesHere at Ailytics, we’re building AI solutions to envision a safer world. By combining computer vision and predictive analytics, we enable organizations to proactively identify risks, optimize processes, and ultimately save lives. Our platforms are currently deployed all over the world, covering more than 300 million square meters!As...

  • Computer Vision Expert

    21 hours ago


    Singapore NTU (Nanyang Technology University- Main Office-HR) Full time

    About the PositionWe are seeking a highly skilled Computer Vision Expert to join our team at NTU's Alibaba-NTU Global e-Sustainability CorpLab (ANGEL).Main Responsibilities:Develop and fine-tune advanced AI models to analyze complex visual data from various sources.Collaborate with the team to design and conduct research projects focused on Computer...


  • Singapore NTU (Nanyang Technology University- Main Office-HR) Full time

    We are seeking a highly skilled Computer Vision Specialist to join our team at the National Institute of Education.About the RoleThis role involves developing and implementing computer vision and machine learning techniques to improve throwing proficiency in children at a primary school.Key ResponsibilitiesTo develop innovative computer vision algorithms for...


  • Singapore NTU (Nanyang Technology University- Main Office-HR) Full time

    About the RoleWe invite applications for the position of Research Scientist at the Alibaba-NTU Global e-Sustainability CorpLab (ANGEL).Design and conduct research projects focused on Computer Vision.Develop and fine-tune advanced AI models to analyze complex visual data.This is an excellent opportunity for a talented researcher to contribute to cutting-edge...

  • AI Vision Engineer

    1 day ago


    Singapore EVOLUTION RECRUITMENT SOLUTIONS PTE. LTD. Full time

    Roles & ResponsibilitiesIntroductionThis position is responsible for designing, developing, fine-tuning, and optimizing deep learning models for quality inspection applications using 3D and wearable sensors. You will contribute to the development of advanced AI algorithms for sensing methodologies, object detection, recognition, segmentation, pose...


  • Singapore VISION APPAREL PTE. LTD. Full time

    Roles & ResponsibilitiesVision Apparel Pte Ltd is a full-service operations specialist company with expertise in managing and optimizing every phase of production and delivery.Our expertise spans FOB, CM, CMPT coordination, merchandising, and customer support, with a strong focus on quality assurance throughout every stage, from sample development to final...

  • BIM Modeler

    1 day ago


    Singapore CHANGI AIRPORT CONSULTANTS PTE. LTD. Full time

    Roles & ResponsibilitiesJob DescriptionLead modeling efforts on civil and structural engineering projects. Work closely with design engineers to develop 3D models required for tender and construction drawings, and authority submissions. Maintain and update BIM models throughout the project lifecycle, involving in multi-party BIM coordination. Identify and...


  • Singapore MEMIONTEC PTE LTD Full time

    Roles & ResponsibilitiesResponsibilities:- Responsible for all technical drawings, 3D modelling and documentations- Create, modify existing engineering design and models- Ensure all structural drawings & specifications are fully compliant with project requirements- Ensure correct modelling as project requirement to avoid rework modelling· Ensure drawing and...


  • Singapore NTU (Nanyang Technology University- Main Office-HR) Full time

    Job DescriptionWe are seeking a highly motivated Postdoctoral Research Fellow to join our research team focusing on integrating multimodal large language models with embodied AI. This position will involve the development and deployment of Vision-Language-Action models for real-world robotic applications, such as autonomous task execution and human-robot...