As tech giants such as OpenAI and Google launch large language models (LLMs) to compete in the booming generative AI space, Chinese company Alibaba has launched two new artificial intelligence (AI) models that can carry out complex conversations and understand images.
According to Alibaba, while Qwen-VL can respond to open-ended queries related to different images and generate picture captions, Qwen-VL-Chat can engage in “complex interactions”.
For example, it can compare multiple image inputs and answer several rounds of questions.
Other tasks that Qwen-VL-Chat can perform include writing stories, creating images based on photos that a user inputs and solving mathematical equations shown in a picture.
Until now, companies in the space have focussed on generating content from text input by humans, but Qwen-VL-Chat, much like the latest version of OpenAI’s ChatGPT, can analyse images and respond in text.
The news comes soon after South Korean internet giant Naver unveiled HyperCLOVA X, its suite of generative AI tools, and the CLOVA X chatbot application.
Naver said that the product will provide AI-driven searches for users and offer customised services for enterprises.