Home ยป Alibaba Opens Open Source AI Models that Learn Visual Data, Supporting English and Chinese Languages

Alibaba Opens Open Source AI Models that Learn Visual Data, Supporting English and Chinese Languages

Alibaba Cloud has introduced a large-scale open source language model called Large Vision Language, which has the ability to understand images and text.

Two models, Qwen-VL and Qwen-VL-Chat, have been trained for image understanding and conversation. With 7 billion parameters, Qwen-VL-Chat is capable of processing images, such as performing mathematical calculations, and generating conversational responses.

This model can also be used to help read Chinese signs for those who are unfamiliar with the language or assist individuals with visual impairments. Both Qwen-7B and Qwen-7B-Chat are available for download and use on ModelScope, Alibaba Cloud’s AI developer community, and Hugging Face.

More Reading

Post navigation

Leave a Comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Google Cloud Introduces Meta Llama 2 and Anthropic Claude 2 for Model Renting

Current Update of 20 Start-up Categories That Partners are Highly Interested in Investing in at Y Combinator.

Introducing Mistral’s NeMo 12B Language Model: The Upgrade from Mistral 7B, Featuring a 128k Context Window.