Shanghai AI Lab Launches New-Generation Shusheng Vision Large Model: 6-Billion-Parameter Vision Encoder Enables Fine-Grained Alignment of Vision and Language

Shanghai AI Lab has released InternVL-6B, a new generation of its Shusheng vision large model. The model's vision encoder has 6 billion parameters, and it uses a progressive alignment technique that fuses contrastive and generative training to achieve fine-grained alignment between the large vision model and large language models on Internet-scale data.

Beyond these technical features, the model can process subtle visual information in complex images and perform image-text generation tasks. It can also recognize and interpret the information in complex pages, and even solve the math and science problems they contain.

The new-generation Shusheng vision large model from Shanghai AI Lab combines technical innovation with practical utility, and its image and text processing capabilities are worth watching.

This article was submitted by a user or anonymous contributor and does not represent the position of Mass Intelligence. All content in this article, including images and videos, is copyrighted by the original author. For related issues, please refer to this site's disclaimer. In case of any infringement of rights, please contact the operator of this website, and we will handle it as stated. Link to this article: https://dzzn.com/en/2024/3070.html