The paper ‘How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites’ introduces InternVL 1.5, an open-source multimodal LLM aimed at narrowing the capability gap between open-source and proprietary commercial models.
InternVL 1.5 has shown competitive performance on several benchmarks against both open-source and proprietary models, making it a significant step forward for open multimodal models. Because the project is open source, it also encourages transparency and collaboration in the AI research community and may set a new standard for future developments.