1 articles
Multimodal large models are developing rapidly, with representative models like BLIP-2, MiniGPT-4, Flamingo, LLaVA, and Qwen2.5-VL emerging.