Two Giraffes in a Dirt Field: Using Game Play to Investigate Situation Modelling in Large Multimodal Models • Paper • 2406.14035 • Published Jun 20, 2024 • 13
Qwen2-VL • Collection • Vision-language model series based on Qwen2 • 16 items • Updated 5 days ago • 227
Vision Language Leaderboards • Collection • All the vision language leaderboards • 7 items • Updated Aug 24, 2024 • 21