PartEdit: Fine-Grained Image Editing using Pre-Trained Diffusion Models Paper • 2502.04050 • Published Feb 6 • 1
FloorplanQA: A Benchmark for Spatial Reasoning in LLMs using Structured Representations Paper • 2507.07644 • Published Jul 10 • 4
AraLingBench: A Human-Annotated Benchmark for Evaluating Arabic Linguistic Capabilities of Large Language Models Paper • 2511.14295 • Published 22 days ago • 71
Mind-the-Glitch: Visual Correspondence for Detecting Inconsistencies in Subject-Driven Generation Paper • 2509.21989 • Published Sep 26 • 22
Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale Paper • 2509.14008 • Published Sep 17 • 88
PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes Paper • 2505.05288 • Published May 8 • 14
Vision Arena (Testing VLMs side-by-side) • Display image analysis results • 557