UniPixel Collection [NeurIPS 2025] UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning • 5 items • Updated about 8 hours ago
UniPixel Collection [NeurIPS 2025] UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning • 5 items • Updated about 8 hours ago
UniPixel Collection [NeurIPS 2025] UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning • 5 items • Updated about 8 hours ago
A Survey on Video Temporal Grounding with Multimodal Large Language Model Paper • 2508.10922 • Published Aug 7, 2025 • 1
UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning Paper • 2509.18094 • Published Sep 22, 2025 • 4
UniPixel Collection [NeurIPS 2025] UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning • 5 items • Updated about 8 hours ago
UniPixel Collection [NeurIPS 2025] UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning • 5 items • Updated about 8 hours ago
PosterLLaVA Collection PosterLLaVa: Constructing a Unified Multi-modal Layout Generator with LLM • 3 items • Updated Apr 3, 2025