Green-VLA: Staged Vision-Language-Action Model for Generalist Robots Paper • 2602.00919 • Published 23 days ago • 286
Evolving Prompts In-Context: An Open-ended, Self-replicating Perspective Paper • 2506.17930 • Published Jun 22, 2025 • 19
DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21, 2025 • 510
KVAE 1.0 Collection KVAE 1.0 tokenizers for images (KVAE-2D-1.0) and video (KVAE-3D-1.0) are distributed under the MIT license (commercial use is permitted). • 2 items • Updated Dec 14, 2025 • 7