Recurrent-Depth VLA: Implicit Test-Time Compute Scaling of Vision-Language-Action Models via Latent Iterative Reasoning Paper • 2602.07845 • Published 4 days ago • 64
MOVA: Towards Scalable and Synchronized Video-Audio Generation Paper • 2602.08794 • Published 2 days ago • 142
AgentCPM-Report: Interleaving Drafting and Deepening for Open-Ended Deep Research Paper • 2602.06540 • Published 5 days ago • 20
Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models Paper • 2602.07026 • Published 9 days ago • 128
QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining Paper • 2602.07085 • Published 5 days ago • 176
VLS: Steering Pretrained Robot Policies via Vision-Language Models Paper • 2602.03973 • Published 8 days ago • 22
FS-Researcher: Test-Time Scaling for Long-Horizon Research Tasks with File-System-Based Agents Paper • 2602.01566 • Published 10 days ago • 46