Vision Transformer Finetuning Benefits from Non-Smooth Components Paper • 2602.06883 • Published 4 days ago • 3
ReMiT: RL-Guided Mid-Training for Iterative LLM Evolution Paper • 2602.03075 • Published 8 days ago • 4
OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale Paper • 2602.05711 • Published 5 days ago • 9
OmniVideo-R1: Reinforcing Audio-visual Reasoning with Query Intention and Modality Attention Paper • 2602.05847 • Published 5 days ago • 11
QuantLRM: Quantization of Large Reasoning Models via Fine-Tuning Signals Paper • 2602.02581 • Published 10 days ago • 6
InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning Paper • 2602.06960 • Published 4 days ago • 10
Canzona: A Unified, Asynchronous, and Load-Balanced Framework for Distributed Matrix-based Optimizers Paper • 2602.06079 • Published 7 days ago • 16
DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos Paper • 2602.06949 • Published 4 days ago • 26
Baichuan-M3: Modeling Clinical Inquiry for Reliable Medical Decision-Making Paper • 2602.06570 • Published 5 days ago • 57
F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare Paper • 2602.06717 • Published 4 days ago • 66
OmniRad: A Radiological Foundation Model for Multi-Task Medical Image Analysis Paper • 2602.04547 • Published 6 days ago • 1
Beyond Unimodal Shortcuts: MLLMs as Cross-Modal Reasoners for Grounded Named Entity Recognition Paper • 2602.04486 • Published 7 days ago • 6
Agent-Omit: Training Efficient LLM Agents for Adaptive Thought and Observation Omission via Agentic Reinforcement Learning Paper • 2602.04284 • Published 7 days ago • 13
Horizon-LM: A RAM-Centric Architecture for LLM Training Paper • 2602.04816 • Published 6 days ago • 16
A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces Paper • 2602.03442 • Published 8 days ago • 19
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning Paper • 2602.04634 • Published 6 days ago • 89