Stabilizing MoE Reinforcement Learning by Aligning Training and Inference Routers Paper • 2510.11370 • Published Oct 13, 2025 • 3
MVP-Human Dataset for 3D Human Avatar Reconstruction from Unconstrained Frames Paper • 2204.11184 • Published Apr 24, 2022
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published Nov 9, 2025 • 129