Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning Paper • 2512.07461 • Published 3 days ago • 61
AI & Human Co-Improvement for Safer Co-Superintelligence Paper • 2512.05356 • Published 6 days ago • 8
From Imitation to Discrimination: Toward A Generalized Curriculum Advantage Mechanism Enhancing Cross-Domain Reasoning Tasks Paper • 2512.02580 • Published 9 days ago • 27
Guided Self-Evolving LLMs with Minimal Human Supervision Paper • 2512.02472 • Published 9 days ago • 48
Video Generation Models Are Good Latent Reward Models Paper • 2511.21541 • Published 14 days ago • 45
Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning Paper • 2511.19900 • Published 16 days ago • 46
Insights from the ICLR Peer Review and Rebuttal Process Paper • 2511.15462 • Published 21 days ago • 6
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning Paper • 2511.16043 • Published 21 days ago • 105
First Frame Is the Place to Go for Video Content Customization Paper • 2511.15700 • Published 21 days ago • 52
VisPlay: Self-Evolving Vision-Language Models from Images Paper • 2511.15661 • Published 21 days ago • 42
P1: Mastering Physics Olympiads with Reinforcement Learning Paper • 2511.13612 • Published 23 days ago • 132