SWE-World: Building Software Engineering Agents in Docker-Free Environments Paper • 2602.03419 • Published 4 days ago • 38
Nemotron-Cascade Collection Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated 2 days ago • 51
FunReason-MT Technical Report: Overcoming the Complexity Barrier in Multi-Turn Function Calling Paper • 2510.24645 • Published Oct 28, 2025 • 10
Running 82 Unlocking On-Policy Distillation for Any Model Family 📝 82 Improve model performance by transferring knowledge between different model families