Arch-Router: Aligning LLM Routing with Human Preferences Paper • 2506.16655 • Published Jun 19, 2025 • 17
Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published Aug 5, 2025 • 136
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8, 2025 • 206
moonshotai/Kimi-K2-Instruct-0905 Text Generation • 1T • Updated 24 days ago • 24.6k • • 676
NousResearch/DeepHermes-Egregore-v2-RLAIF-8b-Atropos Reinforcement Learning • 8B • Updated Apr 29, 2025 • 7 • 6