Unlocking On-Policy Distillation for Any Model Family 📝 Running • 89 • Visualize on-policy distillation for any model family
M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis Paper • 2502.11824 • Published Feb 17, 2025 • 3
The Smol Training Playbook 📚 Running on CPU Upgrade • Featured • 3.02k • The secrets to building world-class LLMs
Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published Oct 30, 2025 • 127