AIMO
AI-MO/NuminaMath-7B-TIR • Text Generation • 7B • Updated Aug 14, 2024 • 351 • 350
Reward Bench Leaderboard 📐 • Running • Explore and compare LLM reward benchmark scores • 420
KTO: Model Alignment as Prospect Theoretic Optimization • Paper • 2402.01306 • Published Feb 2, 2024 • 21
Fater.ai
mistralai/Mistral-7B-Instruct-v0.1 • Text Generation • 7B • Updated Jul 24, 2025 • 576k • 1.83k
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking • Paper • 2403.09629 • Published Mar 14, 2024 • 79
Improve Mathematical Reasoning in Language Models by Automated Process Supervision • Paper • 2406.06592 • Published Jun 5, 2024 • 29
ARC
01-ai/Yi-Coder-9B-Chat • Text Generation • 9B • Updated Sep 12, 2024 • 7.73k • 212
AI-MO/NuminaMath-7B-TIR • Text Generation • 7B • Updated Aug 14, 2024 • 351 • 350