Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math_smalllr Text Generation • 2B • Updated Feb 4, 2025 • 2
AlejandroOlmedo/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-8bit-mlx Text Generation • 2B • Updated Feb 23, 2025 • 45 • 3
AlejandroOlmedo/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math-4bit-mlx Text Generation • 1B • Updated Feb 23, 2025 • 18