AI & ML interests: None yet
Organizations: None yet
EricLabile/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_generation_num_4 • Text Generation • 2B • Updated • 3
EricLabile/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_generation_num_2 • Text Generation • 2B • Updated • 2
EricLabile/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_generation_num_6 • Text Generation • 2B • Updated • 2
EricLabile/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_generation_num_2_epoch_2 • Text Generation • 2B • Updated • 2
EricLabile/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_generation_num_1 • Updated
EricLabile/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_generation_num_12 • 2B • Updated
EricLabile/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_generation_num_3 • Text Generation • 2B • Updated • 2
EricLabile/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-3-24-1300 • Updated
EricLabile/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-3-24 • Updated
EricLabile/Qwen-2.5-7B-Simple-RL • Updated
EricLabile/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-3-23 • Updated
EricLabile/DeepSeek-R1-Distill-Qwen-1.5B-GRPO • Updated
EricLabile/Llama-3.1-8B-Instruct-VP_Finetuned_350_iter_lr_5 • Updated
EricLabile/Llama-3.1-8B-Instruct-VP_Finetuned_600_step • Updated
EricLabile/Llama-3.1-8B-Instruct-VP_Finetuned_epoch_10 • Updated
EricLabile/Llama-3.1-8B-Instruct-VP_Finetuned • Updated