JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-1epoch500steps • Text Generation • 8B • 5
JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-300steps • Text Generation • 8B • 2
JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-200steps • Text Generation • 8B • 5
JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-2-100steps • Text Generation • 8B • 1
JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-ER-v1-1-1430steps • Text Generation • 8B • 1
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-ispass-400steps • Text Generation • 8B • 2
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-vanillaRL-200steps • Text Generation • 8B • 2
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-200steps • Text Generation • 8B • 2
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-vanillaRL-100steps • Text Generation • 8B • 2
JerrrrryKun/Llama-3.1-8B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-100steps • Text Generation • 8B • 2
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-Sequential-NoPerturbationData-NoPL-400steps • Text Generation • 8B • 2
JerrrrryKun/OLMo-2-1124-7B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-100steps • Text Generation • 7B • 1
JerrrrryKun/OLMo-2-1124-7B-Instruct-LLM4Math-V2data-Sequential-vanillaRL-100steps • Text Generation • 7B • 1
JerrrrryKun/SmolLM3-3B-LLM4Math-V2data-Sequential-perturbationsignalonly-300steps • Text Generation • 3B • 2
JerrrrryKun/DeepMath-1.5B-LLM4Math-V2data-Sequential-vanillaRL-400steps • Text Generation • 2B • 1
JerrrrryKun/DeepMath-1.5B-LLM4Math-V2data-Sequential-perturbationsignalonly-400steps • Text Generation • 2B • 2
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-Sequential-perturbationsignalonly-400steps • Text Generation • 8B • 2
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-Sequential-vanillaRL-400steps • Text Generation • 8B • 2
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-Sequential-NoPerturbationData-NoPL-100steps • Text Generation • 8B • 2
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-Sequential-NoPerturbationData-100steps • Text Generation • 8B • 2
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-Sequential-300steps • Text Generation • 8B • 1
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-Sequential-NoPreconditionLabel-300steps • Text Generation • 8B • 1
JerrrrryKun/deepseek-math-7b-rl-LLM4Math-V2data-150steps • Text Generation • 7B • 2
JerrrrryKun/Qwen2.5-Math-7B-Instruct-LLM4Math-V2data-400steps • Text Generation • 8B • 2
JerrrrryKun/DeepSeek-R1-Distill-Qwen-7B-LLM4Math-V2data-250steps • Text Generation • 8B • 1