arxiv:2505.20081
TengXiao
TTTXXX01
AI & ML interests
None yet
Organizations
models 96
TTTXXX01/SFT_model
7B • Updated
• 1
TTTXXX01/bce_0.1_800step
8B • Updated
TTTXXX01/global_step_1100
8B • Updated
TTTXXX01/global-step-920
8B • Updated
TTTXXX01/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
TTTXXX01/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation • 2B • Updated
• 19
TTTXXX01/Qwen2.5-1.5B-Open-R1-Distill
Text Generation • 2B • Updated
TTTXXX01/LLama-8B-Instruct-v0.1-MI-6e-7
8B • Updated
TTTXXX01/LLama-8B-Instruct-v0.1-MI-2e-5
8B • Updated
TTTXXX01/LLama-8B-Instruct-v0.1-MI-5e-7
8B • Updated
datasets 106
TTTXXX01/Teng_MATH_6K_Clustering
Viewer
• Updated
• 6k • 7
TTTXXX01/MATH-mix-6Ks-k60-spc100
Viewer
• Updated
• 6k • 10
TTTXXX01/DPO_Orz-30K_filtered
Viewer
• Updated
• 3k • 3
TTTXXX01/DPO_MathSub-30K_filtered
Viewer
• Updated
• 3k • 15
TTTXXX01/DPO_AceReason-Math_filtered
Viewer
• Updated
• 6.6k • 2
TTTXXX01/DPO_DAPO-Math-17k-Processed_filtered
Viewer
• Updated
• 2.58k • 4
TTTXXX01/MathSub-30K
Viewer
• Updated
• 9k • 10
TTTXXX01/MathSub-30K-up
Viewer
• Updated
• 9k • 8
TTTXXX01/diverse-semi-verifiable-tasks-o3-7500-o4-mini-high
Viewer
• Updated
• 10k • 2
TTTXXX01/new-wildchat-english-general
Viewer
• Updated
• 19k • 3