arxiv:2505.20686
Mingyu Chen
MYC081
AI & ML interests
theory
Organizations
None yet
models 10
MYC081/SELM-Llama-3-8B-Instruct-DPO-iter-3
8B • Updated
MYC081/SELM-Zephyr-7B-iter-0
Updated
MYC081/Qwen2.5-3B-WPO-bf16-1
Text Generation • 3B • Updated
• 1
MYC081/Qwen2.5-3B-WPO-bf16-1-test
Updated
MYC081/Qwen2.5-1.5B-WPO-bf16-1
Updated
MYC081/Qwen2-0.5B-WPO-bf16-1
0.5B • Updated
MYC081/pythia-1b-tldr-xpo
7B • Updated
MYC081/pythia-6.9b-deduped-tldr-online-dpo
Updated
MYC081/Qwen2.5-0.5B-Online-DPO-PairRM
Updated
MYC081/pythia-2.8b-deduped-tldr-online-dpo
Updated