-
Beyond Log Likelihood: Probability-Based Objectives for Supervised Fine-Tuning across the Model Capability Continuum
Paper • 2510.00526 • Published • 9 -
gaotang/figlet_font
Viewer • Updated • 45k • 28 -
gaotang/medical_sft_processed
Viewer • Updated • 23.5k • 34 -
gaotang/numina-cot-subset-67k
Viewer • Updated • 67.6k • 32
Gaotang Li
gaotang
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
1 day ago
Agentic Reasoning for Large Language Models
upvoted
a
paper
8 days ago
Your Group-Relative Advantage Is Biased
upvoted
a
paper
about 2 months ago
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models
Organizations
None yet