arxiv:2507.21183
Xinpeng Wei
WindowsXp-Beta
AI & ML interests
LLM Systems
Recent Activity
upvoted
an
article
8 days ago
From GRPO to DAPO and GSPO: What, Why, and How
updated
a model
3 months ago
WindowsXp-Beta/RAPO
authored
a paper
4 months ago
MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge
Organizations
None yet