arxiv:2505.13417
Lin Nianyi
linny2002
AI & ML interests
None yet
Recent Activity
upvoted a paper about 2 months ago
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards
updated a model 5 months ago
THU-KEG/LLaDA-8B-BGPO-sudoku