13 22 1

Jinyang Wu

Jinyang23

https://orcid.org/my-orcid?orcid=0009-0006-0220-616X

jinyangwu

AI & ML interests

large language models, reasoning, agentic rl

Recent Activity

upvoted a paper 1 day ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

upvoted a paper 8 days ago

Query as Anchor: Scenario-Adaptive User Representation via Large Language Model

upvoted a paper 15 days ago

MOVA: Towards Scalable and Synchronized Video-Audio Generation

View all activity

Organizations

None yet

upvoted a paper 1 day ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published 20 days ago • 21

upvoted a paper 8 days ago

Query as Anchor: Scenario-Adaptive User Representation via Large Language Model

Paper • 2602.14492 • Published 9 days ago • 18

upvoted a paper 15 days ago

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Paper • 2602.08794 • Published 15 days ago • 153

authored 2 papers 15 days ago

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Paper • 2602.05843 • Published 19 days ago • 57

Exploring Knowledge Purification in Multi-Teacher Knowledge Distillation for LLMs

Paper • 2602.01064 • Published 24 days ago • 2

upvoted a paper 16 days ago

OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions

Paper • 2602.05843 • Published 19 days ago • 57

submitted a paper to Daily Papers 16 days ago

Exploring Knowledge Purification in Multi-Teacher Knowledge Distillation for LLMs

Paper • 2602.01064 • Published 24 days ago • 2

upvoted 2 papers 19 days ago

HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing

Paper • 2601.21459 • Published 26 days ago • 9

TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents

Paper • 2602.02196 • Published 22 days ago • 34

upvoted a paper 20 days ago

SafeGround: Know When to Trust GUI Grounding Models via Uncertainty Calibration

Paper • 2602.02419 • Published 22 days ago • 4

upvoted a paper 21 days ago

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Paper • 2601.22060 • Published 26 days ago • 156

upvoted a paper 22 days ago

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published 22 days ago • 243

upvoted a paper 23 days ago

SSL: Sweet Spot Learning for Differentiated Guidance in Agentic Optimization

Paper • 2601.22491 • Published 26 days ago • 12

submitted a paper to Daily Papers 23 days ago

SSL: Sweet Spot Learning for Differentiated Guidance in Agentic Optimization

Paper • 2601.22491 • Published 26 days ago • 12

New activity in Jinyang23/Spark-1.5B-ScienceWorld 25 days ago

Update README.md

#2 opened 25 days ago by

shuo-yan

New activity in Jinyang23/Spark-1.5B-WebShop 25 days ago

Update README.md

#2 opened 25 days ago by

shuo-yan

New activity in Jinyang23/Spark-1.5B-ALFWorld 25 days ago

Update README.md

#2 opened 25 days ago by

shuo-yan

authored 2 papers 26 days ago

Double: Breaking the Acceleration Limit via Double Retrieval Speculative Parallelism

Paper • 2601.05524 • Published Jan 9 • 1

Spark: Strategic Policy-Aware Exploration via Dynamic Branching for Long-Horizon Agentic Learning

Paper • 2601.20209 • Published 28 days ago • 22

updated a model 27 days ago

Jinyang23/Spark-1.5B-ALFWorld

Reinforcement Learning • 2B • Updated 25 days ago • 10

Jinyang Wu

AI & ML interests

Recent Activity

Organizations

Jinyang23's activity

Update README.md

Update README.md

Update README.md