Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
26
2
Runpeng Dai
PRO
Leo-Dai
Follow
TongZheng1999's profile picture
1 follower
·
3 following
AI & ML interests
None yet
Recent Activity
liked
a Space
2 days ago
EfficientReasoning/efficient_reasoning_online_judgement
upvoted
a
paper
5 days ago
Training Data Efficiency in Multimodal Process Reward Models
authored
a paper
6 days ago
Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing
View all activity
Organizations
Leo-Dai
's models
17
Sort:Â Recently updated
Leo-Dai/PPO_BL_250_critic
4B
•
Updated
Aug 15, 2025
Leo-Dai/PPO_BL_200_critic
Updated
Aug 15, 2025
•
2
Leo-Dai/PPO_BL_300_actor
Updated
Aug 15, 2025
Leo-Dai/PPO_BL_250_actor
Updated
Aug 15, 2025
Leo-Dai/PPO_BL_300_critic
Updated
Aug 15, 2025
Leo-Dai/GRPO_BL_40
4B
•
Updated
Aug 15, 2025
Leo-Dai/GRPO_BL_30
4B
•
Updated
Aug 15, 2025
Leo-Dai/GRPO_BL_20
4B
•
Updated
Aug 15, 2025
Leo-Dai/GRPO_BL_400
4B
•
Updated
Aug 15, 2025
Leo-Dai/GRPO_BL_10
4B
•
Updated
Aug 15, 2025
Leo-Dai/GRPO_BL_350
4B
•
Updated
Aug 15, 2025
Leo-Dai/GRPO_BL_200
4B
•
Updated
Aug 13, 2025
Leo-Dai/GRPO_BL_150
4B
•
Updated
Aug 13, 2025
Leo-Dai/GRPO_BL_100
4B
•
Updated
Aug 13, 2025
Leo-Dai/GRPO_BL_300
4B
•
Updated
Aug 13, 2025
Leo-Dai/GRPO_BL_250
4B
•
Updated
Aug 13, 2025
Leo-Dai/GRPO_BL_50
4B
•
Updated
Aug 13, 2025