Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2
Conley Lee
ComingToy
Follow
AI & ML interests
None yet
Recent Activity
upvoted
an
article
about 2 months ago
使用 PPO 算法进行 RLHF 的 N 步实现细节
upvoted
an
article
3 months ago
StackLLaMA: A hands-on guide to train LLaMA with RLHF
View all activity
Organizations
None yet
ComingToy
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
upvoted
an
article
about 2 months ago
view article
Article
使用 PPO 算法进行 RLHF 的 N 步实现细节
+1
Oct 24, 2023
•
4
upvoted
an
article
3 months ago
view article
Article
StackLLaMA: A hands-on guide to train LLaMA with RLHF
+5
Apr 5, 2023
•
48