Conley Lee
ComingToy
AI & ML interests
None yet
Recent Activity
upvoted an article about 2 months ago
The N Implementation Details of RLHF with PPO
upvoted an article 3 months ago
StackLLaMA: A hands-on guide to train LLaMA with RLHF
Organizations
None yet