Zimo Xu
Movix
·
AI & ML interests
modality, reinforcement learning
Recent Activity
liked
a model
24 days ago
meta-llama/Llama-3.2-11B-Vision
upvoted
an
article
about 1 month ago
Illustrating Reinforcement Learning from Human Feedback (RLHF)
upvoted
an
article
about 1 month ago
ChatGPT 背后的“功臣”——RLHF 技术详解
Organizations
None yet