Zimo Xu
Movix
·
AI & ML interests
modality, reinforcement learning
Recent Activity
liked
a model
26 days ago
meta-llama/Llama-3.2-11B-Vision
upvoted
an
article
about 2 months ago
Illustrating Reinforcement Learning from Human Feedback (RLHF)
upvoted
an
article
about 2 months ago
ChatGPT 背后的“功臣”——RLHF 技术详解
Organizations
None yet