Zimo Xu's picture

2 11

Zimo Xu

Movix

·

Moviw

AI & ML interests

modality, reinforcement learning

Recent Activity

liked a model about 1 month ago

meta-llama/Llama-3.2-11B-Vision

upvoted an article 2 months ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

upvoted an article 2 months ago

ChatGPT 背后的“功臣”——RLHF 技术详解

View all activity

Organizations

None yet

Movix 's models

None public yet