Ivy

FURUF

AI & ML interests

NLP, RL

Recent Activity

upvoted a paper 12 days ago

Shaping capabilities with token-level data filtering

upvoted a paper 13 days ago

Reinforcement Learning via Self-Distillation

upvoted a paper about 1 month ago

DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs

Organizations

None yet

FURUF's models

None public yet