Ivy
FURUF
AI & ML interests
NLP RL
Recent Activity
upvoted
a
paper
12 days ago
Shaping capabilities with token-level data filtering
upvoted
a
paper
13 days ago
Reinforcement Learning via Self-Distillation
upvoted
a
paper
about 1 month ago
DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs
Organizations
None yet