Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
kiante
xkianteb
Follow
xkianteb
xkianteb
AI & ML interests
None yet
Organizations
None yet
xkianteb
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
authored
a paper
over 1 year ago
Dataset Reset Policy Optimization for RLHF
Paper
•
2404.08495
•
Published
Apr 12, 2024
•
9