
LMFlow

lmflow-optimalscale

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text
liked a model 17 days ago
nvidia/Nemotron-Orchestrator-8B
upvoted a paper about 1 month ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Organizations

OptimalScale

upvoted a paper 10 days ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published 13 days ago • 96
upvoted a paper about 1 month ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 225
upvoted a paper 2 months ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 124
upvoted a paper 10 months ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17, 2025 • 93