Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ZhenghaiXue
/
Qwen2.5-7B-SimpleTIR
like
1
Reinforcement Learning
Safetensors
hkust-nlp/SimpleRL-Zoo-Data
agentica-org/DeepScaleR-Preview-Dataset
English
qwen2
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
README.md exists but content is empty.
Downloads last month
89
Safetensors
Model size
8B params
Tensor type
BF16
·
Chat template
Files info
Video Preview
Reinforcement Learning
loading
Model tree for
ZhenghaiXue/Qwen2.5-7B-SimpleTIR
Base model
Qwen/Qwen2.5-7B
Finetuned
(
798
)
this model
Datasets used to train
ZhenghaiXue/Qwen2.5-7B-SimpleTIR
agentica-org/DeepScaleR-Preview-Dataset
Viewer
•
Updated
Feb 10, 2025
•
40.3k
•
8.71k
•
183
hkust-nlp/SimpleRL-Zoo-Data
Viewer
•
Updated
Mar 25, 2025
•
53.1k
•
813
•
10
Collection including
ZhenghaiXue/Qwen2.5-7B-SimpleTIR
SimpleTIR
Collection
2 items
•
Updated
Jul 8, 2025