Harvard-DCML/boomerang-qwen3-2.3B
Text Generation • 3B
Data-Centric ML
A Critical Look at Targeted Instruction Selection: Disentangling What Matters (and What Doesn't)
Boomerang Distillation Enables Zero-Shot Model Size Interpolation