Datasets used in the paper "A Critical Look at Targeted Instruction Selection: Disentangling What Matters (and What Doesn’t)"
AI & ML interests
Data-Centric ML
Recent Activity
View all activity
Papers
A Critical Look at Targeted Instruction Selection: Disentangling What Matters (and What Doesn't)
Boomerang Distillation Enables Zero-Shot Model Size Interpolation
datasets 16
Harvard-DCML/tis-subset-datasets-Llama-2-7b-hf
Viewer
• Updated
• 300k • 64
Harvard-DCML/tis-quantile-datasets-gtr-t5-base
Viewer
• Updated
• 25k • 53
Harvard-DCML/tis-random-unbalanced
Viewer
• Updated
• 30k • 14
Harvard-DCML/tis-quantile-datasets-Olmo-3-1025-7B
Viewer
• Updated
• 50k • 46
Harvard-DCML/tis-quantile-datasets-SmolLM3-3B-Base
Viewer
• Updated
• 50k • 74
Harvard-DCML/tis-quantile-datasets-Qwen3-4B-Base
Viewer
• Updated
• 50k • 70
Harvard-DCML/tis-quantile-datasets-Llama-3.2-3B
Viewer
• Updated
• 50k • 76
Harvard-DCML/tis-quantile-datasets-Llama-2-7b-hf
Viewer
• Updated
• 50k • 103
Harvard-DCML/tulu-v2-10K-warmup-processed
Viewer
• Updated
• 10k • 19
Harvard-DCML/tis-subset-datasets-Olmo-3-1025-7B
Viewer
• Updated
• 300k • 24