unsloth/Qwen3-Next-80B-A3B-Instruct-GGUF Text Generation • 80B • Updated Jan 14 • 45.4k • 166
unsloth/Nemotron-3-Nano-30B-A3B-GGUF Text Generation • 32B • Updated Dec 31, 2025 • 121k • 271
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 Text Generation • 32B • Updated 10 days ago • 970k • 645
YOYO-AI/Qwen3-30B-A3B-YOYO-V5-Q4_K_M-GGUF Text Generation • 31B • Updated Nov 13, 2025 • 5 • 2
YOYO-AI/Qwen3-30B-A3B-YOYO-V4-Q4_K_M-GGUF Text Generation • 31B • Updated Oct 5, 2025 • 31 • 3
YOYO-AI/Qwen3-30B-A3B-YOYO-V3-Q4_K_M-GGUF Text Generation • 31B • Updated Sep 15, 2025 • 2 • 1
Gemma 3 Collection All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 54 items • Updated about 1 hour ago • 107
Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 75 items • Updated about 1 hour ago • 404
Recommended small models Collection This is everything recent smaller than ~25B parameters that are high quality/reputable • 19 items • Updated Nov 30, 2024 • 164