The Smol Training Playbook 📚 • The secrets to building world-class LLMs • 2.94k
Article • Fine-tuning LLMs to 1.58bit: extreme quantization made easy • Sep 18, 2024 • 273
google/embeddinggemma-300m Sentence Similarity • 0.3B • Updated Sep 25, 2025 • 731k • 1.44k
Article • Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers • Sep 11, 2025 • 178
Article • What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware • Aug 8, 2025 • 32
bartowski/mistralai_Mistral-Small-3.2-24B-Instruct-2506-GGUF Image-Text-to-Text • 24B • Updated Dec 19, 2025 • 6.86k • 38