deepseek-ai/DeepSeek-R1-0528 Text Generation โข 685B โข Updated May 29, 2025 โข 333k โข โข 2.39k
Running 3.65k The Ultra-Scale Playbook ๐ 3.65k The ultimate guide to training LLM on large GPU Clusters
ISTA-DASLab/Meta-Llama-3.1-70B-Instruct-AQLM-PV-2Bit-1x16 Text Generation โข 11B โข Updated Sep 17, 2024 โข 45 โข 46