ISTA-DASLab/Meta-Llama-3-8B-AQLM-PV-1Bit-1x16
Text Generation • 2B • Updated
• 1
None defined yet.
MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization
DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers