Running 3.55k The Ultra-Scale Playbook 🌌 3.55k The ultimate guide to training LLM on large GPU Clusters
Symbolic Graphics Programming with Large Language Models Paper • 2509.05208 • Published Sep 5 • 46
SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator Paper • 2412.12094 • Published Dec 16, 2024 • 11 • 5
DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis Paper • 2405.14224 • Published May 23, 2024 • 16
SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator Paper • 2412.12094 • Published Dec 16, 2024 • 11
SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator Paper • 2412.12094 • Published Dec 16, 2024 • 11
SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator Paper • 2412.12094 • Published Dec 16, 2024 • 11 • 5
DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis Paper • 2405.14224 • Published May 23, 2024 • 16
Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding Paper • 2410.01699 • Published Oct 2, 2024 • 18
Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding Paper • 2410.01699 • Published Oct 2, 2024 • 18
meta-llama/Meta-Llama-3-8B-Instruct Text Generation • 8B • Updated Jun 18 • 1.29M • • 4.32k
Forward-Backward Reasoning in Large Language Models for Mathematical Verification Paper • 2308.07758 • Published Aug 15, 2023 • 4
Forward-Backward Reasoning in Large Language Models for Mathematical Verification Paper • 2308.07758 • Published Aug 15, 2023 • 4
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models Paper • 2309.12284 • Published Sep 21, 2023 • 18