DFlash: Block Diffusion for Flash Speculative Decoding Paper • 2602.06036 • Published 8 days ago • 41
daVinci-Dev: Agent-native Mid-training for Software Engineering Paper • 2601.18418 • Published 18 days ago • 124
ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference Paper • 2511.10645 • Published Nov 13, 2025 • 6