TreeGRPO: Tree-Advantage GRPO for Online RL Post-Training of Diffusion Models Paper • 2512.08153 • Published 3 days ago • 4
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance Paper • 2512.08765 • Published 2 days ago • 108
TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows Paper • 2512.05150 • Published 8 days ago • 64
Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion Paper • 2512.04926 • Published 7 days ago • 40
PixelDiT: Pixel Diffusion Transformers for Image Generation Paper • 2511.20645 • Published 16 days ago • 26
RELIC: Interactive Video World Model with Long-Horizon Memory Paper • 2512.04040 • Published 8 days ago • 23
Generating an Image From 1,000 Words: Enhancing Text-to-Image With Structured Captions Paper • 2511.06876 • Published Nov 10 • 26
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published 9 days ago • 199
Flowing Backwards: Improving Normalizing Flows via Reverse Representation Alignment Paper • 2511.22345 • Published 14 days ago • 12
DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation Paper • 2511.19365 • Published 17 days ago • 63
Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks Paper • 2511.15065 • Published 22 days ago • 74
Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation Paper • 2511.14993 • Published 23 days ago • 222
Back to Basics: Let Denoising Generative Models Denoise Paper • 2511.13720 • Published 24 days ago • 65
UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions Paper • 2511.03334 • Published Nov 5 • 51
Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published Oct 30 • 117