The N-Body Problem: Parallel Execution from Single-Person Egocentric Video Paper • 2512.11393 • Published 4 days ago • 1
EgoX: Egocentric Video Generation from a Single Exocentric Video Paper • 2512.08269 • Published 7 days ago • 36
From Next-Token to Next-Block: A Principled Adaptation Path for Diffusion LLMs Paper • 2512.06776 • Published 8 days ago • 22
MotionEdit: Benchmarking and Learning Motion-Centric Image Editing Paper • 2512.10284 • Published 5 days ago • 25
V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties Paper • 2512.11799 • Published 3 days ago • 24
PersonaLive! Expressive Portrait Image Animation for Live Streaming Paper • 2512.11253 • Published 4 days ago • 10
Exploring MLLM-Diffusion Information Transfer with MetaCanvas Paper • 2512.11464 • Published 3 days ago • 8
Structure From Tracking: Distilling Structure-Preserving Motion for Video Generation Paper • 2512.11792 • Published 3 days ago • 5
SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder Paper • 2512.11749 • Published 3 days ago • 32
DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry Paper • 2512.11558 • Published 3 days ago • 37
X-Humanoid: Robotize Human Videos to Generate Humanoid Videos at Scale Paper • 2512.04537 • Published 12 days ago • 6
T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground Paper • 2512.10430 • Published 5 days ago • 78
VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction Paper • 2511.23386 • Published 17 days ago • 14
Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning Paper • 2512.10534 • Published 4 days ago • 30
OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification Paper • 2512.10756 • Published 4 days ago • 31