SLA2: Sparse-Linear Attention with Learnable Routing and QAT • Paper • 2602.12675 • Published 17 days ago • 53
AgenticPay: A Multi-Agent LLM Negotiation System for Buyer-Seller Transactions • Paper • 2602.06008 • Published 25 days ago • 4
MomaGraph: State-Aware Unified Scene Graphs with Vision-Language Model for Embodied Task Planning • Paper • 2512.16909 • Published Dec 18, 2025 • 3
XQuant: Breaking the Memory Wall for LLM Inference with KV Cache Rematerialization • Paper • 2508.10395 • Published Aug 14, 2025 • 42
QuantSpec: Self-Speculative Decoding with Hierarchical Quantized KV Cache • Paper • 2502.10424 • Published Feb 5, 2025 • 1
Pre-training Auto-regressive Robotic Models with 4D Representations • Paper • 2502.13142 • Published Feb 18, 2025 • 6
Exploring the Intersection of Large Language Models and Agent-Based Modeling via Prompt Engineering • Paper • 2308.07411 • Published Aug 14, 2023 • 2
Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language • Paper • 2306.16410 • Published Jun 28, 2023 • 29