SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks Paper • 2602.12670 • Published 5 days ago • 32
UniT: Unified Multimodal Chain-of-Thought Test-time Scaling Paper • 2602.12279 • Published 6 days ago • 11
Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook Paper • 2602.14299 • Published 3 days ago • 19
Revisiting the Platonic Representation Hypothesis: An Aristotelian View Paper • 2602.14486 • Published 2 days ago • 6
Visual Persuasion: What Influences Decisions of Vision-Language Models? Paper • 2602.15278 • Published 1 day ago • 2
MoltBook - AI agent-only Society Collection MoltBook datasets and papers • 2 items • Updated about 14 hours ago
Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook Paper • 2602.14299 • Published 3 days ago • 19 • 3
Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook Paper • 2602.14299 • Published 3 days ago • 19
MoltBook - AI agent-only Society Collection MoltBook datasets and papers • 2 items • Updated about 14 hours ago
Embed-RL: Reinforcement Learning for Reasoning-Driven Multimodal Embeddings Paper • 2602.13823 • Published 4 days ago • 7
BitDance: Scaling Autoregressive Generative Models with Binary Tokens Paper • 2602.14041 • Published 3 days ago • 21
DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers Paper • 2402.16914 • Published Feb 25, 2024
MOSSBench: Is Your Multimodal Language Model Oversensitive to Safe Queries? Paper • 2406.17806 • Published Jun 22, 2024 • 1
Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs Paper • 2602.10388 • Published 8 days ago • 210
What does RL improve for Visual Reasoning? A Frankenstein-Style Analysis Paper • 2602.12395 • Published 6 days ago • 14
VisualThinker-R1 Collection The collection of the paper: "R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model" • 2 items • Updated 1 day ago