th's picture

13 5

th

CHEN1594

·

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 2 months ago

Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions

Paper • 2507.05257 • Published Jul 7 • 14

upvoted a collection 2 months ago

ReasonMap

A fine-grained visual reasoning benchmark (We show more question types in the extension dataset.) • 3 items • Updated Oct 1 • 8

upvoted a paper 2 months ago

RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning

Paper • 2510.02240 • Published Oct 2 • 17

upvoted a paper 3 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 193

upvoted 6 papers 5 months ago

GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

Paper • 2506.03143 • Published Jun 3 • 53

ShowUI: One Vision-Language-Action Model for GUI Visual Agent

Paper • 2411.17465 • Published Nov 26, 2024 • 90

Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts

Paper • 2506.10357 • Published Jun 12 • 21

MiniCPM4: Ultra-Efficient LLMs on End Devices

Paper • 2506.07900 • Published Jun 9 • 92

AgentSynth: Scalable Task Generation for Generalist Computer-Use Agents

Paper • 2506.14205 • Published Jun 17 • 7

Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search

Paper • 2507.02652 • Published Jul 3 • 26

upvoted 2 papers 7 months ago

Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps

Paper • 2505.18675 • Published May 24 • 25

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11 • 153

upvoted a paper 8 months ago

Efficient Reasoning Models: A Survey

Paper • 2504.10903 • Published Apr 15 • 21