FlySugar

SugarVapeur

AI & ML interests

Deep Learning, NLP, RL

Recent Activity

liked a dataset 27 days ago

ServiceNow/GroundCUA

upvoted a paper about 2 months ago

SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models

upvoted a paper 2 months ago

GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts

Organizations

None yet

upvoted a paper about 2 months ago

SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models

Paper • 2510.08531 • Published Oct 9 • 12

upvoted 2 papers 2 months ago

GSM8K-V: Can Vision Language Models Solve Grade School Math Word Problems in Visual Contexts

Paper • 2509.25160 • Published Sep 29 • 30

EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering

Paper • 2509.25175 • Published Sep 29 • 30

upvoted a paper 3 months ago

UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning

Paper • 2509.11543 • Published Sep 15 • 47

upvoted 4 papers 4 months ago

Cooper: Co-Optimizing Policy and Reward Models in Reinforcement Learning for Large Language Models

Paper • 2508.05613 • Published Aug 7 • 17

Test-Time Reinforcement Learning for GUI Grounding via Region Consistency

Paper • 2508.05615 • Published Aug 7 • 22

OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks

Paper • 2508.05614 • Published Aug 7 • 20

Aesthetics is Cheap, Show me the Text: An Empirical Evaluation of State-of-the-Art Generative Models for OCR

Paper • 2507.15085 • Published Jul 20 • 6

upvoted 8 papers 5 months ago

GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

Paper • 2506.03143 • Published Jun 3 • 53

Hierarchical Budget Policy Optimization for Adaptive Reasoning

Paper • 2507.15844 • Published Jul 21 • 16

A Survey on (M)LLM-Based GUI Agents

Paper • 2504.13865 • Published Mar 27 • 5

Think Twice, Click Once: Enhancing GUI Grounding via Fast and Slow Systems

Paper • 2503.06470 • Published Mar 9 • 3

LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization

Paper • 2507.15758 • Published Jul 21 • 35

The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs

Paper • 2507.11097 • Published Jul 15 • 64

SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation

Paper • 2506.03139 • Published Jun 3 • 17

GUI-G^2: Gaussian Reward Modeling for GUI Grounding

Paper • 2507.15846 • Published Jul 21 • 133