3 41 13

Shengyuan Ding

ChrisDing1105

https://github.com/SYuan03

SYuan03

AI & ML interests

SII is an institution dedicated to innovation in education and research in the field of AI.

Recent Activity

upvoted a paper 4 days ago

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

commented on a paper 4 days ago

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

upvoted a paper 5 days ago

ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation

View all activity

Organizations

None yet

upvoted a paper 4 days ago

ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning

Paper • 2512.05111 • Published 4 days ago • 44

upvoted a paper 5 days ago

ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation

Paper • 2512.03036 • Published 6 days ago • 20

upvoted a paper 29 days ago

DeepEyesV2: Toward Agentic Multimodal Model

Paper • 2511.05271 • Published Nov 7 • 42

upvoted 3 papers about 1 month ago

UniREditBench: A Unified Reasoning-based Image Editing Benchmark

Paper • 2511.01295 • Published Nov 3 • 37

Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning

Paper • 2510.27606 • Published Oct 31 • 27

STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence

Paper • 2510.24693 • Published Oct 28 • 18

upvoted 2 papers about 2 months ago

InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models

Paper • 2510.11341 • Published Oct 13 • 34

MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

Paper • 2510.08540 • Published Oct 9 • 109

upvoted 4 papers 2 months ago

upvoted 2 papers 3 months ago

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

Paper • 2508.20096 • Published Aug 27 • 36

Self-Rewarding Vision-Language Model via Reasoning Decomposition

Paper • 2508.19652 • Published Aug 27 • 84

upvoted a paper 4 months ago

Hi3DEval: Advancing 3D Generation Evaluation with Hierarchical Validity

Paper • 2508.05609 • Published Aug 7 • 29

upvoted a paper 5 months ago

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction

Paper • 2507.15852 • Published Jul 21 • 38

upvoted 2 papers 7 months ago

CPGD: Toward Stable Rule-based Reinforcement Learning for Language Models

Paper • 2505.12504 • Published May 18 • 24

MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision

Paper • 2505.13427 • Published May 19 • 26

upvoted a collection 8 months ago

MM-IFEngine

Collection

[ICCV 2025] Official Implementation of "MM-IFEngine: Towards Multimodal Instruction Following" • 2 items • Updated Jul 16 • 6

upvoted a paper 8 months ago

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

Paper • 2504.07956 • Published Apr 10 • 47

Shengyuan Ding

AI & ML interests

Recent Activity

Organizations

ChrisDing1105's activity