Ling Xing

ling441

AI & ML interests

None yet

Recent Activity

upvoted a collection about 2 months ago

Visual instruction datasets for visual language models

upvoted a paper 3 months ago

Glance: Accelerating Diffusion Models with 1 Sample

upvoted a paper 4 months ago

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

Organizations

None yet

upvoted a collection about 2 months ago

Visual instruction datasets for visual language models

Collection

Collections of multimodal (image+text) instruction finetuning datasets tailored for visual language models like LLaVA, Fuyu, or IDEFICS. • 5 items • Updated Nov 21, 2023 • 2

upvoted a paper 3 months ago

Glance: Accelerating Diffusion Models with 1 Sample

Paper • 2512.02899 • Published Dec 2, 2025 • 30

upvoted 2 papers 4 months ago

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

Paper • 2511.02778 • Published Nov 4, 2025 • 102

See the Text: From Tokenization to Visual Reading

Paper • 2510.18840 • Published Oct 21, 2025 • 4

authored a paper 4 months ago

See the Text: From Tokenization to Visual Reading

Paper • 2510.18840 • Published Oct 21, 2025 • 4

commented a paper 4 months ago

See the Text: From Tokenization to Visual Reading

Paper • 2510.18840 • Published Oct 21, 2025 • 4

authored a paper 4 months ago

Vision-centric Token Compression in Large Language Model

Paper • 2502.00791 • Published Feb 2, 2025 • 1

upvoted a paper 4 months ago

Vision-centric Token Compression in Large Language Model

Paper • 2502.00791 • Published Feb 2, 2025 • 1

liked a dataset 7 months ago

floatai/TKEval

Preview • Updated Dec 11, 2024 • 22 • 2

liked a dataset 9 months ago

databricks/databricks-dolly-15k

Viewer • Updated Jun 30, 2023 • 15k • 16.4k • 924

liked 2 datasets 10 months ago

gpt4life/alpaca_claud_filtered

Viewer • Updated Jul 20, 2023 • 5.31k • 25 • 12

GAIR/lima

Viewer • Updated Jun 8, 2023 • 1.33k • 1.2k • 454

upvoted a paper 10 months ago

LIMA: Less Is More for Alignment

Paper • 2305.11206 • Published May 18, 2023 • 27

upvoted a paper 11 months ago

V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models

Paper • 2504.06148 • Published Apr 8, 2025 • 13

liked a dataset 12 months ago

CohereLabs/m-WildVision

Viewer • Updated Apr 15, 2025 • 11.5k • 451 • 22

liked a Space 12 months ago

Chat With Janus-Pro-7B

🌍

2.02k

A unified multimodal understanding and generation model.

liked a model about 1 year ago

deepseek-ai/Janus-Pro-1B

Any-to-Any • Updated Feb 1, 2025 • 7.52k • 472

upvoted a paper about 1 year ago

TextAtlas5M: A Large-scale Dataset for Dense Text Image Generation

Paper • 2502.07870 • Published Feb 11, 2025 • 45

liked a dataset about 1 year ago

CSU-JPG/TextAtlas5M

Viewer • Updated Oct 14, 2025 • 5.4M • 2.85k • 36
