3 10 5

Zihao Yue PRO

yuezih

https://yuezih.github.io/

yuezih

AI & ML interests

Multimodality

Recent Activity

upvoted a paper 1 day ago

HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing

upvoted a paper 16 days ago

Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization

liked a model about 2 months ago

XiaomiMiMo/MiMo-V2-Flash

View all activity

Organizations

upvoted a paper 1 day ago

HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing

Paper • 2602.03560 • Published 3 days ago • 40

upvoted a paper 16 days ago

Being-H0.5: Scaling Human-Centric Robot Learning for Cross-Embodiment Generalization

Paper • 2601.12993 • Published 18 days ago • 75

liked a model about 2 months ago

XiaomiMiMo/MiMo-V2-Flash

Text Generation • 310B • Updated Dec 18, 2025 • 72.3k • • 616

liked a Space 5 months ago

MiMo-Audio-Chat

💬

Chat with Xiaomi MiMo-Audio using voice

liked a model 5 months ago

XiaomiMiMo/MiMo-VL-7B-SFT-2508

Image-Text-to-Text • 8B • Updated Aug 21, 2025 • 360 • 34

upvoted a paper 6 months ago

DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning

Paper • 2508.05405 • Published Aug 7, 2025 • 64

upvoted a paper 7 months ago

Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos

Paper • 2507.15597 • Published Jul 21, 2025 • 34

updated a dataset 8 months ago

yuezih/Movie101

Viewer • Updated Jun 11, 2025 • 234k • 393 • 5

liked a dataset 8 months ago

yuezih/Movie101

Viewer • Updated Jun 11, 2025 • 234k • 393 • 5

published a dataset 8 months ago

yuezih/Movie101

Viewer • Updated Jun 11, 2025 • 234k • 393 • 5

authored a paper 8 months ago

MiMo-VL Technical Report

Paper • 2506.03569 • Published Jun 4, 2025 • 80

upvoted a paper 8 months ago

MiMo-VL Technical Report

Paper • 2506.03569 • Published Jun 4, 2025 • 80

upvoted a paper 9 months ago

VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning

Paper • 2505.12081 • Published May 17, 2025 • 18

authored 7 papers 9 months ago

Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation

Paper • 2306.13460 • Published Jun 23, 2023 • 2

Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective

Paper • 2402.14545 • Published Feb 22, 2024

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12, 2025 • 82

Zihao Yue PRO

AI & ML interests

Recent Activity

Organizations

yuezih's activity

MiMo-Audio-Chat