Hugging Face
melsiddieg's Collections
DiffusionLLMs
Arudi
Biomedical
from_scratch_pretrain
bert and friends
Audiovisual
Research and Optimization
Visual and OCR
finetune_datasets
Audiovisual
Updated Oct 22
microsoft/VibeVoice-1.5B • Text-to-Speech • 3B • Updated Sep 1 • 390k • 2.07k
ibm-granite/granite-docling-258M • Image-Text-to-Text • 0.3B • Updated Sep 23 • 115k • 1.05k
deepseek-ai/DeepSeek-OCR • Image-Text-to-Text • 3B • Updated Nov 4 • 5.39M • 2.95k
Qwen/Qwen3-VL-2B-Thinking • Image-Text-to-Text • 2B • Updated Oct 20 • 36.8k • 88
datalab-to/chandra • Image-to-Text • 9B • Updated Oct 21 • 87.1k • 409
Qwen/Qwen3-VL-2B-Instruct • Image-Text-to-Text • 2B • Updated Oct 23 • 537k • 223
PokeeAI/pokee_research_7b • Text Generation • 8B • Updated Oct 23 • 109k • 100