Hugging Face
Models: 6,143
Sort: Trending
Active filters: image-text-to-text
bartowski/microsoft_Fara-7B-GGUF • Image-Text-to-Text • 8B • Updated 17 days ago • 34.3k • 25
NexaAI/AutoNeural • Image-Text-to-Text • Updated 9 days ago • 40 • 11
alecccdd/moondream3-preview-4bit • Image-Text-to-Text • Updated 7 days ago • 257 • 4
sensenova/SenseNova-SI-1.1-Qwen3-VL-8B • Image-Text-to-Text • 9B • Updated 2 days ago • 253 • 4
sensenova/SenseNova-SI-1.2-InternVL3-8B • Image-Text-to-Text • 8B • Updated 1 day ago • 191 • 4
vikhyatk/moondream2 • Image-Text-to-Text • 2B • Updated Sep 23 • 1.76M • 1.35k
microsoft/Florence-2-base • Image-Text-to-Text • 0.2B • Updated Aug 4 • 517k • 322
Qwen/Qwen2.5-VL-72B-Instruct • Image-Text-to-Text • 73B • Updated Jun 6 • 116k • 571
fancyfeast/llama-joycaption-beta-one-hf-llava • Image-Text-to-Text • 8B • Updated May 16 • 71.7k • 270
zai-org/GLM-4.1V-9B-Thinking • Image-Text-to-Text • 10B • Updated Oct 25 • 332k • 757
google/medgemma-27b-it • Image-Text-to-Text • 29B • Updated Jul 10 • 15.7k • 237
zai-org/GLM-4.5V • Image-Text-to-Text • 108B • Updated Oct 25 • 48.5k • 697
opendatalab/MinerU2.5-2509-1.2B • Image-Text-to-Text • 1B • Updated Sep 29 • 1.31M • 292
Qwen/Qwen3-VL-30B-A3B-Thinking • Image-Text-to-Text • 31B • Updated 15 days ago • 57.4k • 163
ByteDance/Dolphin-1.5 • Image-Text-to-Text • 0.4B • Updated 29 days ago • 1.63k • 32
Qwen/Qwen3-VL-2B-Instruct-FP8 • Image-Text-to-Text • 2B • Updated Oct 20 • 24.2k • 29
Qwen/Qwen3-VL-2B-Thinking • Image-Text-to-Text • 2B • Updated Oct 20 • 36.1k • 89
Keyven/german-ocr • Image-Text-to-Text • 2B • Updated 7 days ago • 14 • 3
mlx-community/GLM-4.6V-4bit • Image-Text-to-Text • Updated 3 days ago • 411 • 3
llava-hf/llava-1.5-7b-hf • Image-Text-to-Text • 7B • Updated Jun 6 • 1.38M • 325
openbmb/MiniCPM-V-2_6 • Image-Text-to-Text • 8B • Updated Jun 13 • 91.8k • 1.02k
Qwen/Qwen2-VL-2B-Instruct • Image-Text-to-Text • 2B • Updated Jan 12 • 1.87M • 473
Qwen/Qwen2-VL-7B-Instruct • Image-Text-to-Text • 8B • Updated Feb 6 • 1.51M • 1.25k
HuggingFaceTB/SmolVLM2-256M-Video-Instruct • Image-Text-to-Text • 0.3B • Updated Apr 8 • 110k • 85
docling-project/SmolDocling-256M-preview • Image-Text-to-Text • 0.3B • Updated Sep 17 • 118k • 1.6k
Qwen/Qwen2.5-VL-7B-Instruct-AWQ • Image-Text-to-Text • 8B • Updated Apr 6 • 181k • 95
google/gemma-3-27b-pt • Image-Text-to-Text • 27B • Updated Mar 21 • 11.1k • 113
CohereLabs/aya-vision-8b • Image-Text-to-Text • 9B • Updated Oct 30 • 50.6k • 316
google/gemma-3-12b-it-qat-q4_0-gguf • Image-Text-to-Text • 12B • Updated Apr 11 • 88.1k • 216
google/gemma-3-4b-it-qat-q4_0-gguf • Image-Text-to-Text • 4B • Updated Apr 11 • 10.7k • 219