Hugging Face
Models: 6,640
Sort: Trending
Active filters: image-text-to-text
LiquidAI/LFM2.5-VL-1.6B • Image-Text-to-Text • 2B • Updated about 21 hours ago • 332 • 47
deepseek-ai/DeepSeek-OCR • Image-Text-to-Text • 3B • Updated Nov 4, 2025 • 3.37M • 3.04k
nvidia/Cosmos-Reason2-8B • Image-Text-to-Text • 9B • Updated 18 days ago • 19.3k • 33
PaddlePaddle/PaddleOCR-VL • Image-Text-to-Text • 1.0B • Updated 26 days ago • 14.1k • 1.45k
Qwen/Qwen3-VL-2B-Instruct • Image-Text-to-Text • 2B • Updated Oct 23, 2025 • 1.19M • 264
LiquidAI/LFM2.5-VL-1.6B-GGUF • Image-Text-to-Text • 1B • Updated about 22 hours ago • 80 • 16
Qwen/Qwen3-VL-8B-Instruct • Image-Text-to-Text • 9B • Updated Oct 15, 2025 • 2.53M • 625
LiquidAI/LFM2-VL-3B • Image-Text-to-Text • 3B • Updated Dec 5, 2025 • 4.46k • 123
tencent/HunyuanOCR • Image-Text-to-Text • 1.0B • Updated 14 days ago • 859k • 669
google/gemma-3-4b-it • Image-Text-to-Text • 4B • Updated Mar 21, 2025 • 766k • 1.08k
google/gemma-3-27b-it • Image-Text-to-Text • 27B • Updated Mar 21, 2025 • 1.51M • 1.79k
LiquidAI/LFM2.5-VL-1.6B-ONNX • Image-Text-to-Text • Updated about 4 hours ago • 1 • 12
ibm-granite/granite-docling-258M • Image-Text-to-Text • 0.3B • Updated Sep 23, 2025 • 202k • 1.08k
google/medgemma-4b-it • Image-Text-to-Text • 4B • Updated Oct 28, 2025 • 367k • 817
Qwen/Qwen3-VL-30B-A3B-Instruct • Image-Text-to-Text • 31B • Updated Nov 26, 2025 • 1.27M • 488
google/t5gemma-2-270m-270m • Image-Text-to-Text • 0.8B • Updated 21 days ago • 14.1k • 156
google/gemma-3n-E4B-it • Image-Text-to-Text • 8B • Updated Jul 14, 2025 • 132k • 847
Qwen/Qwen3-VL-4B-Instruct • Image-Text-to-Text • 4B • Updated Oct 15, 2025 • 613k • 290
QuixiAI/Prisma-VL-8B • Image-Text-to-Text • 770k • Updated 6 days ago • 57 • 29
google/gemma-3-12b-it • Image-Text-to-Text • 12B • Updated Mar 21, 2025 • 1.31M • 605
moondream/moondream3-preview • Image-Text-to-Text • 9B • Updated Oct 9, 2025 • 5.83k • 536
zai-org/GLM-4.6V-Flash • Image-Text-to-Text • 10B • Updated 28 days ago • 309k • 527
browser-use/bu-30b-a3b-preview • Image-Text-to-Text • 31B • Updated 14 days ago • 6.41k • 231
nvidia/Cosmos-Reason2-2B • Image-Text-to-Text • Updated 18 days ago • 7.01k • 15
janhq/Jan-v2-VL-max-Instruct-FP8 • Image-Text-to-Text • 31B • Updated 6 days ago • 46 • 8
kacperwikiel/RysOCR • Image-Text-to-Text • Updated 7 days ago • 130 • 7
google/medgemma-27b-it • Image-Text-to-Text • 29B • Updated Jul 10, 2025 • 11.7k • 260
Qwen/Qwen3-VL-8B-Thinking • Image-Text-to-Text • 9B • Updated Nov 26, 2025 • 131k • 171
Qwen/Qwen3-VL-4B-Instruct-GGUF • Image-Text-to-Text • 4B • Updated Nov 1, 2025 • 16.4k • 28
zai-org/AutoGLM-Phone-9B • Image-Text-to-Text • 934k • Updated 28 days ago • 93.8k • 402