Hugging Face
Models: 9,993
Sort: Trending
Active filters: image-to-text
Qwen/Qwen3-VL-Embedding-8B • Image-to-Text • 8B • Updated 2 days ago • 19.1k • 158
Qwen/Qwen3-VL-Embedding-2B • Image-to-Text • 2B • Updated 2 days ago • 11.6k • 143
Qwen/Qwen3-VL-Reranker-2B • Image-to-Text • 2B • Updated 2 days ago • 18.3k • 52
Qwen/Qwen3-VL-Reranker-8B • Image-to-Text • 9B • Updated 2 days ago • 1.28k • 46
allenai/olmOCR-2-7B-1025-FP8 • Image-to-Text • 8B • Updated Dec 9, 2025 • 1.11M • 178
datalab-to/chandra • Image-to-Text • 9B • Updated Oct 21, 2025 • 437k • 456
nvidia/nemotron-ocr-v1 • Image-to-Text • Updated 26 days ago • 270 • 61
Salesforce/blip-image-captioning-base • Image-to-Text • Updated Feb 3, 2025 • 1.59M • 834
snuh/mvl-rrg-1.0 • Image-to-Text • 770k • Updated 4 days ago • 9 • 3
sugartai/Qwen3-VL-4B-Uni-MuMER-Final • Image-to-Text • 4B • Updated 7 days ago • 14 • 3
microsoft/trocr-large-handwritten • Image-to-Text • Updated May 27, 2024 • 21.4k • 135
unsloth/Llama-3.2-11B-Vision-Instruct • Image-to-Text • 11B • Updated Dec 10, 2024 • 26.9k • 88
xiangjx/musk • Image-to-Text • Updated Jan 19, 2025 • 40
reducto/RolmOCR • Image-to-Text • 8B • Updated Apr 2, 2025 • 3.1k • 571
allenai/olmOCR-2-7B-1025 • Image-to-Text • 8B • Updated Oct 22, 2025 • 65.6k • 120
microsoft/trocr-base-handwritten • Image-to-Text • 0.3B • Updated Feb 11, 2025 • 125k • 470
microsoft/trocr-small-handwritten • Image-to-Text • Updated May 27, 2024 • 5.66k • 62
nlpconnect/vit-gpt2-image-captioning • Image-to-Text • Updated Feb 27, 2023 • 826k • 923
naver-clova-ix/donut-base • Image-to-Text • Updated Aug 13, 2022 • 173k • 241
microsoft/git-base-coco • Image-to-Text • Updated Feb 8, 2023 • 22.7k • 20
Salesforce/blip2-opt-2.7b-coco • Image-to-Text • 4B • Updated Feb 3, 2025 • 308k • 11
Xenova/vit-gpt2-image-captioning • Image-to-Text • Updated Oct 8, 2024 • 4.85k • 27
facebook/nougat-base • Image-to-Text • 0.3B • Updated Nov 20, 2023 • 7.14k • 182
microsoft/kosmos-2-patch14-224 • Image-to-Text • 2B • Updated Nov 28, 2023 • 144k • 182
OleehyO/TexTeller • Image-to-Text • 0.3B • Updated Jun 22, 2024 • 136k • 41
breezedeus/pix2text-mfr • Image-to-Text • Updated May 5, 2024 • 124k • 50
unum-cloud/uform-gen2-qwen-500m • Image-to-Text • 1B • Updated Apr 24, 2024 • 679 • 85
Mozilla/distilvit • Image-to-Text • 0.2B • Updated Nov 25, 2024 • 129 • 25
HuggingFaceH4/vsft-llava-1.5-7b-hf-trl • Image-to-Text • 7B • Updated Apr 11, 2024 • 52 • 18
LanguageBind/Video-LLaVA-7B-hf • Image-to-Text • 7B • Updated May 16, 2024 • 6.73k • 47