Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
1
Libraries
Datasets
Languages
Licenses
Other
Reset Tasks
Multimodal
Audio-Text-to-Text
Image-Text-to-Text
Visual Question Answering
Document Question Answering
Video-Text-to-Text
Visual Document Retrieval
Any-to-Any
Computer Vision
Depth Estimation
Image Classification
Object Detection
Image Segmentation
Text-to-Image
Image-to-Text
Image-to-Image
Image-to-Video
Unconditional Image Generation
Video Classification
Text-to-Video
Zero-Shot Image Classification
Mask Generation
Zero-Shot Object Detection
Text-to-3D
Image-to-3D
Image Feature Extraction
Keypoint Detection
Natural Language Processing
Text Classification
Token Classification
Table Question Answering
Question Answering
Zero-Shot Classification
Translation
Summarization
Feature Extraction
Text Generation
Text2Text Generation
Fill-Mask
Sentence Similarity
Audio
Text-to-Speech
Text-to-Audio
Automatic Speech Recognition
Audio-to-Audio
Audio Classification
Voice Activity Detection
Tabular
Tabular Classification
Tabular Regression
Time Series Forecasting
Reinforcement Learning
Reinforcement Learning
Robotics
Other
Graph Machine Learning
Apply filters
Models
7,408
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
Gopal2002/donut_finetune_vqa
Image-Text-to-Text
•
Updated
Feb 23, 2024
•
6
jatingogia/donut-2
Image-Text-to-Text
•
Updated
Feb 15, 2024
•
10
granddad/llava-v1.5-7b-gguf
Image-Text-to-Text
•
Updated
Mar 6, 2024
•
27
hadrakey/alphapen_base_hwr
Image-Text-to-Text
•
Updated
Feb 18, 2024
•
4
pradeep239/philip_plain_only5_years_epoch2
Image-Text-to-Text
•
Updated
Feb 19, 2024
•
10
yoon1000/TrOCR_0216
Image-Text-to-Text
•
Updated
Feb 16, 2024
•
7
juliansmidek/donut_cord_v3
Image-Text-to-Text
•
Updated
Feb 19, 2024
•
4
yoon1000/TrOCR_0216_All_data
Image-Text-to-Text
•
Updated
Feb 16, 2024
•
5
Ahmed007/VIT_ara_gpt2
Image-Text-to-Text
•
Updated
Feb 16, 2024
•
6
Rv-23bit/finetuned_trocr_v2
Image-Text-to-Text
•
Updated
Feb 16, 2024
•
6
•
1
singhutkarsh/LLaVA
Image-Text-to-Text
•
Updated
Mar 12, 2024
Christa27/docvqa_mini_subset
Image-Text-to-Text
•
Updated
Feb 16, 2024
•
6
a8nova/tiny-random-idefics
Image-Text-to-Text
•
Updated
Feb 16, 2024
•
7
granddad/llava-v1.5-13b-gguf
Image-Text-to-Text
•
Updated
Mar 6, 2024
•
29
Gowtham04/visiontransformer
Image-Text-to-Text
•
Updated
Feb 17, 2024
•
35
cjpais/llava-v1.6-vicuna-7b-gguf
Image-Text-to-Text
•
Updated
Mar 7, 2024
•
809
•
5
cjpais/llava-v1.6-vicuna-13b-gguf
Image-Text-to-Text
•
Updated
Mar 7, 2024
•
1.23k
•
9
ShekDass/donut-base-cord-test3-CMS30SYN85AUG
Image-Text-to-Text
•
Updated
Feb 18, 2024
•
9
yoon1000/TrOCR_0219_All_data
Image-Text-to-Text
•
Updated
Feb 19, 2024
•
8
SurfaceData/llava-v1.6-mistral-7b-sglang
Image-Text-to-Text
•
Updated
Mar 7, 2024
•
36
•
9
SurfaceData/llava-v1.6-vicuna-7b-sglang
Image-Text-to-Text
•
Updated
Mar 7, 2024
•
11
•
1
jatingogia/donut_kv
Image-Text-to-Text
•
Updated
Feb 21, 2024
•
6
agnisharmanv/idClassification
Image-Text-to-Text
•
Updated
Feb 21, 2024
•
8
alexbeta80/donut-test-ddt-250img
Image-Text-to-Text
•
Updated
Feb 21, 2024
•
6
eduvedras/git-base-vqg-balanced
Image-Text-to-Text
•
Updated
Feb 19, 2024
•
5
eduvedras/git-base-vqg-unbalanced
Image-Text-to-Text
•
Updated
Feb 19, 2024
•
5
vishnu027/cm1132_type1
Image-Text-to-Text
•
Updated
Feb 20, 2024
•
4
vishnu027/cm1132_type1_m2
Image-Text-to-Text
•
Updated
Feb 20, 2024
•
4
vishnu027/cm1132_type2
Image-Text-to-Text
•
Updated
Feb 20, 2024
•
4
vishnu027/cm1132_type2_m2
Image-Text-to-Text
•
Updated
Feb 20, 2024
•
4
Previous
1
...
92
93
94
95
96
...
100
Next