
llava-hf/llava-onevision-qwen2-7b-ov-hf

Image-Text-to-Text
Transformers
Safetensors
English
Chinese
llava_onevision
vision
conversational

Wrong implementation of processor for multi-image inference

#9 opened 4 months ago by ruiqiRichard

llava-onevision-qwen2-7b-ov-hf vs llava-onevision-qwen2-7b-si-hf

#8 opened 4 months ago by Insaf2

Model.py file

1
#7 opened 6 months ago by AvDy

add "mm_spatial_pool_mode" to config.

14
#3 opened 8 months ago by litianjian

Can I use multi-image input?

1
#2 opened 9 months ago by juhonov

Some error with this model

1
#1 opened 9 months ago by lsx666