
DAMO-NLP-SG/VL3-SigLIP-NaViT

Image Feature Extraction
Transformers
Safetensors
English
videollama3_vision_encoder
feature-extraction
visual-encoder
multi-modal-large-language-model
custom_code

Does this only support images?

#6 opened about 2 months ago by 2U1

What is the difference between this model and "DAMO-NLP-SG/SigLIP-NaViT"?

1
#5 opened 2 months ago by hao98

How to encode a batch of pictures?

#4 opened 3 months ago by kurisu0306

Add model card metadata

#3 opened 4 months ago by nielsr

Training details

#2 opened 4 months ago by lucasjin

Rotary embedding: why use 1D rather than 2D?

#1 opened 4 months ago by lucasjin