HuggingFaceM4
/

Idefics3-8B-Llama3

Image-Text-to-Text

Model card Files Files and versions Community

andito HF Staff commited on Aug 26, 2024

Commit

a1b83a3

·

verified ·

1 Parent(s): 30d4b60

Update README.md

Files changed (1) hide show

README.md +4 -5

README.md CHANGED Viewed

@@ -195,17 +195,16 @@ The model is built on top of two pre-trained models: [google/siglip-so400m-patch
 **BibTeX:**
 ```bibtex
-@misc{laurençon2024matters,
-      title={What matters when building vision-language models?},
-      author={Hugo Laurençon and Léo Tronchon and Matthieu Cord and Victor Sanh},
       year={2024},
-      eprint={2405.02246},
       archivePrefix={arXiv},
       primaryClass={cs.CV}
 }
 ```
-TODO: new paper
 # Acknowledgements

 **BibTeX:**
 ```bibtex
+@misc{laurençon2024building,
+      title={Building and better understanding vision-language models: insights and future directions.},
+      author={Hugo Laurençon and Andrés Marafioti and Victor Sanh and Léo Tronchon},
       year={2024},
+      eprint={2408.12637},
       archivePrefix={arXiv},
       primaryClass={cs.CV}
 }
 ```
 # Acknowledgements