marcusinthesky's Collections
Multimodal Embeddings
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
Paper
•
2403.19651
•
Published
•
22
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance
Paper
•
2404.04125
•
Published
•
30
Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies
Paper
•
2404.08197
•
Published
•
30
Gecko: Versatile Text Embeddings Distilled from Large Language Models
Paper
•
2403.20327
•
Published
•
49
OpenGVLab/InternVL-14B-224px
Image Feature Extraction
•
Updated
•
467
•
35
Alibaba-NLP/gte-large-en-v1.5
Sentence Similarity
•
Updated
•
900k
•
215
jinaai/jina-embeddings-v2-base-en
Feature Extraction
•
Updated
•
279k
•
722
castorini/repllama-v1.1-mrl-7b-lora-passage
Feature Extraction
•
Updated
•
14
•
5
McGill-NLP/LLM2Vec-Sheared-LLaMA-mntp
Sentence Similarity
•
Updated
•
1.84k
•
5
BAAI/bge-visualized
Updated
•
54
royokong/e5-v
Image-Text-to-Text
•
Updated
•
6.3k
•
23
TIGER-Lab/VLM2Vec-Full
Text Generation
•
Updated
•
28.8k
•
25
openbmb/VisRAG-Ret
Feature Extraction
•
Updated
•
1.63k
•
65