xiangan · anxiangsir

AI & ML interests

None yet

Recent Activity

updated a model 5 days ago

DeepGlint-AI/MLCD-Embodied-7B

new activity 5 days ago

DeepGlint-AI/llava-mlcd-qwen2.5-7b:Improve language tag

new activity 5 days ago

DeepGlint-AI/MLCD-Embodied-7B:Improve language tag

xiangan's activity

upvoted a paper 8 days ago

Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs

Paper • 2504.17432 • Published 9 days ago • 38

upvoted a paper 9 days ago

Decoupled Global-Local Alignment for Improving Compositional Understanding

Paper • 2504.16801 • Published 10 days ago • 15

upvoted a collection 10 days ago

UniME

UniME is a series of multimodal large language models trained for learning universal multimodal embedding. • 3 items • Updated 8 days ago • 4

upvoted 2 papers 2 months ago

Killing Two Birds with One Stone: Efficient and Robust Training of Face Recognition CNNs by Partial FC

Paper • 2203.15565 • Published Mar 28, 2022 • 1

RealSyn: An Effective and Scalable Multimodal Interleaved Document Transformation Paradigm

Paper • 2502.12513 • Published Feb 18 • 16

upvoted 2 papers 7 months ago

Unicom: Universal and Compact Representation Learning for Image Retrieval

Paper • 2304.05884 • Published Apr 12, 2023 • 2

Multi-label Cluster Discrimination for Visual Representation Learning

Paper • 2407.17331 • Published Jul 24, 2024 • 2

upvoted 2 collections 7 months ago

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated 4 days ago • 212

MLCD

Large-Scale Visual Representation Model • 8 items • Updated Feb 15 • 7

upvoted a paper over 1 year ago

AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model

Paper • 2309.16058 • Published Sep 27, 2023 • 55