34 100 232

dame rajee

damerajee

AI & ML interests

None yet

Recent Activity

upvoted an article about 12 hours ago

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

liked a model about 13 hours ago

mixedbread-ai/mxbai-rerank-base-v2

upvoted a paper about 21 hours ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

View all activity

Organizations

damerajee's activity

upvoted an article about 12 hours ago

Article

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

Mar 22, 2024

• 88

upvoted a paper about 21 hours ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published 1 day ago • 61

upvoted a collection 12 days ago

Web-SSL

Collection

17 items • Updated 14 days ago • 14

upvoted an article about 1 month ago

Article

🪆 Introduction to Matryoshka Embedding Models

Feb 23, 2024

• 105

upvoted a paper about 2 months ago

Words or Vision: Do Vision-Language Models Have Blind Faith in Text?

Paper • 2503.02199 • Published Mar 4 • 8

upvoted a collection about 2 months ago

BD3-LMs

Collection

https://m-arriola.com/bd3lms/ • 4 items • Updated 26 days ago • 20

upvoted 2 papers 2 months ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published Feb 20 • 175

Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation

Paper • 2502.14846 • Published Feb 20 • 13

upvoted 2 papers 3 months ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 156

Matryoshka Quantization

Paper • 2502.06786 • Published Feb 10 • 30

upvoted an article 3 months ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Feb 4

• 153

upvoted 2 papers 3 months ago

WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training

Paper • 2501.18511 • Published Jan 30 • 20

Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Paper • 2501.16975 • Published Jan 28 • 31

upvoted an article 4 months ago

Article

Timm ❤️ Transformers: Use any timm model with transformers

Jan 16

• 48

upvoted 2 papers 4 months ago

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Paper • 2501.06282 • Published Jan 10 • 52

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published Jan 9 • 55