DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published Mar 18 • 125
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated 10 days ago • 566
Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models Article • Published Mar 20, 2024 • 86
FineWeb: decanting the web for the finest text data at scale 🍷 Space • 937 • Generate high-quality web text data for LLM training
Post EVA-CLIP 🦖 is CLIP scaled to the moon! 🔥 The new SotA CLIP-like model 🏆
Highlights ✨
- Performs better in linear probing
- Outperforms in zero-shot image-text retrieval
- Higher zero-shot accuracy on IN-1K
As usual, try it with the notebook I built for you: https://colab.research.google.com/drive/1K7DdCORC3x4qyhwhuB4fT4wcfJ_BQLKw?usp=sharing#scrollTo=0ZS_lJ7SK6Ys
I also built a Space for you to compare the output probabilities to CLIP's; it seems that EVA-CLIP is more "sure" of its results 😊: merve/EVACLIP
The authors have shared the 8B checkpoints openly under the Apache 2.0 license 💜, and it's built on top of transformers, super easy to use: BAAI/EVA-CLIP-8B
Read the paper: EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters (2402.04252) 📄 ❤️ 9
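To illustrate what the "output probabilities" in the post above refer to: CLIP-style models score an image against candidate captions by cosine similarity of their embeddings, then softmax the scaled similarities into a probability distribution (a sharper distribution is what "more sure of its results" means). This is a minimal sketch of that mechanism with toy NumPy vectors standing in for real encoder outputs; it is not the EVA-CLIP implementation, and the function name and `logit_scale` value are illustrative assumptions.

```python
import numpy as np

def zero_shot_probs(image_emb, text_embs, logit_scale=100.0):
    """CLIP-style zero-shot matching: cosine similarity between one
    image embedding and a batch of caption embeddings, scaled and
    softmaxed into a probability over the candidate captions.
    (logit_scale is an illustrative stand-in for the model's learned
    temperature.)"""
    img = image_emb / np.linalg.norm(image_emb)
    txt = text_embs / np.linalg.norm(text_embs, axis=1, keepdims=True)
    logits = logit_scale * (txt @ img)   # scaled cosine similarities
    exp = np.exp(logits - logits.max())  # numerically stable softmax
    return exp / exp.sum()

# Toy embeddings (stand-ins for real image/text encoder outputs).
rng = np.random.default_rng(0)
image_emb = rng.normal(size=64)
text_embs = rng.normal(size=(3, 64))
text_embs[1] = image_emb + 0.1 * rng.normal(size=64)  # caption 1 matches the image

probs = zero_shot_probs(image_emb, text_embs)
print(probs.argmax())  # → 1, the caption closest to the image
```

With real models you would obtain `image_emb` and `text_embs` from the image and text encoders; the higher the temperature (logit scale), the more peaked the resulting distribution, which is what the comparison Space surfaces.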