-
Addition is All You Need for Energy-efficient Language Models
Paper • 2410.00907 • Published • 151 -
Emu3: Next-Token Prediction is All You Need
Paper • 2409.18869 • Published • 95 -
An accurate detection is not all you need to combat label noise in web-noisy datasets
Paper • 2407.05528 • Published • 4 -
Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP
Paper • 2407.00402 • Published • 23
meng shao
meng-shao
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
2 days ago
Phi-4-reasoning Technical Report
liked
a model
3 days ago
deepseek-ai/DeepSeek-Prover-V2-671B
Organizations
Collections
2
-
DreamStruct: Understanding Slides and User Interfaces via Synthetic Data Generation
Paper • 2410.00201 • Published -
Does RAG Introduce Unfairness in LLMs? Evaluating Fairness in Retrieval-Augmented Generation Systems
Paper • 2409.19804 • Published -
Rethinking Conventional Wisdom in Machine Learning: From Generalization to Scaling
Paper • 2409.15156 • Published -
Just ASR + LLM? A Study on Speech Large Language Models' Ability to Identify and Understand Speaker in Spoken Dialogue
Paper • 2409.04927 • Published
models
0
None public yet
datasets
0
None public yet