Emerging Properties in Unified Multimodal Pretraining Paper • 2505.14683 • Published 11 days ago • 124
BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline Paper • 2408.15079 • Published Aug 27, 2024 • 55
Clover: Regressive Lightweight Speculative Decoding with Sequential Knowledge Paper • 2405.00263 • Published May 1, 2024 • 17
ShortGPT: Layers in Large Language Models are More Redundant Than You Expect Paper • 2403.03853 • Published Mar 6, 2024 • 65