- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
  Paper • 2501.12948 • Published • 348
- LightThinker: Thinking Step-by-Step Compression
  Paper • 2502.15589 • Published • 26
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
  Paper • 2405.04434 • Published • 18
- Model Compression and Efficient Inference for Large Language Models: A Survey
  Paper • 2402.09748 • Published • 1
Nvar Char
zombieofCrypto
AI & ML interests
machine learning to become more zombie-like
Recent Activity
Updated a collection 2 days ago: llm_improvement_research
Collections: 4
Spaces: 5
Datasets: none public yet