metadata
license: apache-2.0
base_model:
- Qwen/Qwen2.5-0.5B-Instruct
datasets:
- agentlans/common-crawl-sample
- bigcode/the-stack-smol-xl
- open-thoughts/OpenThoughts-Unverified-173k
- cognitivecomputations/dolphin-r1
tags:
- draft
- speculative-decoding
language:
- zho
- eng
- fra
- spa
- por
- deu
- ita
- rus
- jpn
- kor
- vie
- tha
- ara
A 0.5B
parameter draft (speculative decoding) model for use with deepseek-ai/DeepSeek-V3-0324.
See jukofyork/DeepSeek-V3-0324-DRAFT-0.5B-v1.0 for the non-GGUF version, and a detailed explanation of how the model was created.
Without imatrix
With imatrix
See DeepSeek-R1-DRAFT-0.5B-v1.0-GGUF for detailed PPL statistics and recommendations on which quant to use, etc.
I have included the imatrix file used to generate the Q4_0
-Q6_K
quants, along with the 1MB sample of the fine-tuning data used to create it.