metadata
license: apache-2.0
base_model:
  - Qwen/Qwen2.5-0.5B-Instruct
datasets:
  - agentlans/common-crawl-sample
  - bigcode/the-stack-smol-xl
  - open-thoughts/OpenThoughts-Unverified-173k
  - cognitivecomputations/dolphin-r1
tags:
  - draft
  - speculative-decoding
language:
  - zho
  - eng
  - fra
  - spa
  - por
  - deu
  - ita
  - rus
  - jpn
  - kor
  - vie
  - tha
  - ara


A 0.5B-parameter draft model for speculative decoding with deepseek-ai/DeepSeek-V3-0324.

See jukofyork/DeepSeek-V3-0324-DRAFT-0.5B-v1.0 for the non-GGUF version, and a detailed explanation of how the model was created.
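As a rough illustration of how a draft model like this one is typically paired with its target, the sketch below builds a llama.cpp `llama-server` command line in Python. It is not taken from this repository: both GGUF filenames are hypothetical placeholders, the draft limits are arbitrary example values, and the flag names (`--model-draft`, `--draft-max`, `--draft-min`) are what recent llama.cpp builds accept as far as I know, so check `llama-server --help` for your version before relying on them.

```python
import shlex


def build_llama_server_command(target_gguf, draft_gguf, draft_max=16, draft_min=1):
    """Return an argv list that would start llama-server with a draft model.

    The command is only constructed here, not executed.
    """
    return [
        "llama-server",
        "--model", target_gguf,          # large target model (DeepSeek-V3-0324)
        "--model-draft", draft_gguf,     # small draft model from this repository
        "--draft-max", str(draft_max),   # upper bound on tokens drafted per step
        "--draft-min", str(draft_min),   # lower bound on tokens drafted per step
    ]


if __name__ == "__main__":
    # Both filenames are hypothetical; substitute the quants you downloaded.
    cmd = build_llama_server_command(
        "DeepSeek-V3-0324-Q4_K_M.gguf",
        "DeepSeek-V3-0324-DRAFT-0.5B-v1.0-Q4_0.gguf",
    )
    print(shlex.join(cmd))
```

The resulting list can be passed to `subprocess.run` once the paths point at real files. Larger `draft_max` values only pay off when the draft model's guesses are accepted often, so it is worth measuring throughput with a few settings.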


Without imatrix

With imatrix


See DeepSeek-R1-DRAFT-0.5B-v1.0-GGUF for detailed perplexity (PPL) statistics and recommendations on which quant to use.

I have included the imatrix file used to generate the Q4_0 through Q6_K quants, along with the 1MB sample of the fine-tuning data used to create it.