Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

AaryanK
/
Qwen_2.5_3B_GRPO_Reasoning_XIOSERV

PyTorch
GGUF
qwen2
Reasoning
GRPO
DeepSeek
CoT
finetune
conversational
Model card Files Files and versions Community
2
Qwen_2.5_3B_GRPO_Reasoning_XIOSERV
Ctrl+K
Ctrl+K
  • 1 contributor
History: 5 commits
AaryanK's picture
AaryanK
Update README.md
6119f69 verified 3 months ago
  • .gitattributes
    1.81 kB
    Upload 13 files 3 months ago
  • Qwen_2.5_3B_GRPO_Reasoning_XIOSERV_F16.gguf
    6.18 GB
    LFS
    Upload 13 files 3 months ago
  • Qwen_2.5_3B_GRPO_Reasoning_XIOSERV_Q5_K_M.gguf
    2.22 GB
    LFS
    Upload 13 files 3 months ago
  • Qwen_2.5_3B_GRPO_Reasoning_XIOSERV_Q8_0.gguf
    3.29 GB
    LFS
    Upload 13 files 3 months ago
  • README.md
    3.02 kB
    Update README.md 3 months ago
  • added_tokens.json
    605 Bytes
    Upload 13 files 3 months ago
  • config.json
    809 Bytes
    Upload 13 files 3 months ago
  • generation_config.json
    139 Bytes
    Upload 13 files 3 months ago
  • pytorch_model-00001-of-00002.bin
    181 MB
    LFS
    Upload 13 files 3 months ago
  • pytorch_model-00002-of-00002.bin
    1.21 GB
    LFS
    Upload 13 files 3 months ago
  • pytorch_model.bin.index.json
    35.6 kB
    Upload 13 files 3 months ago
  • special_tokens_map.json
    614 Bytes
    Upload 13 files 3 months ago
  • tokenizer.json
    11.4 MB
    LFS
    Upload 13 files 3 months ago
  • tokenizer_config.json
    7.36 kB
    Upload 13 files 3 months ago
  • vocab.json
    2.78 MB
    Upload 13 files 3 months ago