thewordsmiths
/

Mistral-7B-v0.3_sft_LoRA_100000_dpo_merged

Text Generation

text-generation-inference

Model card Files Files and versions Community

Mistral-7B-v0.3_sft_LoRA_100000_dpo_merged

Ctrl+K

Ctrl+K

1 contributor

History: 4 commits

paultltc's picture

Upload model trained with Unsloth

48fdb89 verified 11 months ago

.gitattributes

1.52 kB

initial commit 11 months ago
README.md

586 Bytes

Upload model trained with Unsloth 11 months ago
config.json

698 Bytes

Upload model trained with Unsloth 11 months ago
generation_config.json

111 Bytes

Upload model trained with Unsloth 11 months ago
model-00001-of-00006.safetensors

4.8 GB
LFS

Upload model trained with Unsloth 11 months ago
model-00002-of-00006.safetensors

4.83 GB
LFS

Upload model trained with Unsloth 11 months ago
model-00003-of-00006.safetensors

5 GB
LFS

Upload model trained with Unsloth 11 months ago
model-00004-of-00006.safetensors

5 GB
LFS

Upload model trained with Unsloth 11 months ago
model-00005-of-00006.safetensors

4.83 GB
LFS

Upload model trained with Unsloth 11 months ago
model-00006-of-00006.safetensors

3.99 GB
LFS

Upload model trained with Unsloth 11 months ago
model.safetensors.index.json

24 kB

Upload model trained with Unsloth 11 months ago
special_tokens_map.json

446 Bytes

Upload model trained with Unsloth 11 months ago
tokenizer.model

587 kB
LFS

Upload model trained with Unsloth 11 months ago
tokenizer_config.json

137 kB

Upload model trained with Unsloth 11 months ago