Update README.md
### Chocolatine-3B-Instruct-DPO-v1.2

Best version of Chocolatine-3B for French.
*The model supports 128K context length.*

DPO fine-tune of [microsoft/Phi-3.5-mini-instruct](https://huggingface.co/microsoft/Phi-3.5-mini-instruct) (3.82B params),
trained on the [jpacifico/french-orca-dpo-pairs-revised](https://huggingface.co/datasets/jpacifico/french-orca-dpo-pairs-revised) RLHF dataset.
Training in French also improves the model in English, surpassing the performance of its base model.
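For readers unfamiliar with DPO: it trains the policy directly on preference pairs (a chosen and a rejected answer), penalizing the policy when it does not favor the chosen answer more strongly than a frozen reference model does. A minimal sketch of the per-pair loss, with illustrative log-probability values (not taken from this model's training run):

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Direct Preference Optimization loss for one preference pair.

    Each argument is the summed log-probability of a full response
    under the policy or the frozen reference model; beta controls
    how far the policy may drift from the reference.
    """
    chosen_ratio = policy_chosen_logp - ref_chosen_logp
    rejected_ratio = policy_rejected_logp - ref_rejected_logp
    logits = beta * (chosen_ratio - rejected_ratio)
    # -log(sigmoid(logits)) written in a numerically stable form
    return math.log1p(math.exp(-logits))

# The loss shrinks as the policy prefers the chosen answer more
# strongly than the reference does (values here are made up).
no_preference = dpo_loss(-10.0, -10.0, -10.0, -10.0)  # log(2) ~ 0.693
chosen_favored = dpo_loss(-8.0, -12.0, -10.0, -10.0)  # smaller loss
```

In practice this objective is applied per batch by a trainer such as TRL's `DPOTrainer` rather than hand-rolled; the sketch only shows the shape of the loss the dataset above is used with.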

### MT-Bench-French