llm-blender
/

pair-ranker

Model card Files Files and versions Community

Dongfu Jiang commited on Oct 23, 2023

Commit

adad0ff

·

1 Parent(s): f5ed554

Update README.md

Files changed (1) hide show

README.md +8 -3

README.md CHANGED Viewed

@@ -18,15 +18,20 @@ tags:
 PairRanker used in llm-blender, trained on deberta-v3-large. This is the ranker model used in experiments in LLM-Blender paper,
 which is trained on [mixinstruct](https://huggingface.co/datasets/llm-blender/mix-instruct) dataset for 5 epochs.
 |  PairRanker type  | Source max length | Candidate max length | Total max length |
 |:-----------------:|:-----------------:|----------------------|------------------|
 | [pair-ranker](https://huggingface.co/llm-blender/pair-ranker) (This model)              | 128               | 128                  | 384              |
 | [pair-reward-model](https://huggingface.co/llm-blender/pair-reward-model/) | 1224              | 412                  | 2048             |
-- Github: [https://github.com/yuchenlin/LLM-Blender](https://github.com/yuchenlin/LLM-Blender)
-- Paper: [https://arxiv.org/abs/2306.02561](https://arxiv.org/abs/2306.02561)
 |    **Methods**    | BERTScore | BARTScore |   BLEURT  | GPT-Rank |  Beat Vic(%)  |   Beat OA(%)  |  Top-1(%)  |  Top-2(%)  |  Top-3(%)  |
 |:-----------------:|:---------:|:---------:|:---------:|:--------:|:----------:|:----------:|:----------:|:----------:|:----------:|

 PairRanker used in llm-blender, trained on deberta-v3-large. This is the ranker model used in experiments in LLM-Blender paper,
 which is trained on [mixinstruct](https://huggingface.co/datasets/llm-blender/mix-instruct) dataset for 5 epochs.
+- Github: [https://github.com/yuchenlin/LLM-Blender](https://github.com/yuchenlin/LLM-Blender)
+- Paper: [https://arxiv.org/abs/2306.02561](https://arxiv.org/abs/2306.02561)
+## Statistics
+### Context length
 |  PairRanker type  | Source max length | Candidate max length | Total max length |
 |:-----------------:|:-----------------:|----------------------|------------------|
 | [pair-ranker](https://huggingface.co/llm-blender/pair-ranker) (This model)              | 128               | 128                  | 384              |
 | [pair-reward-model](https://huggingface.co/llm-blender/pair-reward-model/) | 1224              | 412                  | 2048             |
+### MixInstrut Performance
 |    **Methods**    | BERTScore | BARTScore |   BLEURT  | GPT-Rank |  Beat Vic(%)  |   Beat OA(%)  |  Top-1(%)  |  Top-2(%)  |  Top-3(%)  |
 |:-----------------:|:---------:|:---------:|:---------:|:--------:|:----------:|:----------:|:----------:|:----------:|:----------:|