chargoddard commited on
Commit
75c1bf5
·
1 Parent(s): 1abee4f

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +23 -0
README.md ADDED
@@ -0,0 +1,23 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - llama
4
+ ---
5
+ Five different instruction-tuned models (which I'm sure are intuitively obvious from the name) merged using the methodology described in [Resolving Interference When Merging Models](https://arxiv.org/abs/2306.01708).
6
+
7
+ In theory this should retain more of the capabilites of the constituent models than a straight linear merge would. In my testing, it feels quite capable.
8
+
9
+ Base model used for the merge: [TheBloke/Llama-2-13B-fp16](https://huggingface.co/TheBloke/Llama-2-13B-fp16)
10
+
11
+ Models merged in:
12
+ * [OpenOrca-Platypus2](https://huggingface.co/Open-Orca/OpenOrca-Platypus2)
13
+ * [limarp-13b-merged](https://huggingface.co/Oniichat/limarp-13b-merged)
14
+ * [Nous-Hermes-Llama2-13b](https://huggingface.co/NousResearch/Nous-Hermes-Llama2-13b)
15
+ * [chronos-13b-v2](https://huggingface.co/elinas/chronos-13b-v2)
16
+ * [airoboros-l2-13b-gpt4-1.4.1](https://huggingface.co/jondurbin/airoboros-l2-13b-gpt4-1.4.1)
17
+
18
+ The script I used to perform the merge is available [here](https://github.com/cg123/ties-merge).
19
+
20
+ The command that produced this model:
21
+ ```
22
+ python ties_merge.py TheBloke/Llama-2-13B-fp16 ./Chronorctypus-Limarobormes-13b --merge elinas/chronos-13b-v2 --merge Open-Orca/OpenOrca-Platypus2-13B --merge Oniichat/limarp-13b-merged --merge jondurbin/airoboros-l2-13b-gpt4-1.4.1 --merge NousResearch/Nous-Hermes-Llama2-13b --cuda
23
+ ```