Update README.md
Browse files
README.md
CHANGED
@@ -18,4 +18,47 @@ datasets:
|
|
18 |
|
19 |
- **Developed by:** datatab
|
20 |
- **License:** MIT
|
21 |
-
- **Finetuned from model :** datatab/Yugo55A-GPT
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
18 |
|
19 |
- **Developed by:** datatab
|
20 |
- **License:** MIT
|
21 |
+
- **Finetuned from model :** datatab/Yugo55A-GPT
|
22 |
+
|
23 |
+
|
24 |
+
### Full Weights Model
|
25 |
+
> [datatab/Yugo55A-GPT](https://huggingface.co/datatab/Yugo55A-GPT).
|
26 |
+
|
27 |
+
|
28 |
+
|
29 |
+
## 🏆 Results
|
30 |
+
> Results obtained through the Serbian LLM evaluation, released by Aleksa Gordić: [serbian-llm-eval](https://github.com/gordicaleksa/serbian-llm-eval)
|
31 |
+
> * Evaluation was conducted on a 4-bit version of the model due to hardware resource constraints.
|
32 |
+
|
33 |
+
<table>
|
34 |
+
<tr>
|
35 |
+
<th>MODEL</th>
|
36 |
+
<th>ARC-E</th>
|
37 |
+
<th>ARC-C</th>
|
38 |
+
<th>Hellaswag</th>
|
39 |
+
<th>BoolQ</th>
|
40 |
+
<th>Winogrande</th>
|
41 |
+
<th>OpenbookQA</th>
|
42 |
+
<th>PiQA</th>
|
43 |
+
</tr>
|
44 |
+
<tr>
|
45 |
+
<td><a href="https://huggingface.co/datatab/Yugo55-GPT-v4-4bit/">*Yugo55-GPT-v4-4bit</a></td>
|
46 |
+
<td>51.41</td>
|
47 |
+
<td>36.00</td>
|
48 |
+
<td>57.51</td>
|
49 |
+
<td>80.92</td>
|
50 |
+
<td><strong>65.75</strong></td>
|
51 |
+
<td>34.70</td>
|
52 |
+
<td><strong>70.54</strong></td>
|
53 |
+
</tr>
|
54 |
+
<tr>
|
55 |
+
<td><a href="https://huggingface.co/datatab/Yugo55A-GPT/">Yugo55A-GPT</a></td>
|
56 |
+
<td><strong>51.52</strong></td>
|
57 |
+
<td><strong>37.78</strong></td>
|
58 |
+
<td><strong>57.52</strong></td>
|
59 |
+
<td><strong>84.40</strong></td>
|
60 |
+
<td>65.43</td>
|
61 |
+
<td><strong>35.60</strong></td>
|
62 |
+
<td>69.43</td>
|
63 |
+
</tr>
|
64 |
+
</table>
|