Update README.md
Browse files
README.md
CHANGED
@@ -37,7 +37,34 @@ This model is intended for researchers and practitioners looking for a powerful
|
|
37 |
The `Locutusque/Hyperion-2.0-Mistral-7B` model was fine-tuned on the Hyperion-v2.0 dataset, which amalgamates various datasets rich in diversity and complexity, including programming, medical texts, mathematical problems, and reasoning tasks.
|
38 |
|
39 |
## Evaluation Results
|
40 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
41 |
|
42 |
## How to Use
|
43 |
```python
|
|
|
37 |
The `Locutusque/Hyperion-2.0-Mistral-7B` model was fine-tuned on the Hyperion-v2.0 dataset, which amalgamates various datasets rich in diversity and complexity, including programming, medical texts, mathematical problems, and reasoning tasks.
|
38 |
|
39 |
## Evaluation Results
|
40 |
+
0-shot AGIEval
|
41 |
+
| Tasks |Version|Filter|n-shot| Metric |Value | |Stderr|
|
42 |
+
|---------------------------------|-------|------|-----:|--------|-----:|---|-----:|
|
43 |
+
|agieval_nous |N/A |none | 0|acc |0.3602|± |0.0929|
|
44 |
+
| | |none | 0|acc_norm|0.3342|± |0.0764|
|
45 |
+
| - agieval_aqua_rat | 1|none | 0|acc |0.2402|± |0.0269|
|
46 |
+
| | |none | 0|acc_norm|0.2441|± |0.0270|
|
47 |
+
| - agieval_logiqa_en | 1|none | 0|acc |0.2965|± |0.0179|
|
48 |
+
| | |none | 0|acc_norm|0.3226|± |0.0183|
|
49 |
+
| - agieval_lsat_ar | 1|none | 0|acc |0.2348|± |0.0280|
|
50 |
+
| | |none | 0|acc_norm|0.2000|± |0.0264|
|
51 |
+
| - agieval_lsat_lr | 1|none | 0|acc |0.3667|± |0.0214|
|
52 |
+
| | |none | 0|acc_norm|0.3373|± |0.0210|
|
53 |
+
| - agieval_lsat_rc | 1|none | 0|acc |0.4981|± |0.0305|
|
54 |
+
| | |none | 0|acc_norm|0.4089|± |0.0300|
|
55 |
+
| - agieval_sat_en | 1|none | 0|acc |0.6359|± |0.0336|
|
56 |
+
| | |none | 0|acc_norm|0.5777|± |0.0345|
|
57 |
+
| - agieval_sat_en_without_passage| 1|none | 0|acc |0.3883|± |0.0340|
|
58 |
+
| | |none | 0|acc_norm|0.3544|± |0.0334|
|
59 |
+
| - agieval_sat_math | 1|none | 0|acc |0.3500|± |0.0322|
|
60 |
+
| | |none | 0|acc_norm|0.2682|± |0.0299|
|
61 |
+
|
62 |
+
| Groups |Version|Filter|n-shot| Metric |Value | |Stderr|
|
63 |
+
|------------|-------|------|-----:|--------|-----:|---|-----:|
|
64 |
+
|agieval_nous|N/A |none | 0|acc |0.3602|± |0.0929|
|
65 |
+
| | |none | 0|acc_norm|0.3342|± |0.0764|
|
66 |
+
|
67 |
+
5-shot AGIEval coming soon.
|
68 |
|
69 |
## How to Use
|
70 |
```python
|