Update README.md
README.md
CHANGED
@@ -1,9 +1,14 @@
|
|
1 |
---
|
2 |
library_name: peft
3 |
---
|
4 |
## Training procedure
|
5 |
|
6 |
-
|
7 |
The following `bitsandbytes` quantization config was used during training:
|
8 |
- load_in_8bit: True
|
9 |
- load_in_4bit: False
|
@@ -14,7 +19,41 @@ The following `bitsandbytes` quantization config was used during training:
|
|
14 |
- bnb_4bit_quant_type: fp4
|
15 |
- bnb_4bit_use_double_quant: False
|
16 |
- bnb_4bit_compute_dtype: float32
|
17 |
-
### Framework versions
|
18 |
19 |
|
20 |
-
|
|
|
1 |
---
|
2 |
library_name: peft
|
3 |
+
datasets:
|
4 |
+
- ChristophSchuhmann/essays-with-instructions
|
5 |
+
language:
|
6 |
+
- en
|
7 |
+
metrics:
|
8 |
+
- perplexity
|
9 |
---
|
10 |
## Training procedure
|
11 |
|
|
|
12 |
The following `bitsandbytes` quantization config was used during training:
|
13 |
- load_in_8bit: True
|
14 |
- load_in_4bit: False
|
|
|
19 |
- bnb_4bit_quant_type: fp4
|
20 |
- bnb_4bit_use_double_quant: False
|
21 |
- bnb_4bit_compute_dtype: float32
|
|
|
22 |
|
23 |
+
### Model Description
|
24 |
+
|
25 |
+
This PEFT adapter was fine-tuned from `facebook/opt-2.7b`. For details on how it was created, see the training notebook: https://github.com/DunnBC22/NLP_Projects/blob/main/OPT%20Models/Essays%20With%20Instructions%20-%20Fine-Tune%20-%20OPT.ipynb
|
26 |
+
|
27 |
+
### Intended uses & limitations
|
28 |
+
|
29 |
+
This model is intended as a demonstration of what this fine-tuning approach can do. Its main limitation is the training data.
|
30 |
+
|
31 |
+
### Training & Evaluation Dataset
|
32 |
+
|
33 |
+
Dataset Source: https://huggingface.co/datasets/ChristophSchuhmann/essays-with-instructions
|
34 |
+
|
35 |
+
### Hyperparameters Used
|
36 |
+
|
37 |
+
| Hyperparameter | Value |
|:-----:|:-----:|
| Model Checkpoint | facebook/opt-2.7b |
| per_device_train_batch_size | 8 |
| gradient_accumulation_steps | 4 |
| fp16 | True |
| warmup_steps | 75 |
| learning_rate | 2e-4 |
| Training Steps | 150 |
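The batch size and gradient accumulation values combine into an effective batch size per optimizer update. The sketch below works through that arithmetic with the table's numbers. It assumes a single device and that "Training Steps" counts optimizer updates, neither of which the source states; `num_devices` is a hypothetical name introduced for illustration.

```python
# Values copied from the hyperparameter table.
per_device_train_batch_size = 8
gradient_accumulation_steps = 4
training_steps = 150
warmup_steps = 75

# Assumption: one device, and "Training Steps" counts optimizer updates.
num_devices = 1

# Sequences contributing to each optimizer update.
effective_batch_size = (
    per_device_train_batch_size * gradient_accumulation_steps * num_devices
)

# Total sequences processed over the whole run.
sequences_seen = effective_batch_size * training_steps

# Share of the run spent in learning-rate warmup.
warmup_fraction = warmup_steps / training_steps

print(effective_batch_size)  # 32
print(sequences_seen)        # 4800
print(warmup_fraction)       # 0.5
```

Under these assumptions, half of the run is spent warming up the learning rate, so the peak rate of 2e-4 is reached only at step 75.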
|
46 |
+
|
47 |
+
### Framework versions
|
48 |
+
|
49 |
+
| Library | Version |
|:-----:|:-----:|
| Python | 3.10.1 |
| Torch | 2.0.1+cu118 |
| Datasets | 2.14.4 |
| Transformers | 4.31.0 |
| PEFT | 0.4.0 |
|
56 |
+
|
57 |
+
### Metric
|
58 |
|
59 |
+
Perplexity = 9.46
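
Perplexity is conventionally the exponential of the mean per-token cross-entropy loss. The source does not say how the metric was computed, so the sketch below assumes that convention with a natural-log loss and converts the reported value back to a loss. The helper `perplexity_from_loss` is a hypothetical name used for illustration.

```python
import math

reported_perplexity = 9.46  # value reported in this model card

# Assumption: perplexity = exp(mean cross-entropy in nats per token).
mean_loss_nats = math.log(reported_perplexity)
mean_loss_bits = math.log2(reported_perplexity)


def perplexity_from_loss(mean_loss: float) -> float:
    """Convert a mean per-token cross-entropy (in nats) to perplexity."""
    return math.exp(mean_loss)


print(f"{mean_loss_nats:.3f} nats/token")
print(f"{perplexity_from_loss(mean_loss_nats):.2f}")
```

Under this assumption, a perplexity of 9.46 corresponds to a mean loss of roughly 2.25 nats per token.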
|