DunnBC22 commited on
Commit
7aafbd2
·
1 Parent(s): c0037ae

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +42 -3
README.md CHANGED
@@ -1,9 +1,14 @@
1
  ---
2
  library_name: peft
 
 
 
 
 
 
3
  ---
4
  ## Training procedure
5
 
6
-
7
  The following `bitsandbytes` quantization config was used during training:
8
  - load_in_8bit: True
9
  - load_in_4bit: False
@@ -14,7 +19,41 @@ The following `bitsandbytes` quantization config was used during training:
14
  - bnb_4bit_quant_type: fp4
15
  - bnb_4bit_use_double_quant: False
16
  - bnb_4bit_compute_dtype: float32
17
- ### Framework versions
18
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
19
 
20
- - PEFT 0.4.0
 
1
  ---
2
  library_name: peft
3
+ datasets:
4
+ - ChristophSchuhmann/essays-with-instructions
5
+ language:
6
+ - en
7
+ metrics:
8
+ - perplexity
9
  ---
10
  ## Training procedure
11
 
 
12
  The following `bitsandbytes` quantization config was used during training:
13
  - load_in_8bit: True
14
  - load_in_4bit: False
 
19
  - bnb_4bit_quant_type: fp4
20
  - bnb_4bit_use_double_quant: False
21
  - bnb_4bit_compute_dtype: float32
 
22
 
23
+ ### Model Description
24
+
25
+ For more information on how it was created, check out the following link: https://github.com/DunnBC22/NLP_Projects/blob/main/OPT%20Models/Essays%20With%20Instructions%20-%20Fine-Tune%20-%20OPT.ipynb
26
+
27
+ ### Intended uses & limitations
28
+
29
+ This is intended to show the possibilities. It is mainly limited by the input data.
30
+
31
+ ### Training & Evaluation Dataset
32
+
33
+ Dataset Source: https://huggingface.co/datasets/ChristophSchuhmann/essays-with-instructions
34
+
35
+ ### Hyperparameters Used
36
+
37
+ | Hyperperameter | Value |
38
+ |:-----:|:-----:|
39
+ | Model Checkpoint | facebook/opt-2.7b |
40
+ | per_device_train_batch_size | 8 |
41
+ | gradient_accumulation_steps | 4 |
42
+ | fp16 | True |
43
+ | warmup_steps | 75 |
44
+ | learning_rate | 2e-4 |
45
+ | Training Steps | 150 |
46
+
47
+ ### Framework versions
48
+
49
+ | Library | Version |
50
+ |:-----:|:-----:|
51
+ | Python | 3.10.1 |
52
+ | Torch | 2.0.1+cu118 |
53
+ | Datasets | 2.14.4 |
54
+ | Transformer | 4.31.0 |
55
+ | PEFT | 0.4.0 |
56
+
57
+ ### Metric
58
 
59
+ Perplexity = 9.46