rigarashi committed on
Commit e977129 · verified · 1 Parent(s): 6693864

Model save

Files changed (2)
  1. README.md +14 -44
  2. model.safetensors +1 -1
README.md CHANGED
@@ -1,28 +1,13 @@
 ---
-base_model: google/mt5-small
-datasets:
-- arrow
 license: apache-2.0
-metrics:
-- bleu
+base_model: google/mt5-base
 tags:
 - generated_from_trainer
+metrics:
+- bleu
 model-index:
 - name: mt5-base-ainu
-  results:
-  - task:
-      type: text2text-generation
-      name: Sequence-to-sequence Language Modeling
-    dataset:
-      name: arrow
-      type: arrow
-      config: default
-      split: None
-      args: default
-    metrics:
-    - type: bleu
-      value: 35.096583697771194
-      name: Bleu
+  results: []
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -30,10 +15,10 @@ should probably proofread and complete it, then remove this comment. -->
 
 # mt5-base-ainu
 
-This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the arrow dataset.
+This model is a fine-tuned version of [google/mt5-base](https://huggingface.co/google/mt5-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.3701
-- Bleu: 35.0966
+- Loss: 3.5796
+- Bleu: 0.1912
 
 ## Model description
 
@@ -65,28 +50,13 @@ The following hyperparameters were used during training:
 
 ### Training results
 
-| Training Loss | Epoch | Step | Validation Loss | Bleu |
-|:-------------:|:-------:|:------:|:---------------:|:-------:|
-| 2.73 | 0.9999 | 9341 | 2.2938 | 11.6604 |
-| 2.0536 | 2.0 | 18683 | 1.7883 | 19.8648 |
-| 1.7955 | 2.9999 | 28024 | 1.6060 | 24.9245 |
-| 1.6331 | 4.0 | 37366 | 1.5038 | 27.6152 |
-| 1.5107 | 4.9999 | 46707 | 1.4442 | 28.4249 |
-| 1.4255 | 6.0 | 56049 | 1.4058 | 30.0986 |
-| 1.3373 | 6.9999 | 65390 | 1.3769 | 30.9550 |
-| 1.2758 | 8.0 | 74732 | 1.3546 | 31.6075 |
-| 1.2259 | 8.9999 | 84073 | 1.3380 | 32.0155 |
-| 1.1555 | 10.0 | 93415 | 1.3318 | 32.5095 |
-| 1.1166 | 10.9999 | 102756 | 1.3263 | 32.9619 |
-| 1.0564 | 12.0 | 112098 | 1.3220 | 33.5983 |
-| 1.0222 | 12.9999 | 121439 | 1.3162 | 33.7293 |
-| 0.9764 | 14.0 | 130781 | 1.3304 | 34.0143 |
-| 0.9472 | 14.9999 | 140122 | 1.3353 | 34.3792 |
-| 0.9034 | 16.0 | 149464 | 1.3429 | 34.4296 |
-| 0.8732 | 16.9999 | 158805 | 1.3470 | 34.5817 |
-| 0.8465 | 18.0 | 168147 | 1.3554 | 34.9795 |
-| 0.8171 | 18.9999 | 177488 | 1.3618 | 34.9659 |
-| 0.7912 | 19.9989 | 186820 | 1.3701 | 35.0966 |
+| Training Loss | Epoch | Step | Validation Loss | Bleu |
+|:-------------:|:-----:|:----:|:---------------:|:------:|
+| No log | 1.0 | 25 | 7.8298 | 0.0139 |
+| No log | 2.0 | 50 | 4.7659 | 0.3532 |
+| No log | 3.0 | 75 | 3.9025 | 0.2150 |
+| No log | 4.0 | 100 | 3.7058 | 0.2334 |
+| No log | 5.0 | 125 | 3.5796 | 0.1912 |
 
 
 ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9d2aaca3d6c7e880e5babba51d8e1093a502645168a43bcb2fae5e048a3bd380
+oid sha256:66b038fcfeed81a2ca3d241c532cf898a062642d8afcb769a70552f8b523c8e3
 size 2329638768
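
The model.safetensors change above touches only the Git LFS pointer: the size is unchanged and the `oid sha256:` value differs, so the weights were replaced by a file of the same length. A minimal sketch of how a downloaded copy could be checked against the new pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, introduced only for illustration, and the sketch assumes the pointer uses the three-line `version` / `oid` / `size` layout shown in the diff.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into (sha256_hex, size_in_bytes)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", e.g. "size 2329638768".
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo}")
    return digest, int(fields["size"])


def matches_pointer(path, pointer_text, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and SHA-256 the pointer records."""
    expected_digest, expected_size = parse_lfs_pointer(pointer_text)
    # Cheap check first: a size mismatch avoids hashing a multi-gigabyte file.
    if os.path.getsize(path) != expected_size:
        return False
    hasher = hashlib.sha256()
    with open(path, "rb") as f:
        # Stream the file in chunks so it never has to fit in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == expected_digest


# Pointer contents copied from the "+" side of the diff above.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:66b038fcfeed81a2ca3d241c532cf898a062642d8afcb769a70552f8b523c8e3
size 2329638768
"""
```

With a local copy of the weights (the path here is an assumption), `matches_pointer("model.safetensors", NEW_POINTER)` would return `True` only if the file is the one this commit points to.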