aynumosir/mt5-base-ainu

rigarashi committed on Jul 28, 2024

Commit 7375c26 (verified) · 1 Parent(s): 5222a8f

Model save

Files changed (1)

README.md +29 -29

@@ -1,18 +1,18 @@
 ---
 base_model: google/mt5-small
 datasets:
 - arrow
-license: apache-2.0
 metrics:
 - bleu
-tags:
-- generated_from_trainer
 model-index:
 - name: mt5-base-ainu
   results:
   - task:
-      type: text2text-generation
       name: Sequence-to-sequence Language Modeling
     dataset:
       name: arrow
       type: arrow
@@ -20,9 +20,9 @@ model-index:
       split: None
       args: default
     metrics:
-    - type: bleu
-      value: 34.75882910529557
-      name: Bleu
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the arrow dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.3717
-- Bleu: 34.7588
 ## Model description
@@ -67,26 +67,26 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch   | Step   | Validation Loss | Bleu    |
 |:-------------:|:-------:|:------:|:---------------:|:-------:|
-| 2.7342        | 0.9999  | 9341   | 2.2952          | 8.7599  |
-| 2.0583        | 2.0     | 18683  | 1.7903          | 19.5422 |
-| 1.8006        | 2.9999  | 28024  | 1.6075          | 24.0648 |
-| 1.6417        | 4.0     | 37366  | 1.5050          | 27.5308 |
-| 1.516         | 4.9999  | 46707  | 1.4466          | 28.5774 |
-| 1.4319        | 6.0     | 56049  | 1.4077          | 29.7452 |
-| 1.339         | 6.9999  | 65390  | 1.3762          | 30.7138 |
-| 1.2797        | 8.0     | 74732  | 1.3575          | 31.1331 |
-| 1.2266        | 8.9999  | 84073  | 1.3404          | 31.8717 |
-| 1.1595        | 10.0    | 93415  | 1.3375          | 32.4945 |
-| 1.1193        | 10.9999 | 102756 | 1.3315          | 32.6273 |
-| 1.0606        | 12.0    | 112098 | 1.3252          | 33.4770 |
-| 1.0273        | 12.9999 | 121439 | 1.3216          | 33.7973 |
-| 0.982         | 14.0    | 130781 | 1.3328          | 33.9583 |
-| 0.9462        | 14.9999 | 140122 | 1.3364          | 33.9590 |
-| 0.9033        | 16.0    | 149464 | 1.3472          | 34.1416 |
-| 0.8785        | 16.9999 | 158805 | 1.3499          | 34.3651 |
-| 0.8484        | 18.0    | 168147 | 1.3571          | 34.7063 |
-| 0.815         | 18.9999 | 177488 | 1.3641          | 34.6037 |
-| 0.7957        | 19.9989 | 186820 | 1.3717          | 34.7588 |
 ### Framework versions

 ---
+license: apache-2.0
 base_model: google/mt5-small
+tags:
+- generated_from_trainer
 datasets:
 - arrow
 metrics:
 - bleu
 model-index:
 - name: mt5-base-ainu
   results:
   - task:
       name: Sequence-to-sequence Language Modeling
+      type: text2text-generation
     dataset:
       name: arrow
       type: arrow
       split: None
       args: default
     metrics:
+    - name: Bleu
+      type: bleu
+      value: 35.096583697771194
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the arrow dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.3701
+- Bleu: 35.0966
 ## Model description
 | Training Loss | Epoch   | Step   | Validation Loss | Bleu    |
 |:-------------:|:-------:|:------:|:---------------:|:-------:|
+| 2.73          | 0.9999  | 9341   | 2.2938          | 11.6604 |
+| 2.0536        | 2.0     | 18683  | 1.7883          | 19.8648 |
+| 1.7955        | 2.9999  | 28024  | 1.6060          | 24.9245 |
+| 1.6331        | 4.0     | 37366  | 1.5038          | 27.6152 |
+| 1.5107        | 4.9999  | 46707  | 1.4442          | 28.4249 |
+| 1.4255        | 6.0     | 56049  | 1.4058          | 30.0986 |
+| 1.3373        | 6.9999  | 65390  | 1.3769          | 30.9550 |
+| 1.2758        | 8.0     | 74732  | 1.3546          | 31.6075 |
+| 1.2259        | 8.9999  | 84073  | 1.3380          | 32.0155 |
+| 1.1555        | 10.0    | 93415  | 1.3318          | 32.5095 |
+| 1.1166        | 10.9999 | 102756 | 1.3263          | 32.9619 |
+| 1.0564        | 12.0    | 112098 | 1.3220          | 33.5983 |
+| 1.0222        | 12.9999 | 121439 | 1.3162          | 33.7293 |
+| 0.9764        | 14.0    | 130781 | 1.3304          | 34.0143 |
+| 0.9472        | 14.9999 | 140122 | 1.3353          | 34.3792 |
+| 0.9034        | 16.0    | 149464 | 1.3429          | 34.4296 |
+| 0.8732        | 16.9999 | 158805 | 1.3470          | 34.5817 |
+| 0.8465        | 18.0    | 168147 | 1.3554          | 34.9795 |
+| 0.8171        | 18.9999 | 177488 | 1.3618          | 34.9659 |
+| 0.7912        | 19.9989 | 186820 | 1.3701          | 35.0966 |
 ### Framework versions
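
The training tables on both sides of this diff can be compared with a few lines of Python. The sketch below is illustrative only: the rows are copied from the updated (+) side of the diff, the two final-row values for the previous run come from the removed (-) side, and the variable names are made up for this example. It locates the checkpoint with the lowest validation loss and the one with the highest BLEU, then computes how much the final metrics moved between the two runs.

```python
# Rows copied from the updated (+) side of the diff:
# (epoch, step, validation_loss, bleu). Names here are illustrative only.
new_run = [
    (0.9999, 9341, 2.2938, 11.6604),
    (2.0, 18683, 1.7883, 19.8648),
    (2.9999, 28024, 1.6060, 24.9245),
    (4.0, 37366, 1.5038, 27.6152),
    (4.9999, 46707, 1.4442, 28.4249),
    (6.0, 56049, 1.4058, 30.0986),
    (6.9999, 65390, 1.3769, 30.9550),
    (8.0, 74732, 1.3546, 31.6075),
    (8.9999, 84073, 1.3380, 32.0155),
    (10.0, 93415, 1.3318, 32.5095),
    (10.9999, 102756, 1.3263, 32.9619),
    (12.0, 112098, 1.3220, 33.5983),
    (12.9999, 121439, 1.3162, 33.7293),
    (14.0, 130781, 1.3304, 34.0143),
    (14.9999, 140122, 1.3353, 34.3792),
    (16.0, 149464, 1.3429, 34.4296),
    (16.9999, 158805, 1.3470, 34.5817),
    (18.0, 168147, 1.3554, 34.9795),
    (18.9999, 177488, 1.3618, 34.9659),
    (19.9989, 186820, 1.3701, 35.0966),
]

# Final evaluation values from the removed (-) side of the diff.
old_final_loss, old_final_bleu = 1.3717, 34.7588

# Checkpoint with the lowest validation loss and the one with the highest BLEU.
best_loss_row = min(new_run, key=lambda row: row[2])
best_bleu_row = max(new_run, key=lambda row: row[3])

# Change in the final-row metrics between the previous and updated runs.
new_final_loss, new_final_bleu = new_run[-1][2], new_run[-1][3]
loss_delta = round(new_final_loss - old_final_loss, 4)
bleu_delta = round(new_final_bleu - old_final_bleu, 4)

print("lowest validation loss at step", best_loss_row[1], "->", best_loss_row[2])
print("highest BLEU at step", best_bleu_row[1], "->", best_bleu_row[3])
print("final loss change:", loss_delta, "| final BLEU change:", bleu_delta)
```

In the updated run, validation loss bottoms out well before the last step while BLEU keeps rising to the final row, so which checkpoint counts as "best" depends on the metric used for selection.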