EndOfLe committed on
Commit bd3d668 · verified · 1 Parent(s): ab71cfa

qwen_new_mage_all_domains_balanced_1.5

README.md CHANGED
@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [Qwen/Qwen1.5-1.8B](https://huggingface.co/Qwen/Qwen1.5-1.8B) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.2426
- - Accuracy: 0.9020
+ - Loss: 0.1147
+ - Accuracy: 0.9703

  ## Model description

@@ -38,25 +38,27 @@ More information needed
  ### Training hyperparameters

  The following hyperparameters were used during training:
- - learning_rate: 2e-05
+ - learning_rate: 5e-06
  - train_batch_size: 32
  - eval_batch_size: 32
  - seed: 42
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
- - num_epochs: 3
+ - num_epochs: 1

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
  |:-------------:|:------:|:----:|:---------------:|:--------:|
- | 0.7734 | 0.0172 | 100 | 0.3509 | 0.8435 |
- | 0.4233 | 0.0344 | 200 | 0.6423 | 0.7446 |
- | 0.349 | 0.0516 | 300 | 0.3206 | 0.8669 |
- | 0.2886 | 0.0688 | 400 | 0.2244 | 0.9020 |
- | 0.2777 | 0.0860 | 500 | 0.6114 | 0.7797 |
- | 0.246 | 0.1032 | 600 | 0.3122 | 0.8723 |
- | 0.2295 | 0.1204 | 700 | 0.2426 | 0.9020 |
+ | 0.1945 | 0.0860 | 500 | 0.4187 | 0.8732 |
+ | 0.171 | 0.1720 | 1000 | 0.1541 | 0.9371 |
+ | 0.1235 | 0.2580 | 1500 | 0.1469 | 0.9451 |
+ | 0.1233 | 0.3441 | 2000 | 0.1433 | 0.9532 |
+ | 0.1109 | 0.4301 | 2500 | 0.1395 | 0.9541 |
+ | 0.108 | 0.5161 | 3000 | 0.1134 | 0.9622 |
+ | 0.0864 | 0.6021 | 3500 | 0.1452 | 0.9649 |
+ | 0.0743 | 0.6881 | 4000 | 0.1336 | 0.9676 |
+ | 0.0637 | 0.7741 | 4500 | 0.1147 | 0.9703 |


  ### Framework versions
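
As a rough sanity check on the new training log, the Step and Epoch columns imply how many optimizer steps make up one epoch. The sketch below assumes a single device and no gradient accumulation (neither is stated in the README), so the example count is only an estimate. The names `steps_per_epoch` and `approx_train_examples` are illustrative and do not come from the commit.

```python
# (step, epoch) pairs copied from the new "Training results" table.
log = [
    (500, 0.0860), (1000, 0.1720), (1500, 0.2580),
    (2000, 0.3441), (2500, 0.4301), (3000, 0.5161),
    (3500, 0.6021), (4000, 0.6881), (4500, 0.7741),
]

TRAIN_BATCH_SIZE = 32  # train_batch_size from the README hyperparameters


def steps_per_epoch(rows):
    """Estimate optimizer steps per epoch from the last logged row.

    The last row is used because the 4-decimal rounding of the epoch
    column has the smallest relative effect there.
    """
    step, epoch = rows[-1]
    return step / epoch


spe = steps_per_epoch(log)

# Assumption: one device, no gradient accumulation, so each step
# consumes exactly TRAIN_BATCH_SIZE examples.
approx_train_examples = spe * TRAIN_BATCH_SIZE

print(f"~{spe:.0f} steps per epoch, ~{approx_train_examples:.0f} training examples")
```

Under those assumptions the log points to roughly 5,800 steps per epoch. The last logged row sits at epoch 0.7741, so the table does not cover the full epoch configured by `num_epochs: 1`.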
evaluation_results.json ADDED
@@ -0,0 +1,8 @@
+ {
+ "eval_loss": 0.24256853759288788,
+ "eval_accuracy": 0.9019784172661871,
+ "eval_runtime": 35.5314,
+ "eval_samples_per_second": 31.296,
+ "eval_steps_per_second": 0.985,
+ "epoch": 0.12041974883880957
+ }
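
The throughput fields in this file can be cross-checked against each other. The sketch below is a hedged consistency check, not part of the commit: it assumes `eval_accuracy` is simply correct predictions divided by evaluated samples, and it treats the derived counts as estimates because the runtime and throughput values are rounded. Note that the loss, accuracy, and epoch here match the step-700 row of the previous README table rather than the updated results.

```python
import json
import math

# Contents of evaluation_results.json as added in this commit.
raw = """
{
  "eval_loss": 0.24256853759288788,
  "eval_accuracy": 0.9019784172661871,
  "eval_runtime": 35.5314,
  "eval_samples_per_second": 31.296,
  "eval_steps_per_second": 0.985,
  "epoch": 0.12041974883880957
}
"""
results = json.loads(raw)

EVAL_BATCH_SIZE = 32  # eval_batch_size from the README hyperparameters

# runtime * throughput recovers the approximate sample and batch counts.
n_samples = round(results["eval_runtime"] * results["eval_samples_per_second"])
n_steps = round(results["eval_runtime"] * results["eval_steps_per_second"])

# Assumption: accuracy = correct / total, so this estimates the correct count.
n_correct = round(results["eval_accuracy"] * n_samples)

# The batch count should equal ceil(samples / batch size) if the fields agree.
batches_expected = math.ceil(n_samples / EVAL_BATCH_SIZE)

print(f"samples={n_samples} steps={n_steps} correct={n_correct} "
      f"expected_batches={batches_expected}")
```

If the derived batch count and `batches_expected` agree, the file is at least internally consistent with an evaluation batch size of 32.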
model-00001-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:057860dfb9560475776b1326f6e72be637d13e6400017efffe93dc70857ceebf
+ oid sha256:35a4e2c95d89066e9c03c121a80c973fd69c2e3ae6e2c593e40709687d1b3ed4
  size 4955308912
model-00002-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:f9bee3b96ec81e21c8f142da8a69ea1f4b047d1f344228194ef2f310a61f05d6
+ oid sha256:8dd3e778230e8c8e93f4df9a72a0a124e61c394cd68abbff590dc5b9f2fc7f7b
  size 1147395408
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:1f4fc2d11e3a36eb8f1706e041cee1728cccf1e3333e5638bee1bccf7879b7eb
+ oid sha256:de7dc619012be1971047dc79b3b31c3ea7823352c117be769699db889070abbf
  size 5304
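
The three binary files above are stored as Git LFS pointers, so the diff only shows a changed SHA-256 `oid` next to an unchanged `size`. A minimal sketch of how such a pointer could be parsed and checked against a downloaded file follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the code handles only the three keys that appear in this commit.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file ("key value" per line) into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid field has the form "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict) -> bool:
    """Return True if the file at `path` has the pointer's size and SHA-256 digest."""
    hasher = hashlib.sha256()
    size = 0
    with open(path, "rb") as f:
        # Read in 1 MiB chunks so multi-gigabyte shards are not loaded at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# New pointer for training_args.bin, copied from the diff above.
new_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:de7dc619012be1971047dc79b3b31c3ea7823352c117be769699db889070abbf
size 5304
""")
print(new_pointer["algo"], new_pointer["size"])
```

After downloading a file, `matches_pointer("training_args.bin", new_pointer)` would report whether the local copy corresponds to this commit. The local path is an assumption about where the file was saved.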