qwen_new_mage_all_domains_balanced_1.5

Files changed (5) hide show

README.md CHANGED Viewed

@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [Qwen/Qwen1.5-1.8B](https://huggingface.co/Qwen/Qwen1.5-1.8B) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2426
-- Accuracy: 0.9020
 ## Model description
@@ -38,25 +38,27 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
 - train_batch_size: 32
 - eval_batch_size: 32
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 3
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Accuracy |
 |:-------------:|:------:|:----:|:---------------:|:--------:|
-| 0.7734        | 0.0172 | 100  | 0.3509          | 0.8435   |
-| 0.4233        | 0.0344 | 200  | 0.6423          | 0.7446   |
-| 0.349         | 0.0516 | 300  | 0.3206          | 0.8669   |
-| 0.2886        | 0.0688 | 400  | 0.2244          | 0.9020   |
-| 0.2777        | 0.0860 | 500  | 0.6114          | 0.7797   |
-| 0.246         | 0.1032 | 600  | 0.3122          | 0.8723   |
-| 0.2295        | 0.1204 | 700  | 0.2426          | 0.9020   |
 ### Framework versions

 This model is a fine-tuned version of [Qwen/Qwen1.5-1.8B](https://huggingface.co/Qwen/Qwen1.5-1.8B) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1147
+- Accuracy: 0.9703
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 5e-06
 - train_batch_size: 32
 - eval_batch_size: 32
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 1
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Accuracy |
 |:-------------:|:------:|:----:|:---------------:|:--------:|
+| 0.1945        | 0.0860 | 500  | 0.4187          | 0.8732   |
+| 0.171         | 0.1720 | 1000 | 0.1541          | 0.9371   |
+| 0.1235        | 0.2580 | 1500 | 0.1469          | 0.9451   |
+| 0.1233        | 0.3441 | 2000 | 0.1433          | 0.9532   |
+| 0.1109        | 0.4301 | 2500 | 0.1395          | 0.9541   |
+| 0.108         | 0.5161 | 3000 | 0.1134          | 0.9622   |
+| 0.0864        | 0.6021 | 3500 | 0.1452          | 0.9649   |
+| 0.0743        | 0.6881 | 4000 | 0.1336          | 0.9676   |
+| 0.0637        | 0.7741 | 4500 | 0.1147          | 0.9703   |
 ### Framework versions

evaluation_results.json ADDED Viewed

+{
+    "eval_loss": 0.24256853759288788,
+    "eval_accuracy": 0.9019784172661871,
+    "eval_runtime": 35.5314,
+    "eval_samples_per_second": 31.296,
+    "eval_steps_per_second": 0.985,
+    "epoch": 0.12041974883880957
+}

model-00001-of-00002.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:057860dfb9560475776b1326f6e72be637d13e6400017efffe93dc70857ceebf
 size 4955308912

 version https://git-lfs.github.com/spec/v1
+oid sha256:35a4e2c95d89066e9c03c121a80c973fd69c2e3ae6e2c593e40709687d1b3ed4
 size 4955308912

model-00002-of-00002.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f9bee3b96ec81e21c8f142da8a69ea1f4b047d1f344228194ef2f310a61f05d6
 size 1147395408

 version https://git-lfs.github.com/spec/v1
+oid sha256:8dd3e778230e8c8e93f4df9a72a0a124e61c394cd68abbff590dc5b9f2fc7f7b
 size 1147395408

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1f4fc2d11e3a36eb8f1706e041cee1728cccf1e3333e5638bee1bccf7879b7eb
 size 5304

 version https://git-lfs.github.com/spec/v1
+oid sha256:de7dc619012be1971047dc79b3b31c3ea7823352c117be769699db889070abbf
 size 5304