StepLaw/StepLaw-N_1.0B-D_19.0B-LR1.105e-02-BS2097152

@@ -27,13 +27,13 @@ This model is part of the [StepLaw-N_1.0B-D_19.0B](https://huggingface.co/collec
 ### Training Parameters
 - **Learning rate (lr)**: 1.105e-02
-- **Batch size (bs)**: 1024
+- **Batch size (bs)**: 2097152
 - **Training iterations**: 9536
 - **Training tokens (D)**: 20.0B
 ## Model Description
-StepLaw models are trained with various hyperparameter settings to enable research on scaling laws and hyperparameter optimization. This specific model was trained with learning rate 1.105e-02 and batch size 1024 for 9536 iterations, using a total of 20.0B training tokens.
+StepLaw models are trained with various hyperparameter settings to enable research on scaling laws and hyperparameter optimization. This specific model was trained with learning rate 1.105e-02 and batch size 2097152 for 9536 iterations, using a total of 20.0B training tokens.
 ## Usage Example
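
The change from 1024 to 2097152 can be sanity-checked against the other numbers in the card. The sketch below assumes that the new batch size is counted in tokens and the old one in sequences; the diff does not state the units, so the implied sequence length is an inference, not a documented value. The variable names are illustrative only.

```python
# Values copied from the model card diff above.
old_batch_size = 1_024        # removed value (assumed to count sequences)
new_batch_size = 2_097_152    # added value (assumed to count tokens)
iterations = 9_536

# If the new batch size is in tokens, tokens seen = batch size * iterations.
total_tokens = new_batch_size * iterations
print(f"new value: {total_tokens:,} tokens (~{total_tokens / 1e9:.1f}B)")

# The old value, read as tokens, falls far short of the stated 20.0B.
old_total = old_batch_size * iterations
print(f"old value: {old_total:,} tokens (~{old_total / 1e9:.4f}B)")

# Assumption: the ratio between the two values is the sequence length.
implied_seq_len = new_batch_size // old_batch_size
print(f"implied sequence length: {implied_seq_len}")
```

Under that assumption the new value reproduces the stated 20.0B training tokens, while the old value only does so if each of the 1024 batch items is a sequence of the implied length.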