Update README.md
README.md
CHANGED
@@ -21,7 +21,7 @@ Hyperparams used to train this model:
 "eval_steps": 50,
 "vocab_size": 50257,
 "warmup_tokens": 10000,
-"gradient_accumulation_steps":
+"gradient_accumulation_steps": 16
 ```
 ---
 EXAMPLE USAGE