pgptlformer-tinystories / re-pqt-rmsXrms-ATTNII_rev2-d46b31ce-94f1-41c7-89cb-358a7a3f316b.txt
SQCU's picture
compiled models train faster so you can train more of them in a short experiment, to better convergence.
921107d verified
raw
history contribute delete
570 kB
File too large to display, you can check the raw version instead.