PDF: Scaling Law with Learning Rate Annealing (OpenReview)

Leo Migdal