cosine annealing does: lr = lr_min + 0.5(lr_max - lr_min)(1 + cos(πt/T))

Cosine annealing adjusts learning rate cyclically between a maximum and minimum value over time

Image: ManfredKloeppel, CC BY 3.0, via Wikimedia Commons

cosine annealing does: lr = lr_min + 0.5(lr_max - lr_min)(1 + cos(πt/T))

Cosine annealing adjusts learning rate cyclically between a maximum and minimum value over time

Related concepts

One email a day: 5 concepts + the 5 stories that matter →

Swipe through 100 ML concepts daily

Open TickerNews