What learning rate warmup does: it starts with a small learning rate to avoid instability early in training

Learning rate warmup gradually increases the learning rate from zero to a predefined target value over the first training steps, which stabilizes the early phase of training.
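
As a rough illustration, here is a minimal sketch of one common form of warmup: a linear ramp from zero to the target value, followed by a constant rate. The linear shape, the function name `warmup_lr`, and the parameters `target_lr` and `warmup_steps` are assumptions made for this example; the description above does not specify them.

```python
def warmup_lr(step: int, target_lr: float, warmup_steps: int) -> float:
    """Linear warmup (illustrative): 0 at step 0, target_lr from warmup_steps onward."""
    if warmup_steps <= 0:
        # No warmup requested: use the target rate immediately.
        return target_lr
    # Fraction of the warmup completed, capped at 1.0 once warmup is over.
    progress = min(1.0, step / warmup_steps)
    return target_lr * progress


if __name__ == "__main__":
    # Hypothetical settings: ramp to 1e-3 over the first 1000 steps.
    for step in (0, 250, 500, 1000, 2000):
        print(step, warmup_lr(step, target_lr=1e-3, warmup_steps=1000))
```

In practice the value returned for each step would be passed to the optimizer before that step's update, and warmup is often followed by a decay schedule rather than a constant rate.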



