What RMSprop fixes about AdaGrad: an exponential moving average instead of a sum

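RMSprop divides each update by the root of an exponentially decaying average of squared gradients, where AdaGrad uses the root of their cumulative sum. Because AdaGrad's sum can only grow, its effective learning rate shrinks toward zero over training; RMSprop's average forgets old gradients, so the step size stays responsive.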
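The difference between the two accumulators can be seen in a minimal sketch. The function names, the constant-gradient input, and the hyperparameter values (`lr=0.1`, `decay=0.9`, `eps=1e-8`) are illustrative assumptions, not taken from any particular library; the code tracks only the effective step size for a single scalar parameter.

```python
import math

def adagrad_step_sizes(grads, lr=0.1, eps=1e-8):
    """Effective step size lr / (sqrt(acc) + eps), with acc a running SUM of g^2."""
    acc = 0.0
    sizes = []
    for g in grads:
        acc += g * g  # cumulative sum: never decreases
        sizes.append(lr / (math.sqrt(acc) + eps))
    return sizes

def rmsprop_step_sizes(grads, lr=0.1, decay=0.9, eps=1e-8):
    """Same quantity, with acc an exponentially decaying AVERAGE of g^2."""
    acc = 0.0
    sizes = []
    for g in grads:
        acc = decay * acc + (1.0 - decay) * g * g  # old gradients fade out
        sizes.append(lr / (math.sqrt(acc) + eps))
    return sizes

# Hypothetical input: a gradient of constant magnitude for 1000 steps.
grads = [1.0] * 1000
ada = adagrad_step_sizes(grads)
rms = rmsprop_step_sizes(grads)

print(f"AdaGrad step size at step 1000: {ada[-1]:.5f}")
print(f"RMSprop step size at step 1000: {rms[-1]:.5f}")
```

Under these assumptions the AdaGrad step size keeps decaying roughly as `lr / sqrt(t)`, while the RMSprop step size settles near `lr`, since its accumulator converges to the recent mean of the squared gradient.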


