Why don't we use non-constant learning rates for gradient descent for things other than neural networks? – stats.stackexchange.com

Deep learning literature is full of clever tricks involving non-constant learning rates in gradient descent. Schedules and adaptive methods such as exponential decay, RMSprop, and Adagrad are easy to implement and are ...
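As a rough illustration of what the question means by non-constant learning rates, here is a minimal sketch of two of the techniques it names, exponential decay and an Adagrad-style adaptive step, applied to a simple quadratic objective. The objective, step counts, and hyperparameter values are assumptions chosen only for demonstration, not anything from the linked question.

```python
import numpy as np

def grad(x):
    # Gradient of the assumed toy objective f(x) = 0.5 * ||x||^2
    return x

x = np.array([5.0, -3.0])
lr0, decay_rate = 0.5, 0.05
accum = np.zeros_like(x)          # Adagrad accumulator of squared gradients
eps = 1e-8

for t in range(100):
    g = grad(x)

    # Exponential decay: learning rate shrinks geometrically with the step count
    lr_decayed = lr0 * np.exp(-decay_rate * t)

    # Adagrad-style step: each coordinate's rate is scaled by its gradient history
    accum += g ** 2
    lr_adaptive = lr0 / (np.sqrt(accum) + eps)

    # Use one of the schedules (here the decayed scalar rate) for the update
    x = x - lr_decayed * g

print(x)  # should be close to the minimizer at the origin
```

Both variants replace the single fixed step size of plain gradient descent with one that changes over iterations (exponential decay) or per coordinate (Adagrad), which is the distinction the question is asking about.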

from Hot Questions – Stack Exchange
