Decoupled Weight Decay Regularization
Cited in this thesis
Frequently Cited Together
- Generalization and parameter estimation in feedforward nets: Some experiments (1 chapter)
- BERT: Pre-training of deep bidirectional transformers for language understanding (1 chapter)
- Idiot's Bayes—not so stupid after all? (1 chapter)
- Adaptive mixtures of local experts (1 chapter)
- Gaussian error linear units (GELUs) (1 chapter)
- Identification of biological tissues by rapid evaporative ionization mass spectr (1 chapter)
BibTeX
@article{Loshchilov2017,
author = {Loshchilov, Ilya and Hutter, Frank},
journal = {arXiv preprint arXiv:1711.05101},
title = {Decoupled weight decay regularization},
year = {2017},
}