Weight Decay & Batch Normalization

Source: Internet  Editor: 程序博客网  Date: 2024/05/21 13:53

For now I'm just posting these two excellent resources; I'll organize my notes on them over the weekend.

weight decay:

https://stats.stackexchange.com/questions/29130/difference-between-neural-net-weight-decay-and-learning-rate

batch normalization:

https://kratzert.github.io/2016/02/12/understanding-the-gradient-flow-through-the-batch-normalization-layer.html
