Maxout Networks

来源:互联网 发布:c语言if else嵌套 编辑:程序博客网 时间:2024/06/09 23:17

Motivation

  • in multiple dimensions a maxout unit can approximate arbitrary convex functions

    这里写图片描述

Contributions

  • maxout is cross channel pooling
  • maxout enhances dropout’s abilities as a model averaging technique.
  • Dropout is generally viewed as an indiscriminately applicable tool that reliably yields a modest improvement in performance when applied to almost any model.

Experiments

  • better performance
    这里写图片描述

    这里写图片描述
    (Rectifier units do best without cross-channel pooling but with the same number of filters, meaning that the size of the state and the number of parameters must be about k times higher for rectifiers to obtain generalization performance approaching that of maxout.)

  • The activations of maxout units are not sparse
    这里写图片描述

  • Model averaging better
    这里写图片描述

这里写图片描述