语义分割--Large Kernel Matters--Improve Semantic Segmentation by Global Convolutional Network

来源：互联网发布：mac os 显示内容放大编辑：程序博客网时间：2024/05/16 05:09

Large Kernel Matters–Improve Semantic Segmentation by Global Convolutional Network
https://arxiv.org/abs/1703.02719

语义分割问题需要同时解决两个问题：classification 和 localization，将图像中的每个物体精确分割出来，同时对每个物体进行分类。分类和定位这两个问题对于CNN 设计要求有所区别。
For the classification task, the models are required to be invariant to various transformations like translation and rotation.

But for the localization task, models should be transformation-sensitive, i.e., precisely locate every pixel for each semantic category
这里写图片描述
当前的语义分割算法主要侧重于 localization , which may be suboptimal for classification

怎么解决这个contradictory 了？这里我们的策略是使用 Large Kernel

这里我们设计了一个 Global Convolutional Network 采用 Large Kernel
from the localization view, the structure must be fully-convolutional without any fully-connected layer or global pooling layer that used by many classification networks,since the latter will discard localization information

from the classification view, motivated by the densely-connected structure of classification models, the kernel size of the convolutional structure should be as large as possible.

这里写图片描述

对于 GCN 模块计算量问题： Instead of directly using larger kernel or global convolution, our GCN module employs a
combination of 1 × k + k × 1 and k × 1 + 1 × k convolutions, which enables densely connections within a large k×k region inthe feature map.

为了提升物体边缘分割精度，提出Boundary Refinement
we propose a Boundary Refinement (BR) block shown in Figure 2 C. Here, we models the boundary alignment as a residual structure.

Kernal 尺寸越大，效果越好
这里写图片描述