Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection

Source: Internet   Editor: 程序博客网   Date: 2024/06/04 00:46

Previous research has focused only on using rectangular bounding boxes or horizontal sliding windows to localize text, which may result in redundant background noise, unnecessary overlap, or even information loss. To address these issues, we propose a new Convolutional Neural Network (CNN) based method, named Deep Matching Prior Network (DMPNet), to detect text with a tighter quadrangle.

quadrangle: a four-sided polygon (quadrilateral)

  • first, roughly recall text with quadrilateral sliding windows;
  • then, use a shared Monte-Carlo method to compute polygonal areas quickly and accurately;
  • finally, finely localize text with a quadrangle, and design a Smooth Ln loss to
    moderately adjust the predicted bounding box.
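
The Smooth Ln loss from the last step can be sketched in a few lines. The formula used here, (|d| + 1) ln(|d| + 1) - |d|, is my reading of the paper's definition and should be checked against the paper itself. `smooth_l1` is included only for comparison, and all function names are made up for this example.

```python
import math


def smooth_ln(d):
    """Assumed Smooth Ln loss: (|d| + 1) * ln(|d| + 1) - |d|."""
    a = abs(d)
    return (a + 1.0) * math.log(a + 1.0) - a


def smooth_ln_grad(d):
    """Derivative of smooth_ln with respect to d: sign(d) * ln(|d| + 1)."""
    return math.copysign(math.log(abs(d) + 1.0), d)


def smooth_l1(d):
    """Standard smooth L1 loss, shown only for comparison."""
    a = abs(d)
    return 0.5 * a * a if a < 1.0 else a - 0.5


if __name__ == "__main__":
    for d in (0.1, 1.0, 5.0):
        print(d, smooth_ln(d), smooth_ln_grad(d), smooth_l1(d))
```

Under this definition the gradient magnitude is ln(|d| + 1): it is smooth everywhere and grows slowly with the error, whereas the smooth L1 gradient is clipped to 1 once |d| reaches 1.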

Design a few more sliding windows


Improve how the overlap is computed
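
Why sampling helps here: the exact intersection area of two quadrangles needs polygon clipping, while a Monte Carlo estimate only needs a point-in-polygon test. Below is a minimal sketch, assuming convex quadrangles whose vertices are listed in order. It is a plain Monte Carlo IoU estimate, not the paper's shared variant, and `mc_iou`, `n`, and `seed` are hypothetical names chosen for this example.

```python
import random


def cross(o, a, b):
    """Z-component of the cross product of vectors (a - o) and (b - o)."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def inside_convex(p, quad):
    """True if point p lies inside (or on the edge of) a convex quadrangle."""
    signs = [cross(quad[i], quad[(i + 1) % 4], p) for i in range(4)]
    return all(s >= 0 for s in signs) or all(s <= 0 for s in signs)


def mc_iou(quad_a, quad_b, n=20000, seed=0):
    """Estimate intersection-over-union of two convex quadrangles by sampling."""
    rng = random.Random(seed)
    pts = list(quad_a) + list(quad_b)
    xs = [p[0] for p in pts]
    ys = [p[1] for p in pts]
    inter = union = 0
    for _ in range(n):
        # Sample uniformly in the axis-aligned box that encloses both shapes.
        p = (rng.uniform(min(xs), max(xs)), rng.uniform(min(ys), max(ys)))
        in_a = inside_convex(p, quad_a)
        in_b = inside_convex(p, quad_b)
        inter += in_a and in_b
        union += in_a or in_b
    return inter / union if union else 0.0


if __name__ == "__main__":
    a = [(0, 0), (2, 0), (2, 2), (0, 2)]
    b = [(1, 0), (3, 0), (3, 2), (1, 2)]
    print(mc_iou(a, b))  # exact IoU for these two boxes is 1/3
```

The estimate gets more accurate as `n` grows, at the cost of more point-in-polygon tests per pair of shapes.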


Regress ten parameters
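
A quadrangle has four vertices, i.e. eight coordinates, so ten parameters implies a redundant, relative encoding. The sketch below assumes one plausible layout: the center (x, y) of the enclosing axis-aligned rectangle plus an offset (w_i, h_i) from that center to each vertex. This layout and the names `encode_quad` and `decode_quad` are assumptions for illustration; the note above only states that ten parameters are regressed.

```python
def encode_quad(quad):
    """Encode 4 vertices as [cx, cy, w1, h1, w2, h2, w3, h3, w4, h4].

    (cx, cy) is the center of the axis-aligned rectangle enclosing the
    quadrangle; (w_i, h_i) is the offset from that center to vertex i.
    This layout is an assumption made for the example.
    """
    xs = [p[0] for p in quad]
    ys = [p[1] for p in quad]
    cx = (min(xs) + max(xs)) / 2.0
    cy = (min(ys) + max(ys)) / 2.0
    params = [cx, cy]
    for px, py in quad:
        params += [px - cx, py - cy]
    return params


def decode_quad(params):
    """Invert encode_quad: recover the 4 vertices from the 10 parameters."""
    cx, cy = params[0], params[1]
    return [(cx + params[i], cy + params[i + 1]) for i in range(2, 10, 2)]


if __name__ == "__main__":
    quad = [(1, 1), (5, 2), (6, 5), (0, 4)]
    params = encode_quad(quad)
    print(params)
    print(decode_quad(params))
```

With an encoding like this, a network would predict the differences between the ten parameters of a matched sliding window and those of the ground truth, rather than absolute vertex positions.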


In future, we will explore using shape-adaptive sliding windows toward tighter scene text detection.
