文字检测与识别资源
来源:互联网 发布:c语言disc是什么意思 编辑:程序博客网 时间:2024/05/21 06:13
综述
[2015-PAMI-Overview]Text Detection and Recognition in Imagery: A Survey[paper]
[2014-Front.Comput.Sci-Overview]Scene Text Detection and Recognition: Recent Advances and Future Trends[paper]
自然场景文字检测
[2017-AAAI] TextBoxes: A Fast TextDetector with a Single Deep Neural Network[paper][code]
[2016-arXiv]Accurate Text Localization in Natural Image with Cascaded Convolutional TextNetwork [paper]
[2016-arXiv]DeepText : A Unified Framework for Text Proposal Generation and Text Detectionin Natural Images [paper] [data]
[2016-arXiv]TextProposals: a Text-specific Selective Search Algorithm for Word Spotting inthe Wild [paper] [code]
[2016-arXiv] SceneText Detection via Holistic, Multi-Channel Prediction [paper]
[2016-CVPR] CannyText Detector: Fast and Robust Scene Text Localization Algorithm [paper]
[2016-CVPR]Synthetic Data for Text Localisation in Natural Images [paper] [data][code]
[2016-ECCV]Detecting Text in Natural Image with Connectionist Text Proposal Network[paper][demo][code]
[2016-TIP]Text-Attentional Convolutional Neural Networks for Scene Text Detection [paper]
[2016-IJDAR]TextCatcher: a method to detect curved and challenging text in natural scenes[paper]
[2016-CVPR]Multi-oriented text detection with fully convolutional networks [paper]
[2015-TPRMI]Real-time Lexicon-free Scene Text Localization and Recognition[paper]
[2015-CVPR]Symmetry-Based Text Line Detection in Natural Scenes[paper][code]
[2015-ICCV]FASText: Efficient unconstrained scene text detector[paper][code]
[2015-D.PhilThesis] Deep Learning for Text Spotting [paper]
[2015 ICDAR]Object Proposals for Text Extraction in the Wild [paper] [code]
[2014-ECCV] DeepFeatures for Text Spotting [paper] [code] [model] [GitXiv]
[2014-TPAMI] WordSpotting and Recognition with Embedded Attributes [paper] [homepage] [code]
[2014-TPRMI]Robust Text Detection in Natural Scene Images[paper]
[2014-ECCV] RobustScene Text Detection with Convolution Neural Network Induced MSER Trees [paper]
[2012-CVPR]Real-time scene text localization and recognition[paper][code]
[2010-CVPR]DetectingText in Natural Scenes with Stroke Width Transform [paper] [code]
自然场景文字识别
[2016-AAAI]Reading Scene Text in Deep Convolutional Sequences [paper]
[2016-IJCV]Reading Text in the Wild with Convolutional Neural Networks [paper] [demo] [homepage]
[2016-CVPR]Recursive Recurrent Nets with Attention Modeling for OCR in the Wild [paper]
[2016-CVPR] RobustScene Text Recognition with Automatic Rectification [paper]
[2016-NIPs] Generative Shape Models: Joint Text Recognition and Segmentation with Very Little Training Data[paper]
[2015-CoRR] AnEnd-to-End Trainable Neural Network for Image-based Sequence Recognition andIts Application to Scene Text Recognition [paper] [code]
[2015-ICDAR]AutomaticScript Identification in the Wild[paper]
[2015-ICLR] Deepstructured output learning for unconstrained text recognition [paper]
[2014-NIPS]Synthetic Data and Artificial Neural Networks for Natural Scene TextRecognition [paperhomepage] [model]
[2014-TIP] A Unified Framework for Multi-Oriented Text Detection and Recognition [paper]
[2012-ICPR]End-to-End Text Recognition with Convolutional Neural Networks [paper] [code] [SVHN Dataset]
数据集
COCO-Text (ComputerVision Group, Cornell) 2016
63,686images, 173,589 text instances, 3 fine-grained text attributes.
Task:text location and recognition
COCO-Text API
Synthetic Data for Text Localisation in Natural Image (VGG)2016
800k thousand images
8 million synthetic word instances
download
Synthetic Word Dataset (Oxford, VGG) 2014
9million images covering 90k English words
Task:text recognition, segmentation
download
IIIT 5K-Words 2012
5000images from Scene Texts and born-digital (2k training and 3k testing images)
Eachimage is a cropped word image of scene text with case-insensitive labels
Task:text recognition
download
StanfordSynth(Stanford, AI Group) 2012
Smallsingle-character images of 62 characters (0-9, a-z, A-Z)
Task:text recognition
download
MSRA Text Detection 500 Database(MSRA-TD500) 2012
500natural images(resolutions of the images vary from 1296x864 to 1920x1280)
Chinese,English or mixture of both
Task:text detection
Street View Text (SVT) 2010
350high resolution images (average size 1260 × 860) (100 images for training and250 images for testing)
Onlyword level bounding boxes are provided with case-insensitive labels
Task:text location
KAIST Scene_Text Database 2010
3000images of indoor and outdoor scenes containing text
Korean,English (Number), and Mixed (Korean + English + Number)
Task:text location, segmentation and recognition
Chars74k 2009
Over74K images from natural images, as well as a set of synthetically generatedcharacters
Smallsingle-character images of 62 characters (0-9, a-z, A-Z)
Task:text recognition
ICDARBenchmark Datasets
Dataset
Discription
Competition Paper
ICDAR 2015
1000 training images and 500 testing images
paper
ICDAR 2013
229 training images and 233 testing images
paper
ICDAR 2011
229 training images and 255 testing images
paper
ICDAR 2005
1001 training images and 489 testing images
paper
ICDAR 2003
181 training images and 251 testing images(word level and character level)
paper
开源库
Tesseract: c++ based tools for documents analysis and OCR,support 60+ languages [code]
Ocropy:Python-based tools for document analysis and OCR [code]
CLSTM : A small C++ implementation of LSTM networks,focused on OCR [code]
ConvolutionalRecurrent Neural Network,Torch7 based [code]
Attention-OCR: Visual Attention based OCR [code]
Umaru: An OCR-system based on torch using the techniqueof LSTM/GRU-RNN, CTC and referred to the works of rnnlib and clstm [code]
其他
DeepFont:Identify Your Font from An Image[paper]
Writer-independentFeature Learning for Offline Signature Verification using Deep ConvolutionalNeural Networks[paper]
End-to-EndInterpretation of the French Street Name Signs Dataset [paper] [code]
Extractingtext from an image using Ocropus [blog]
手写字识别
[2016-Arvix]Drawingand Recognizing Chinese Characters with Recurrent Neural Network [paper]
LearningSpatial-Semantic Context with Fully Convolutional Recurrent Network for OnlineHandwritten Chinese Text Recognition [paper]
StrokeSequence-Dependent Deep Convolutional Neural Network for Online HandwrittenChinese Character Recognition [paper]
HighPerformance Offline Handwritten Chinese Character Recognition Using GoogLeNetand Directional Feature Maps [paper] [github]
DeepHCCR:Offline Handwritten Chinese Character Recognition based on GoogLeNet andAlexNet (With CaffeModel) [code]
如何用卷积神经网络CNN识别手写数字集?[blog][blog1][blog2] [blog4] [blog5] [code6]
Scan,Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTMAttention [paper]
MLPaint:the Real-Time Handwritten Digit Recognizer [blog][code][demo]
caffe-ocr: OCR with caffe deep learning framework [code] (单字分类器)
牌照等识别
ReadingCar License Plates Using Deep Convolutional Neural Networks and LSTMs [paper]
Numberplate recognition with Tensorflow [blog] [code]
end-to-end-for-plate-recognition[code]
ApplyingOCR Technology for Receipt Recognition[blog][mirror]
破解验证码
Usingdeep learning to break a Captcha system [blog] [code]
Breakingreddit captcha with 96% accuracy [blog] [code]
I'mnot a human: Breaking the Google reCAPTCHA [paper]
NeuralNet CAPTCHA Cracker [slides] [code] [demo]
Recurrentneural networks for decoding CAPTCHAS [blog] [code] [demo]
Readingirctc captchas with 95% accuracy using deep learning [code]
端到端的OCR:基于CNN的实现 [blog]
IAm Robot: (Deep) Learning to Break Semantic Image CAPTCHAs [paper]
参考
[1]http://handong1587.github.io/deep_learning/2015/10/09/ocr.html
[2]https://github.com/chongyangtao/Awesome-Scene-Text-Recognition
原文地址:http://blog.csdn.net/peaceinmind/article/details/51387367
- 文字检测与识别资源
- 文字检测与识别资源
- 文字检测与识别资源
- 文字检测与识别资源
- 文字检测与识别资源
- [转]文字检测与识别资源
- 文字的检测与识别资源
- 【总结】文字检测与识别资源
- 文字的检测与识别资源
- 文字检测、分割、识别可用资源
- 文字检测与文字识别开源程序
- 上百份文字的检测与识别资源,包含数据集、code和paper
- 文字检测与识别1-MSER
- 文字检测与识别2-字符过滤
- 文字检测与识别3-字符合并
- 文字检测与识别1-MSER
- 文字检测与识别1-MSER
- 文字检测与识别1-MSER
- Struts2异常处理
- 你不能错过的2016Java技术调查
- vaadin之UI组件
- Java面向对象(一)
- GCD介绍: 基本概念和Dispatch Queue
- 文字检测与识别资源
- iOS跳转到系统设置wifi界面
- FTP文件传输协议两种模式-主动模式和被动模式
- 队列(一)循环队列的线性存储
- Linux前台和后台任务的切换
- 【51单片机】童年回忆,咬手鲨鱼改装闹钟
- 【C】VC++ 6.0经典C语言学习小工具的使用
- 360杀死了mysql服务和tomcat
- java基础学习集合之Set 九-6