用yolo区分文字和模糊图像

来源:互联网 发布:mac照片无法访问iphone 编辑:程序博客网 时间:2024/06/05 16:32

想不到深度学习如此强大, 为了区分文字和模糊图像,仅用了160张图像,迭代2000次,就可以较好地区分出来了:

绿色框表示模糊,粉红色框表示文字。

如果是用传统方式编程序实现,难度太大了。


不过也有很多分类错的,误识别为模糊的:



观察一下发现蓝天和婴儿误分类为模糊,那就新增2个分类,baby和sky,果然模糊的错误分类减少了:


cfg/yolo-tiny-text-blur.cfg

[net]batch=64subdivisions=64height=448width=448channels=3momentum=0.9decay=0.0005learning_rate=0.0001policy=stepssteps=20,40,60,80,20000,30000scales=5,5,2,2,.1,.1max_batches = 40000[crop]crop_width=448crop_height=448flip=0angle=0saturation = 1.5exposure = 1.5[convolutional]filters=16size=3stride=1pad=1activation=leaky[maxpool]size=2stride=2[convolutional]filters=32size=3stride=1pad=1activation=leaky[maxpool]size=2stride=2[convolutional]filters=64size=3stride=1pad=1activation=leaky[maxpool]size=2stride=2[convolutional]filters=128size=3stride=1pad=1activation=leaky[maxpool]size=2stride=2[convolutional]filters=256size=3stride=1pad=1activation=leaky[maxpool]size=2stride=2[convolutional]filters=512size=3stride=1pad=1activation=leaky[maxpool]size=2stride=2[convolutional]filters=1024size=3stride=1pad=1activation=leaky[convolutional]filters=1024size=3stride=1pad=1activation=leaky[convolutional]filters=1024size=3stride=1pad=1activation=leaky[connected]output=256activation=linear[connected]output=4096activation=leaky[dropout]probability=.5[connected]output=686activation=linear[detection]classes=4coords=4rescore=1side=7num=2softmax=0sqrt=1jitter=.2object_scale=1noobject_scale=.5class_scale=1coord_scale=5





0 0
原创粉丝点击