用python写的opencv实时监测和解析二维码和条形码最牛教程

来源：互联网发布：淘宝会员等级没了2017 编辑：程序博客网时间：2024/05/09 14:04

今天，我实现了一个很有趣的demo，它可以在视频里找到并解析二维码，然后把解析的内容实时在屏幕上显示出来。

然后我们直入主题，首先你得确保你装了opencv,python,zbar等环境。然后这个教程对于学过opencv的人可能更好理解，但是没学过也无妨，到时候也可以直接用。

比如我的电脑上的环境是opencv2.4.x,python2.7,和最新的zbar,在Ubuntu 12.12的系统下运行的

假设你的opencv已经安装好了，那么我们就可以安装zbar

你可以先更新一下

sudo apt-get update

然后在输入

sudo apt-get install python-zbar

如果环境装好了，我们就可以接着下一步操作了。

首先让我们来实现找到在图片里面找到二维码的功能

先新建一个python文件叫做;simple_barcode_detection.py

代码如下，这定义了一个函数，实现从一副图片里面找出二维码的位置

我们要检测的二维码的图片

import numpy as np
import cv2

def detect(image):
# 把图像从RGB装换成灰度图
gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)

# 使用Scharr操作（指定使用ksize = -1）构造灰度图在水平和竖直方向上的梯度幅值表示。
gradX = cv2.Sobel(gray, ddepth = cv2.cv.CV_32F, dx = 1, dy = 0, ksize = -1)
gradY = cv2.Sobel(gray, ddepth = cv2.cv.CV_32F, dx = 0, dy = 1, ksize = -1)

#Scharr操作后，从x的梯度减去y的梯度
gradient = cv2.subtract(gradX, gradY)
gradient = cv2.convertScaleAbs(gradient)

#经过上面的操作后看起来是这样

# 对上述的梯度图采用用9x9的核进行平均模糊,这是有利于降噪的

#然后进行二值化处理，要么是255(白)要么是0(黑)

blurred = cv2.blur(gradient, (9, 9))
(_, thresh) = cv2.threshold(blurred, 225, 255, cv2.THRESH_BINARY)

#模糊与二值化处理后，看起来是这个样子，

#上面的操作后发现有一些条形码的竖杠之间存在一些缝隙，并使我们的算法能检测到斑点区域，我们进行一些基本的形态学操作
#我们首先使用cv2.getStructuringElement构造一个长方形内核。这个内核的宽度大于长度，因此我们可以消除条形码中垂直条之间的缝隙。
# 这里进行形态学操作，将上一步得到的内核应用到我们的二值图中，以此来消除竖杠间的缝隙。
kernel = cv2.getStructuringElement(cv2.MORPH_RECT, (21, 7))
closed = cv2.morphologyEx(thresh, cv2.MORPH_CLOSE, kernel)

#上述操作后看起来是这个样子

#我们发现还是存在一些小斑点，这时我们可以使用opencv里面的腐蚀和膨胀来处理他们，来去除白色的斑点

closed = cv2.erode(closed, None, iterations = 4)
closed = cv2.dilate(closed, None, iterations = 4)

#这时候的效果看起来是这样的

# 接下来我们来找轮廓
(cnts, _) = cv2.findContours(closed.copy(), cv2.RETR_EXTERNAL,
cv2.CHAIN_APPROX_SIMPLE)

#如果没找到则返回为空
if len(cnts) == 0:
return None

#找到了就通过面积来排序，并计算旋转角
# 给最大的轮廓找到边框
c = sorted(cnts, key = cv2.contourArea, reverse = True)[0]
rect = cv2.minAreaRect(c)

#box(里面是ndarry数组，包含了4个顶点的位置)
box = np.int0(cv2.cv.BoxPoints(rect))

# 返回box
return box

最终结果

好了，上面的解释里面有中文，可能python解释的时候会通不过，我下面直接给出代码

import numpy as npimport cv2def detect(image):# convert the image to grayscalegray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)# compute the Scharr gradient magnitude representation of the images# in both the x and y directiongradX = cv2.Sobel(gray, ddepth = cv2.cv.CV_32F, dx = 1, dy = 0, ksize = -1)gradY = cv2.Sobel(gray, ddepth = cv2.cv.CV_32F, dx = 0, dy = 1, ksize = -1)# subtract the y-gradient from the x-gradientgradient = cv2.subtract(gradX, gradY)gradient = cv2.convertScaleAbs(gradient)# blur and threshold the imageblurred = cv2.blur(gradient, (9, 9))(_, thresh) = cv2.threshold(blurred, 225, 255, cv2.THRESH_BINARY)# construct a closing kernel and apply it to the thresholded imagekernel = cv2.getStructuringElement(cv2.MORPH_RECT, (21, 7))closed = cv2.morphologyEx(thresh, cv2.MORPH_CLOSE, kernel)# perform a series of erosions and dilationsclosed = cv2.erode(closed, None, iterations = 4)closed = cv2.dilate(closed, None, iterations = 4)# find the contours in the thresholded image(cnts, _) = cv2.findContours(closed.copy(), cv2.RETR_EXTERNAL,cv2.CHAIN_APPROX_SIMPLE)# if no contours were found, return Noneif len(cnts) == 0:return None# otherwise, sort the contours by area and compute the rotated# bounding box of the largest contourc = sorted(cnts, key = cv2.contourArea, reverse = True)[0]rect = cv2.minAreaRect(c)box = np.int0(cv2.cv.BoxPoints(rect))# return the bounding box of the barcodereturn box

完成了上述的工作，我们就完成了二维码和条形码的定位，接下去实现视频里面二维码的解析

你可以新建一个python文件，barcode_vid.py

解析二维码我们需要用zbar这个模块和PIL，PIL在python里面装好了

我们先导入模块

# import the necessary packages
import simple_barcode_detection
import cv2
import numpy as np
import zbar
from PIL import Image

#接下去是创建一个扫描器,他可以解析二维码的内容

# create a reader
scanner = zbar.ImageScanner()
# configure the reader
scanner.parse_config('enable')

#设置屏幕显示字体

font=cv2.FONT_HERSHEY_SIMPLEX

#启用摄像头

camera=cv2.VideoCapture(0)

#接下去是一个大的while循环

while True:

#得到当前的帧
# grab the current frame
(grabbed, frame) = camera.read()

#检测视频是否到底了，如果检测视频文件里面的二维码或条形码用这个，如果开启摄像头就无所谓了
# check to see if we have reached the end of the
# video
if not grabbed:
break

调用刚才我们建的那个函数来查找二维码返回二维码的位置
# detect the barcode in the image
box = simple_barcode_detection.detect(frame)
if box != None:

#这下面的3步得到扫描区域，扫描区域要比检测出来的位置要大
min=np.min(box,axis=0)
max=np.max(box,axis=0)

roi=frame[min[1]-10:max[1]+10,min[0]-10:max[0]+10]

#把区域里的二维码传换成RGB，并把它转换成pil里面的图像，因为zbar得调用pil里面的图像，而不能用opencv的图像
roi=cv2.cvtColor(roi,cv2.COLOR_BGR2RGB)
pil= Image.fromarray(frame).convert('L')
width, height = pil.size
raw = pil.tostring()

#把图像装换成数据
zarimage = zbar.Image(width, height, 'Y800', raw)

#扫描器进行扫描
scanner.scan(zarimage)

#得到结果
for symbol in zarimage:
# 对结果进行一些有用的处理
print 'decoded', symbol.type, 'symbol', '"%s"' %symbol.data
cv2.drawContours(frame, [box], -1, (0, 255, 0), 2)

#把解析的内容放到视频上
cv2.putText(frame,symbol.data,(20,100),font,1,(0,255,0),4)

# show the frame and record if the user presses a key
cv2.imshow("Frame", frame)
key = cv2.waitKey(1) & 0xFF

# if the 'q' key is pressed, stop the loop
if key == ord("q"):
break

# cleanup the camera and close any open windows
camera.release()
cv2.destroyAllWindows()

CSDN不能上传视频，我下面传一下图片

下面还是上源码

# import the necessary packages
import simple_barcode_detection
import cv2
import numpy as np
import zbar
from PIL import Image

# create a reader
scanner = zbar.ImageScanner()
# configure the reader
scanner.parse_config('enable')
font=cv2.FONT_HERSHEY_SIMPLEX
camera=cv2.VideoCapture(0)
while True:
# grab the current frame
(grabbed, frame) = camera.read()

# check to see if we have reached the end of the
# video
if not grabbed:
break

# detect the barcode in the image
box = simple_barcode_detection.detect(frame)
if box != None:
min=np.min(box,axis=0)
max=np.max(box,axis=0)
roi=frame[min[1]-10:max[1]+10,min[0]-10:max[0]+10]
print roi.shape
roi=cv2.cvtColor(roi,cv2.COLOR_BGR2RGB)
pil= Image.fromarray(frame).convert('L')
width, height = pil.size
raw = pil.tostring()
# wrap image data
zarimage = zbar.Image(width, height, 'Y800', raw)
# scan the image for barcodes
scanner.scan(zarimage)
# extract results
for symbol in zarimage:
# do something useful with results
print 'decoded', symbol.type, 'symbol', '"%s"' %symbol.data
cv2.drawContours(frame, [box], -1, (0, 255, 0), 2)
cv2.putText(frame,symbol.data,(20,100),font,1,(0,255,0),4)
# if a barcode was found, draw a bounding box on the frame

# show the frame and record if the user presses a key
cv2.imshow("Frame", frame)
key = cv2.waitKey(1) & 0xFF

# if the 'q' key is pressed, stop the loop
if key == ord("q"):
break

# cleanup the camera and close any open windows
camera.release()
cv2.destroyAllWindows()

我明明可以靠脸吃饭，偏偏要靠才华！
求点赞！

3 0

用python写的opencv实时监测和解析二维码和条形码最牛教程

我明明可以靠脸吃饭，偏偏要靠才华！求点赞！

我明明可以靠脸吃饭，偏偏要靠才华！
求点赞！