Setting up a Python image recognition environment on Debian

Source: Internet  Editor: 程序博客网  Time: 2024/06/08 14:56

1. Install the base packages

apt-get install libjpeg-dev libpng-dev 'libtiff*' gcc automake libtool python-imaging
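
After installing the packages, it can help to confirm that the build tools are actually on the PATH before compiling anything. The sketch below is only an illustration: the helper name missing_tools is made up for this example, and the tool list is an assumption (on Debian the libtool package provides the libtoolize command, which is why that name is checked).

```python
import shutil


def missing_tools(tools):
    """Return the names in `tools` that cannot be found on the PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


if __name__ == "__main__":
    # Assumed tool list for illustration; adjust it to what your build needs.
    missing = missing_tools(["gcc", "automake", "libtoolize"])
    if missing:
        print("missing build tools: " + ", ".join(missing))
    else:
        print("all listed build tools were found")
```

Anything reported as missing can be installed with apt-get before moving on.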

2. Install leptonica

wget http://www.leptonica.com/source/leptonica-1.72.tar.gz
tar -zxvf leptonica-1.72.tar.gz  
cd leptonica-1.72
./configure && make && make install

3. Compile tesseract. The version used here is 3.04.

wget https://github.com/tesseract-ocr/tesseract/archive/3.04.00.tar.gz
tar -zxvf 3.04.00.tar.gz  
cd tesseract-3.04.00/
# If a prerequisite is missing, the build fails and the error message names the missing package; install it with apt-get as prompted.
./autogen.sh
./configure
make && make install
apt-get install tesseract-ocr-eng tesseract-ocr-chi-sim

ldconfig

cd /usr/local/share/tessdata/
wget https://github.com/tesseract-ocr/tessdata/raw/master/eng.traineddata
wget https://github.com/tesseract-ocr/tessdata/raw/master/chi_sim.traineddata
wget https://github.com/tesseract-ocr/tessdata/raw/master/chi_tra.traineddata
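
Before testing, it may be worth checking that the language files really landed in the tessdata directory, since a failed download tends to surface only later as a confusing tesseract error. This is a minimal sketch; the function name missing_traineddata is hypothetical, and the directory is the /usr/local/share/tessdata path used above.

```python
import os


def missing_traineddata(tessdata_dir, langs):
    """Return the language codes whose <lang>.traineddata file is absent."""
    missing = []
    for lang in langs:
        path = os.path.join(tessdata_dir, lang + ".traineddata")
        if not os.path.isfile(path):
            missing.append(lang)
    return missing


if __name__ == "__main__":
    # Directory and language codes are taken from the steps above.
    print(missing_traineddata("/usr/local/share/tessdata",
                              ["eng", "chi_sim", "chi_tra"]))
```

An empty list means all three files are in place.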

At this point you can take an image and check whether the text can be extracted:

tesseract 11.jpg bbb  -psm 3 -l chi_sim+eng

cat bbb.txt
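
The same test can be driven from Python by calling the tesseract binary with subprocess. This is a rough sketch rather than a finished wrapper: the function names build_tesseract_cmd and ocr_to_text are invented for this example, and the single-dash -psm flag follows the 3.04 command shown above (it is assumed here that the installed version accepts that spelling).

```python
import shutil
import subprocess


def build_tesseract_cmd(image, out_base, langs, psm=3):
    """Build the argument list for: tesseract IMAGE OUT -psm N -l a+b."""
    return ["tesseract", image, out_base,
            "-psm", str(psm), "-l", "+".join(langs)]


def ocr_to_text(image, out_base, langs, psm=3):
    """Run tesseract and return the contents of OUT.txt."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract was not found on the PATH")
    subprocess.check_call(build_tesseract_cmd(image, out_base, langs, psm))
    # tesseract appends ".txt" to the output base name by itself.
    with open(out_base + ".txt", encoding="utf-8") as fh:
        return fh.read()


if __name__ == "__main__":
    # Same command as the manual test above.
    print(" ".join(build_tesseract_cmd("11.jpg", "bbb", ["chi_sim", "eng"])))
```

Calling ocr_to_text("11.jpg", "bbb", ["chi_sim", "eng"]) would then do the equivalent of the two commands above in one step.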


4. pytesser needs no installation: download it, unpack it, and run the tests from inside the unpacked folder.

http://download.csdn.net/download/pyliang_2008/5564135
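
A test inside the unpacked folder might look like the sketch below. It assumes the archive contains a pytesser.py module exposing image_file_to_string(), as the original pytesser release did; the download above may differ, so treat the import as an assumption. The wrapper name ocr_file is made up for this example, and pytesser itself was written for Python 2.

```python
import os


def ocr_file(path):
    """Return the text recognized in the image at `path` via pytesser."""
    if not os.path.isfile(path):
        raise IOError("image not found: %s" % path)
    try:
        # Assumption: pytesser.py sits in the current (unpacked) folder
        # and provides image_file_to_string().
        from pytesser import image_file_to_string
    except (ImportError, SyntaxError) as exc:
        raise RuntimeError("pytesser could not be imported: %s" % exc)
    return image_file_to_string(path)
```

Running ocr_file("11.jpg") from that folder should return the recognized text as a string if both pytesser and tesseract are set up as described.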


