Detailed steps for training thchs30 with Kaldi

Source: Internet. Editor: 程序博客网. Date: 2024/05/16 10:59

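1. Download the source code from https://github.com/kaldi-asr.

2. Go into the thchs30/s5 directory and edit the data path in run.sh. I chose to download the data directly, since I did not have the corpus beforehand.

3. Edit cmd.sh: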

export train_cmd=run.pl
export decode_cmd=run.pl
export mkgraph_cmd=run.pl

export cuda_cmd=run.pl

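4. Run sh run.sh

5. Wait for training to finish. Some errors may show up during training. At first I worried that the data had not downloaded completely, but once the test results came out later I found that these errors were harmless.

6. Go back to the src directory and run: make ext. This step also took four or five hours. I ran out of patience and went straight to testing before it finished, and that did not seem to cause any problems.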

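Once it has run, an onlinebin directory appears under src. It contains online-wav-gmm-decode-faster and online-gmm-decode-faster: the former decodes wav files, the latter decodes audio from a microphone.

I did not test recognition with microphone input, so I will not go into it here.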
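Because make ext may be interrupted before it finishes (as happened here), it can help to confirm which decoders were actually built. The following is a minimal sketch, not part of Kaldi: check_online_bins is a hypothetical helper name, and the Kaldi root path passed to it is a placeholder.

```shell
#!/bin/sh
# Sketch: report whether the two online GMM decoders exist and are executable.
# check_online_bins is a hypothetical helper; $1 is the root of the Kaldi checkout.
check_online_bins() {
    kaldi_root="$1"
    missing=0
    for bin in online-wav-gmm-decode-faster online-gmm-decode-faster; do
        if [ -x "$kaldi_root/src/onlinebin/$bin" ]; then
            echo "found:   $bin"
        else
            echo "missing: $bin"
            missing=1
        fi
    done
    # Exit status 0 only when both binaries are present.
    return "$missing"
}

# Usage (the path is a placeholder):
# check_online_bins "$HOME/kaldi"
```

For the wav-file test described in this post, only online-wav-gmm-decode-faster needs to be present.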

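7. Copy online_demo from voxforge to egs/thchs30/.

8. Go into egs/thchs30/online_demo/ and create the directories online-data and work under it. Under online-data create the directories audio and models, and under models create the directory tri1.

Put the wav files you want to play back through the decoder into audio. Copy final.mdl and 35.mdl from s5/exp/tri1 into models/tri1 (final.mdl is a symlink). Also copy words.txt and HCLG.fst from s5/exp/tri1/graph_word into models/tri1.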
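The directory layout and copies from step 8 can be scripted. This is a sketch under assumptions: setup_online_data is a hypothetical helper name, and both paths are supplied by the caller. It copies final.mdl with cp -L, which follows the symlink and writes a regular file, so this variant does not copy 35.mdl separately.

```shell
#!/bin/sh
# Sketch: build the online-data layout and copy the tri1 model and graph.
# setup_online_data is a hypothetical helper.
#   $1 = path to egs/thchs30/s5
#   $2 = path to egs/thchs30/online_demo
setup_online_data() {
    s5_dir="$1"
    demo_dir="$2"
    model_dir="$demo_dir/online-data/models/tri1"

    mkdir -p "$demo_dir/online-data/audio" "$model_dir" "$demo_dir/work" || return 1

    # -L dereferences the final.mdl symlink so a regular file is copied.
    cp -L "$s5_dir/exp/tri1/final.mdl" "$model_dir/" || return 1

    # Decoding graph and word symbol table built during training.
    cp "$s5_dir/exp/tri1/graph_word/words.txt" \
       "$s5_dir/exp/tri1/graph_word/HCLG.fst" "$model_dir/" || return 1
}

# Usage, from egs/thchs30 (paths are placeholders):
# setup_online_data ./s5 ./online_demo
```

The wav files still have to be placed in online-data/audio by hand.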

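9. Edit run.sh under online_demo

a) Comment out the following block. (In the voxforge example this block downloads the test audio and recognition models from the web. Here we supply our own test audio, and the model is tri1.)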

if [ ! -s ${data_file}.tar.bz2 ]; then
    echo "Downloading test models and data ..."
    wget -T 10 -t 3 $data_url;

    if [ ! -s ${data_file}.tar.bz2 ]; then
        echo "Download of $data_file has failed!"
        exit 1
    fi
fi

b) Then find the following line and change the model name to tri1

# Change this to "tri2a" if you like to test using a ML-trained model
ac_model_type=tri2b_mmi

# Alignments and decoding results

---------------------------------------------------------------

Change it to:

# Change this to "tri2a" if you like to test using a ML-trained model
ac_model_type=tri1

c) The command below is only an excerpt. Do not delete anything from the original command; just replace the model file name.

online-wav-gmm-decode-faster --verbose=1 --rt-min=0.8 --rt-max=0.85 \
    --max-active=4000 --beam=12.0 --acoustic-scale=0.0769 \
    scp:$decode_dir/input.scp $ac_model/model

---------------------------------------------------------------

Change it to:

online-wav-gmm-decode-faster --verbose=1 --rt-min=0.8 --rt-max=0.85 \
    --max-active=4000 --beam=12.0 --acoustic-scale=0.0769 \
    scp:$decode_dir/input.scp $ac_model/final.mdl
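The three edits in step 9 can also be applied with sed instead of by hand. This is a rough sketch, not a tested recipe for every version of the script: patch_demo_run_sh is a hypothetical helper name, and it assumes run.sh still has its original indentation, so that the outer download block starts with an unindented "if [ ! -s" and ends at the first unindented "fi". A backup is written next to the script with a .bak suffix.

```shell
#!/bin/sh
# Sketch: apply edits a), b) and c) to the online_demo run.sh in place.
# patch_demo_run_sh is a hypothetical helper; $1 is the path to run.sh.
# Assumption: the inner "fi" is indented and the outer "fi" sits at column 0.
patch_demo_run_sh() {
    script="$1"
    # a) comment out the download block
    # b) select the tri1 model directory
    # c) use final.mdl wherever the script refers to $ac_model/model
    sed -E -i.bak \
        -e '/^if \[ ! -s /,/^fi$/ s/^/#/' \
        -e 's/^ac_model_type=.*/ac_model_type=tri1/' \
        -e 's#\$ac_model/model([[:space:]]|$)#$ac_model/final.mdl\1#g' \
        "$script"
}

# Usage, from egs/thchs30/online_demo:
# patch_demo_run_sh ./run.sh && diff run.sh.bak run.sh
```

Reviewing the diff against run.sh.bak before running is worthwhile, since a file with different indentation would make the first expression comment out more or less than intended.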

10. Put an empty trans.txt in audio. Alternatively, wait until the next step has generated one in work and copy that in.

11. Run the command sh run.sh --test-mode simulated
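Before launching the decode it can be useful to check that the inputs from steps 8 and 10 are in place. The sketch below assumes the directory layout described in this post; preflight_online_demo is a hypothetical helper name and is not part of the demo scripts.

```shell
#!/bin/sh
# Sketch: verify the demo inputs before decoding.
# preflight_online_demo is a hypothetical helper; $1 is the online_demo directory.
preflight_online_demo() {
    demo_dir="$1"
    audio_dir="$demo_dir/online-data/audio"
    model_dir="$demo_dir/online-data/models/tri1"

    # The model, decoding graph and word list must exist and be non-empty.
    for f in final.mdl HCLG.fst words.txt; do
        if [ ! -s "$model_dir/$f" ]; then
            echo "missing or empty: $model_dir/$f" >&2
            return 1
        fi
    done

    # At least one wav file must be present.
    if ! ls "$audio_dir"/*.wav >/dev/null 2>&1; then
        echo "no wav files in $audio_dir" >&2
        return 1
    fi

    # Step 10: create an empty trans.txt if there is none yet.
    [ -e "$audio_dir/trans.txt" ] || : > "$audio_dir/trans.txt"
}

# Usage, from egs/thchs30/online_demo:
# preflight_online_demo . && sh run.sh --test-mode simulated
```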

12. Results

That covers the basic steps. Further, more in-depth exploration is to be continued...

Reference: http://m.blog.csdn.net/lijin6249/article/details/51838936
