CMUSphinx Wiki

来源：互联网发布：计算机图论算法编辑：程序博客网时间：2024/05/16 15:44

CMUSphinx Wiki

This page contains collaboratively developed documentation for the CMU Sphinx speech recognition engines.

此页面包含的是关于CMU Sphinx的语音识别引擎的合作开发文档。

Start User Documentation 开始用户文档

This section contains links to documents which describe how to use Sphinx to recognize speech. Currently, we have very little in the way of end-user tools, so it may be a bit sparse for the forseeable future.

CMUSphinx Tutorial For Developers: Getting started with CMUSphinx for developers

Basic concepts of speech
Overview of the CMUSphinx toolkit
Before you start
Building application using pocketsphinx
Building application using sphinx4
Building language models
Adapting existing acoustic model
Building the acoustic model
Building a dictionary
Using pocketsphinx on Android

本部分包含一些链接文件，这些文件描述了如何使用Sphinx来识别语音。目前，我们拥有的关于终端用户的工具很少，因此在可预见的将来，可能还是很少。

CMUSphinx 开发者教程：CMUSphinx 开发者入门指南

语音基本概念

CMUSphinx工具包概览

开始之前

使用pocketsphinx构建应用

使用sphinx4构建应用

构建语言模型

自适应现有声学模型

构建声学模型

构建字典

在Android上使用pocketsphinx

You are in trouble - read theFrequenty Asked Questions (FAQ)

Decoder Space

Sphinx4 Space : Information about sphinx4, design, code, performance, history.

Reference

These documents describe the excruciating detail of APIs, or provide other useful background information for CMUSphinx developers.

Doxygen documentation for PocketSphinx
Doxygen documentation for SphinxBase
ePyDoc documentation for SphinxTrain Python Modules
JavaDocs for Sphinx4

解码空间

Sphinx4 空间：sphinx4、设计、代码、性能、历史信息

参考

这些文档有的说明难以忍受的APIs细节，有的为CMUSphinx开发人员提供有用的背景信息

PocketSphinx的文档生成器

SphinxBase的文档成器

SphinxTrain Python模块的ePyDoc文档

Sphinx4 的JavaDocs

Developer Documentation

This section contains various internal information for CMUSphinx developers. But we hope it will be still usable for you.

Sphinx-4 Regression Tests: How to run regression tests
Layout of SphinxTrain code: An overview of the SphinxTrain source code for researchers and developers
CMUCLMTK development: Development guide for the CMU-Cambridge Language Modeling Toolkit.
Language Features for SphinxBase, SphinxThree, and SphinxTrain
Upcoming CMU Sphinx Software Releases: Plans for upcoming releases of Sphinx
Release Check List: How to make a release
Web Site Layout: How to organize information

File formats

Acoustic Model Format
MFC files

Data sources:

Data Sources

Materials for GSOC

Information for Students: Students information
Tasks for Summer Of Code Projects: Ideas for students

GSoc Previous years

Google Summer of Code 2012 Projects: Google Summer of Code 2012 Projects

开发者文档

本节包含为CMUSphinx开发者提供的各种内部信息，但是，我们希望这将对你仍然有用。

Sphinx-4的回归测试：如何执行回归测试

SphinxTrain的代码布局：

CMUCLMTK开发：

SphinxBase, SphinxThree, and SphinxTrain的语言特征

即将发布的CMU Sphinx软件

发布清单：

网站布局：

文件格式：

声学模型格式

MFC文件

数据源

GSOC资料

学生信息

夏季代码工程的任务

GSOC往年

google 2012年的夏季代码工程

Speech Recognition Theory

This section tries to collect research ideas for specific problems in speech recognition

Lattices
WFST
Search Algorithms
Language Models
Features
Noise Robustness
Adaptation

语音识别理论：

本节试图收集语音识别的具体问题的研究思路

网格

WFST

搜索算法

语言模型

特征

噪音鲁棒性

自适应