CMUSphinx Wiki

来源:互联网 发布:计算机图论算法 编辑:程序博客网 时间:2024/05/16 15:44

CMUSphinx Wiki

 

This page contains collaboratively developed documentation for the CMU Sphinx speech recognition engines.

此页面包含的是关于CMU Sphinx的语音识别引擎的合作开发文档。

 

Start User Documentation   开始用户文档

This section contains links to documents which describe how to use Sphinx to recognize speech.  Currently, we have very little in the way of end-user tools, so it may be a bit sparse for the  forseeable future.

CMUSphinx Tutorial For Developers: Getting started with CMUSphinx for developers

  • Basic concepts of speech
  • Overview of the CMUSphinx toolkit
  • Before you start
  • Building application using pocketsphinx
  • Building application using sphinx4
  • Building language models
  • Adapting existing acoustic model
  • Building the acoustic model
  • Building a dictionary
  • Using pocketsphinx on Android

本部分包含一些链接文件,这些文件描述了如何使用Sphinx来识别语音。目前,我们拥有的关于终端用户的工具很少,因此在可预见的将来,可能还是很少。

CMUSphinx 开发者教程:CMUSphinx 开发者入门指南

       语音基本概念

       CMUSphinx工具包概览

       开始之前

       使用pocketsphinx构建应用

       使用sphinx4构建应用

       构建语言模型

       自适应现有声学模型

       构建声学模型

       构建字典

       在Android上使用pocketsphinx

 

You are in trouble - read theFrequenty Asked Questions (FAQ)

      See also some more docs:

  • Decoder Versions: Description of the software packages
  • Download Details: How to obtain CMUSphinx packages
  • How to get help and discuss things: How to get help and discuss things
  • http://cmusphinx.sourceforge.net/doc/speech.ppt Cool presentation done byHeather Dewey-Hagborg

If you want to find out where CMUSphinx works, see

  • Projects that use Sphinx: These projects, both commercial and free, use Sphinx in one form or another

遇到麻烦 - 阅读 常见问题

      另见更多的文档:

      解码器版本:软件包说明

      下载详细信息:如何获得CMUSphinx包

      如何获取帮助和讨论

      http://cmusphinx.sourceforge.net/doc/speech.ppt Heather Dewey-Hagborg做的很酷的介绍

如果你想找出CMUSphinx在哪些地方应用,请参阅

      使用Sphinx的项目:这些项目以某种形式使用Sphinx,有商业和免费的两种

 

Advanced User Documentation

These documents either describe some particular aspect of the Sphinx codebase in detail, or they serve as a developer's guide to accomplishing some particular task.

  • Building on IPhone: Building Pocketsphinx on various platforms
  • Integrating CMUSphinx with Telephony Servers - Asterisk and Freeswitch: How to use pocketsphinx in Asterisk.
  • The Incomplete Guide to Sphinx-3 Performance Tuning: How to tune the decoder to be fast (or rather, not horribly slow)
  • Pocketsphinx optimizations for embedded devices.
  • Phoneme Recognition (caveat emptor): How to use Sphinx3 for phoneme recognition.
  • Segmentation and Diarization using LIUM tools: Using LIUM tools for speech segmentation and speaker diarization
  • Training an acoustic model with LDA and MLLT feature transforms: How to train acoustic models with LDA and MLLT feature transforms
  • Using PocketSphinx with GStreamer and Python (or Vala): How to use PocketSphinx withGStreamer and Python
  • InstallingPythonStuff: How to install Python and necessary modules for SphinxTrain development
  • MMIE Training in SphinxTrain: How to perform MMIE training.
  • http://www.speech.cs.cmu.edu/sphinx/tutorial.html Robust Group Tutorial (classic tutorial from CMU Speech Group website)

高级用户文档

      这些文档要么在Sphinx代码的某些特别的方面做了详细的说明,要么作为开发人员的指南,完成一些特定的任务。

      在Iphone上构建:在各种平台上构建Pocketsphinx

      电话服务集成CMUSphinx

      Sphinx-3性能调节的不完整指南:如何调整解码器变快(更确切的说,不是可怕的慢)      嵌入式设备优化pocketsphinx。

      音素识别(使用者当心):如何使用Sphinx3识别音素

      使用LIUM工具进行分段和分类:使用LIUM工具进行语音分段和说话人分类

      训练声学模型时使用LDA和MLLT做特征变换:如何使用LDA和MLLT特征变换来训练声学模型

      使用GStreamer和Python的PocketSphinx

      安装Python文件:如何安装Python和SphinxTrain开发的必要模块

      SphinxTrain中MMIE训练

 

Decoder Space

  • Sphinx4 Space : Information about sphinx4, design, code, performance, history.

Reference

These documents describe the excruciating detail of APIs, or provide other useful background information for CMUSphinx developers.

  • Doxygen documentation for PocketSphinx
  • Doxygen documentation for SphinxBase
  • ePyDoc documentation for SphinxTrain Python Modules
  • JavaDocs for Sphinx4

解码空间

      Sphinx4 空间:sphinx4、设计、代码、性能、历史信息

参考

     这些文档有的说明难以忍受的APIs细节,有的为CMUSphinx开发人员提供有用的背景信息

     PocketSphinx的文档生成器

     SphinxBase的文档成器

     SphinxTrain Python模块的ePyDoc文档

     Sphinx4 的JavaDocs

 

Developer Documentation

This section contains various internal information for CMUSphinx developers. But we hope it will be still usable for you.

  • Sphinx-4 Regression Tests: How to run regression tests
  • Layout of SphinxTrain code: An overview of the SphinxTrain source code for researchers and developers
  • CMUCLMTK development: Development guide for the CMU-Cambridge Language Modeling Toolkit.
  • Language Features for SphinxBase, SphinxThree, and SphinxTrain
  • Upcoming CMU Sphinx Software Releases: Plans for upcoming releases of Sphinx
  • Release Check List: How to make a release
  • Web Site Layout: How to organize information

File formats

  • Acoustic Model Format
  • MFC files

Data sources:

  • Data Sources

Materials for GSOC

  • Information for Students: Students information
  • Tasks for Summer Of Code Projects: Ideas for students

GSoc Previous years

  • Google Summer of Code 2012 Projects: Google Summer of Code 2012 Projects

 

开发者文档

       本节包含为CMUSphinx开发者提供的各种内部信息,但是,我们希望这将对你仍然有用。

      Sphinx-4的回归测试:如何执行回归测试

      SphinxTrain的代码布局:

      CMUCLMTK开发:

      SphinxBase, SphinxThree, and SphinxTrain的语言特征

      即将发布的CMU Sphinx软件

      发布清单:

      网站布局:

文件格式:

     声学模型格式

     MFC文件

数据源

     数据源

GSOC资料

     学生信息

     夏季代码工程的任务

GSOC往年

     google 2012年的夏季代码工程

 

Speech Recognition Theory

    This section tries to collect research ideas for specific problems in speech recognition

    • Lattices
    • WFST
    • Search Algorithms
    • Language Models
    • Features
    • Noise Robustness
    • Adaptation

    语音识别理论:

    本节试图收集语音识别的具体问题的研究思路

          网格

          WFST

          搜索算法

          语言模型

          特征

          噪音鲁棒性

          自适应

原创粉丝点击