Voice input

Source: Internet. Editor: 程序博客网. Time: 2024/04/28 22:56
    Voice is one of the three key forms of input on HoloLens. It allows you to directly command a hologram without having to use gestures: you simply gaze at a hologram and speak your command. Voice input can be a natural way to communicate your intent. It is especially good at traversing complex interfaces because it lets users cut through nested menus with one command.
    Voice input is powered by the same engine that supports speech in all other Universal Windows Apps.

    Contents
    • 1 The "select" command
    • 2 Hey Cortana
    • 3 "See It, Say It"
    • 4 Voice commands for fast Hologram Manipulation
    • 5 Dictation
    • 6 Communication
    • 7 Troubleshooting
    • 8 See also

    The "select" command

    Even without specifically adding voice support to your app, your users can activate your holograms simply by saying "select". This behaves the same as a press and release with your hand or a clicker. You will hear a sound and see a tooltip with "select" appear as confirmation. "Select" is enabled by a low-power keyword detection algorithm, so you can say it at any time with minimal impact on battery life, even with your hands at your side.



    Hey Cortana 
    You can also say "Hey Cortana" to bring up Cortana at any time. You don't have to wait for her to appear before asking your question or giving an instruction. For example, try saying "Hey Cortana what's the weather?" as a single sentence. For more information about Cortana and what you can do, simply ask her: say "Hey Cortana what can I say?" and she'll pull up a list of working and suggested commands. If you're already in the Cortana app, you can also click the ? icon on the sidebar to pull up the same menu.
    HoloLens-specific commands
    • What can I say?
    • Go home | Go to Start (instead of using bloom to get to the Start menu)
    • Launch <app>
    • Move <app> here
    • Take a picture
    • Start recording
    • Stop recording
    • Increase the brightness
    • Decrease the brightness
    • Increase the volume
    • Decrease the volume
    • Mute | Unmute
    • Shut down the device
    • Restart the device
    • Go to sleep
    • What time is it?
    • How much battery do I have left?
    • Call <contact> (requires HoloSkype)
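
    The command list above mixes fixed phrases, alternatives separated by "|", and slots such as <app>. As a rough illustration of that notation only, the Python sketch below matches already-recognized text against a few of these patterns. It is not how Cortana or the Windows speech engine works internally; the names COMMAND_PATTERNS, compile_patterns and match_command are hypothetical.

```python
import re

# Hypothetical subset of the command patterns from the list above.
COMMAND_PATTERNS = [
    "Go home | Go to Start",
    "Launch <app>",
    "Move <app> here",
    "Mute | Unmute",
    "Call <contact>",
]


def compile_patterns(patterns):
    """Expand each 'A | B' pattern into one (phrase, regex) pair per alternative."""
    compiled = []
    for pattern in patterns:
        for alternative in pattern.split("|"):
            alternative = alternative.strip()
            # Split into literal text and <slot> markers, keeping the markers.
            parts = re.split(r"(<\w+>)", alternative)
            body = "".join(
                f"(?P<{part[1:-1]}>.+)" if part.startswith("<") else re.escape(part)
                for part in parts
            )
            compiled.append((alternative, re.compile(f"^{body}$", re.IGNORECASE)))
    return compiled


COMPILED = compile_patterns(COMMAND_PATTERNS)


def match_command(text):
    """Return (matched phrase, slot values), or None if no pattern matches."""
    cleaned = " ".join(text.split())  # collapse repeated whitespace
    for phrase, regex in COMPILED:
        match = regex.match(cleaned)
        if match:
            return phrase, match.groupdict()
    return None
```

    With these assumptions, match_command("launch Photos") yields the "Launch <app>" phrase with the slot app set to "Photos", and unrecognized text yields None.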

    "See It, Say It"  

    HoloLens has a "see it, say it" model for voice input, where the labels on buttons also tell users which voice commands they can say. For example, when looking at a 2D app, a user can say the "Adjust" command, which they see in the App bar, to adjust the position of the app in the world.

    When apps follow this rule, users can easily understand what to say to control the system. To reinforce this, when you gaze at a voice-enabled button, a "voice dwell" tooltip comes up after a second and displays the command to speak to "press" it.
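
    The dwell behavior described above can be sketched as a small timer that resets whenever the gaze target changes. This is a hedged illustration in Python, not the HoloLens shell's implementation: Button, VoiceDwellTracker and the exact 1.0 second threshold are assumptions (the text only says "after a second").

```python
from dataclasses import dataclass

DWELL_SECONDS = 1.0  # assumption: the text only says "after a second"


@dataclass
class Button:
    label: str                  # the visible label doubles as the voice command
    voice_enabled: bool = True


class VoiceDwellTracker:
    """Tracks how long the gaze has rested on one button (hypothetical helper)."""

    def __init__(self, dwell_seconds=DWELL_SECONDS):
        self.dwell_seconds = dwell_seconds
        self._target = None
        self._gaze_start = 0.0

    def update(self, gazed_button, now):
        """Feed the currently gazed button (or None) and a timestamp in seconds.

        Returns the command to show in the tooltip, or None if no tooltip
        should be visible yet.
        """
        if gazed_button is not self._target:
            # Gaze moved to a different target: restart the dwell timer.
            self._target = gazed_button
            self._gaze_start = now
        if gazed_button is None or not gazed_button.voice_enabled:
            return None
        if now - self._gaze_start >= self.dwell_seconds:
            return gazed_button.label
        return None
```

    Calling update once per frame with the current gaze target keeps the tooltip hidden until the gaze has stayed on the same voice-enabled button for the full dwell time.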

    Voice commands for fast Hologram Manipulation

    There are also a number of voice commands you can say while gazing at a hologram to quickly perform manipulation tasks. These voice commands work on 2D apps as well as 3D objects you have placed in the world.

    Hologram Manipulation Commands

    • Face me
    • Bigger | Enhance
    • Smaller
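
    To make the effect of these commands concrete, here is a minimal Python sketch of how an app might apply them to its own object. Everything in it is an assumption for illustration: the Hologram fields, the SCALE_STEP factor, and the yaw convention (0 degrees looks along +z) are not taken from the HoloLens shell.

```python
import math
from dataclasses import dataclass

SCALE_STEP = 1.25  # assumption: the real resize factor is not documented here


@dataclass
class Hologram:
    x: float = 0.0            # position on the floor plane, in meters
    z: float = 0.0
    yaw_degrees: float = 0.0  # assumed convention: 0 degrees looks along +z
    scale: float = 1.0


def apply_command(hologram, phrase, user_x=0.0, user_z=0.0):
    """Apply one manipulation command; return False if the phrase is unknown."""
    command = phrase.strip().lower()
    if command in ("bigger", "enhance"):
        hologram.scale *= SCALE_STEP
    elif command == "smaller":
        hologram.scale /= SCALE_STEP
    elif command == "face me":
        # Rotate around the vertical axis so the hologram looks at the user.
        hologram.yaw_degrees = math.degrees(
            math.atan2(user_x - hologram.x, user_z - hologram.z)
        )
    else:
        return False
    return True
```

    "Bigger" and "Enhance" are treated as synonyms here because the list above presents them as alternatives.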

    Dictation

    Voice dictation can be a more efficient way to enter text into an app than typing with air taps. It can greatly accelerate input while requiring less effort from the user.


    Any time the holographic keyboard is active, you can switch to dictation mode instead of typing. Select the microphone on the side of the text input box to get started.

    Communication

    For applications that want to take advantage of the customized audio input processing options provided by HoloLens, it is important to understand the various audio stream categories your app can consume. Windows 10 supports several different stream categories, and HoloLens uses three of them to enable custom processing that optimizes microphone audio quality: one tailored for speech, one for communication, and "other", which can be used for ambient environment audio capture (i.e. "camcorder") scenarios.
    • The AudioCategory_Communications stream category is customized for call quality and narration scenarios and provides the client with a 16 kHz 24-bit mono audio stream of the user's voice.
    • The AudioCategory_Speech stream category is customized for the HoloLens (Windows) speech engine and provides it with a 16 kHz 24-bit mono stream of the user's voice. This category can be used by third-party speech engines if needed.
    • The AudioCategory_Other stream category is customized for ambient environment audio recording and provides the client with a 48 kHz 24-bit stereo audio stream.
    All of this audio processing is hardware accelerated, which means the features drain far less power than the same processing would if it ran on the HoloLens CPU. To maximize battery life, avoid running other audio input processing on the CPU and take advantage of the built-in, offloaded audio input processing.
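
    The stream formats listed above determine how much raw audio data a client has to handle. The Python sketch below works out the data rate and buffer size for each category. It assumes uncompressed PCM with tightly packed 24-bit samples (3 bytes each); real capture APIs may deliver 24-bit samples in 32-bit containers, so treat the numbers as a lower bound. StreamFormat and buffer_bytes are hypothetical helper names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StreamFormat:
    sample_rate_hz: int
    bits_per_sample: int
    channels: int

    @property
    def bytes_per_second(self):
        # Assumes packed samples: 24 bits -> 3 bytes, no container padding.
        return self.sample_rate_hz * (self.bits_per_sample // 8) * self.channels


# Formats as described in the text for each stream category.
STREAM_FORMATS = {
    "AudioCategory_Communications": StreamFormat(16_000, 24, 1),
    "AudioCategory_Speech": StreamFormat(16_000, 24, 1),
    "AudioCategory_Other": StreamFormat(48_000, 24, 2),
}


def buffer_bytes(fmt, milliseconds):
    """Bytes needed to hold the given duration of audio in this format."""
    return fmt.bytes_per_second * milliseconds // 1000


if __name__ == "__main__":
    for name, fmt in STREAM_FORMATS.items():
        print(name, fmt.bytes_per_second, "bytes/s,",
              buffer_bytes(fmt, 10), "bytes per 10 ms")
```

    Under these assumptions the two voice streams come to 48,000 bytes per second each, while the stereo ambient stream is six times larger at 288,000 bytes per second.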

    Troubleshooting

    If you're having any issues using "select" or "Hey Cortana", try moving to a quieter space, turning away from the source of noise, or speaking louder. At this time, all speech recognition on HoloLens is tuned and optimized specifically for native speakers of United States English.

    See also

    • Voice input in DirectX
    • Voice input in Unity
    • Holograms 212