Speech recognition using loosely coupled components

US9666190B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9666190-B2
Application numberUS-201615218492-A
CountryUS
Kind codeB2
Filing dateJul 25, 2016
Priority dateJun 13, 2011
Publication dateMay 30, 2017
Grant dateMay 30, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An automatic speech recognition system includes an audio capture component, a speech recognition processing component, and a result processing component which are distributed among two or more logical devices and/or two or more physical devices. In particular, the audio capture component may be located on a different logical device and/or physical device from the result processing component. For example, the audio capture component may be on a computer connected to a microphone into which a user speaks, while the result processing component may be on a terminal server which receives speech recognition results from a speech recognition processing server.

First claim

Opening claim text (preview).

What is claimed is: 1. A system comprising: an audio capture component, the audio capture component comprising means for capturing a first audio signal representing first speech of a user to produce a first captured audio signal; a speech recognition processing component comprising means for performing automatic speech recognition on the first captured audio signal to produce first speech recognition results; a first result processing component, the first result processing component comprising first means for processing the first speech recognition results to produce first result output; a second result processing component, the second result processing component comprising second means for processing the first speech recognition results to produce second result output; a context sharing component comprising means for identifying a first one of the first and second result processing components as being associated with a first context of the user at a first time, the context sharing component further comprising: means for receiving credentials from the user; means for identifying, based on the credentials, a list of at least one result processing component authorized for use on behalf of the user at the first time; means for determining that the at least one result processing component in the list is associated with the context of the user at the first time; and means for identifying a location of the at least one result processing component associated with the context of the user at the first time; and speech recognition result provision means for providing, via a method selected based on the identified location, the first speech recognition results to the identified first one of the first and second result processing components. 2. The system of claim 1 , wherein: the audio capture component further comprises means for capturing a second audio signal representing second speech of the user to produce a second captured audio signal; the speech recognition processing component further comprises means for performing automatic speech recognition on the second captured audio signal to produce second speech recognition results; the context sharing component further comprises means for identifying a second one of the first and second result processing components as being associated with a second context of the user at a second time, wherein the second one of the first and second result processing components differs from the first one of the first and second result processing components; and wherein the speech recognition result provision means further comprises means for providing the second speech recognition results to the identified second one of the first and second result processing components. 3. The system of claim 1 , wherein the credentials comprise a username and password of the user. 4. A computer-implemented method for use with a system: wherein the system comprises: an audio capture component; a speech recognition processing component; a first result processing component; a second result processing component; a context sharing component; and speech recognition result provision means; wherein the method comprises: (A) using the audio capture component to capture a first audio signal representing first speech of a user to produce a first captured audio signal; (B) using the speech recognition processing component to perform automatic speech recognition on the first captured audio signal to produce first speech recognition results; (C) using the first result processing component to process the first speech recognition results to produce first result output; (D) using second result processing component to process the first speech recognition results to produce second result output; (E) using the context sharing component to identify a first one of the first and second result processing components as being associated with a first context of the user at a first time, wherein using the context sharing component to identify further comprises: receiving credentials from the user; identifying, based on the credentials, a list of at least one result processing component authorized for use on behalf of the user at the first time; determining that the at least one result processing component in the list is associated with the context of the user at the first time; and identifying a location of the at least one result processing component associated with the context of the user at the first time; and (F) using the speech recognition result provision means to provide, via a method selected based on the identified location, the first speech recognition results to the identified first one of the first and second result processing components. 5. The method of claim 4 , further comprising: (G) using the audio capture component to capture a second audio signal representing second speech of the user to produce a second captured audio signal; (H) using the speech recognition processing component to perform automatic speech recognition on the second captured audio signal to produce second speech recognition results; (I) using the context sharing component to identify a second one of the first and second result processing components as being associated with a second context of the user at a second time, wherein the second one of the first and second result processing components differs from the first one of the first and second result processing components; and (J) using the speech recognition result provision means to provide the second speech recognition results to the identified second one of the first and second result processing components. 6. The method of claim 4 , wherein the credentials comprise a username and password of the user.

Assignees

Inventors

Classifications

  • G10L15/22Primary

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • G10L15/30Primary

    Distributed recognition, e.g. in client-server systems, for mobile phones or network applications · CPC title

  • of application context · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9666190B2 cover?
An automatic speech recognition system includes an audio capture component, a speech recognition processing component, and a result processing component which are distributed among two or more logical devices and/or two or more physical devices. In particular, the audio capture component may be located on a different logical device and/or physical device from the result processing component. Fo…
Who is the assignee on this patent?
Mmodal Ip Llc
What technology area does this patent fall under?
Primary CPC classification G10L15/22. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 30 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).