Speech recognition adaptation systems based on adaptation data

US9620128B2 · US · B2

Patent metadata
Field: Value
Publication number: US-9620128-B2
Application number: US-201213564647-A
Country: US
Kind code: B2
Filing date: Aug 1, 2012
Priority date: May 31, 2012
Publication date: Apr 11, 2017
Grant date: Apr 11, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The instant application includes computationally-implemented systems and methods that include managing adaptation data, the adaptation data is at least partly based on at least one speech interaction of a particular party, facilitating transmission of the adaptation data to a target device when there is an indication of a speech-facilitated transaction between the target device and the particular party, such that the adaptation data is to be applied to the target device to assist in execution of the speech-facilitated transaction, and facilitating acquisition of adaptation result data that is based on at least one aspect of the speech-facilitated transaction and to be used in determining whether to modify the adaptation data. In addition to the foregoing, other aspects are described in the claims, drawings, and text.
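As a rough illustration of the flow the abstract describes, the Python sketch below models its three steps: managing adaptation data for a party, handing that data to a target device when a speech-facilitated transaction is indicated, and collecting result data to decide whether the adaptation data should be modified. Every class, field, and method name here is hypothetical and does not come from the patent, and the update policy in `on_result` is an assumption made only so the example is complete.

```python
from dataclasses import dataclass, field


@dataclass
class AdaptationData:
    """Hypothetical container for one party's speech adaptation data."""
    party_id: str
    pronunciations: dict = field(default_factory=dict)
    version: int = 1


@dataclass
class AdaptationResult:
    """Hypothetical feedback about one speech-facilitated transaction."""
    party_id: str
    speech_recognized: bool
    misheard_words: dict = field(default_factory=dict)


class TargetDevice:
    """Stand-in for a device that accepts spoken input."""

    def __init__(self):
        self.adaptation = None

    def apply(self, data):
        # Apply the received adaptation data to assist the transaction.
        self.adaptation = data


class AdaptationManager:
    """Sketch of the manage / transmit / acquire-result loop."""

    def __init__(self):
        self._store = {}

    def manage(self, data):
        # Step 1: keep adaptation data keyed by the particular party.
        self._store[data.party_id] = data

    def on_transaction_indicated(self, party_id, device):
        # Step 2: transmit only when a transaction is indicated
        # and data exists for that party.
        data = self._store.get(party_id)
        if data is None:
            return False
        device.apply(data)
        return True

    def on_result(self, result):
        # Step 3: use result data to decide whether to modify.
        # Assumed policy: modify unless recognition succeeded cleanly.
        data = self._store[result.party_id]
        if result.speech_recognized and not result.misheard_words:
            return False
        data.pronunciations.update(result.misheard_words)
        data.version += 1
        return True
```

In this sketch the manager returns a boolean from `on_result` so a caller can tell whether the stored data changed; the patent text itself only says the result data is "to be used in determining whether to modify the adaptation data".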

First claim

Opening claim text (preview).

What is claimed is:

1. A computationally-implemented method, comprising: managing adaptation data that is stored at a reference location, wherein the adaptation data is at least partly based on at least one speech interaction of a particular party; determining an availability of the adaptation data by comparing a property of the adaptation data located at the referenced location with an expected value of the property of the adaptation data; facilitating transmission of the adaptation data to a target device when there is an indication of a speech-facilitated transaction between the target device and the particular party, wherein the adaptation data is configured to be applied to the target device to assist in execution of the speech-facilitated transaction; and facilitating acquisition of adaptation result data that is based on at least one aspect of the speech-facilitated transaction and configured to be used in determining whether to modify the adaptation data, upon receipt of an indication from the target device of a status of the speech-facilitated transaction between the target device and the particular party, wherein said status includes an indicator of a success in determining speech of the speech-facilitated transaction.

2. The computationally-implemented method of claim 1, wherein said managing adaptation data, wherein the adaptation data is at least partly based on at least one speech interaction of a particular party comprises: managing adaptation data, wherein the adaptation data is at least partly based on at least one speech interaction of a particular party with a particular device.

3. The computationally-implemented method of claim 1, wherein said managing adaptation data, wherein the adaptation data is at least partly based on at least one speech interaction of a particular party comprises: managing adaptation data, wherein the adaptation data includes one or more of: a training set of audio data and corresponding transcript data; a regional dialect speech modification algorithm; a foreign language accent modifier algorithm; a speech impediment modification algorithm tailored to a particular user; a frequently mispronounced word recognition adjustment algorithm; a speech processing algorithm tailored to a user based at least one accent and/or tone; a list of favorite words of a particular user; an ambient noise level adjustment algorithm; a value of a parameter in a speech interpretation algorithm; a list of one or more words in a pronunciation dictionary whose pronunciations deviate a predetermined amount from their general pronunciations; a training set of audio data and corresponding transcript data; a phrase completion algorithm used to assist in interpreting spoken words based on context; a pronunciation dictionary; and a training set of one or more words related to a target device and one or more pronunciations of the one or more words.

4. The computationally-implemented method of claim 1, wherein said facilitating transmission of the adaptation data to a target device when there is an indication of a speech-facilitated transaction between the target device and the particular party, wherein the adaptation data is configured to be applied to the target device to assist in execution of the speech-facilitated transaction comprises: transmitting adaptation data to a target device when there is indication of a speech-facilitated transaction between the target device and the particular party, wherein the adaptation data is configured to be applied to the target device to assist in execution of the speech-facilitated transaction.

5. The computationally-implemented method of claim 1, wherein said facilitating transmission of the adaptation data to a target device when there is an indication of a speech-facilitated transaction between the target device and the particular party, wherein the adaptation data is configured to be applied to the target device to assist in execution of the speech-facilitated transaction comprises: facilitating transmission of the adaptation data to the target device upon receipt of an indication from the target device of a particular number of attempts to receive a particular type of response from the particular party, wherein the adaptation data is configured to be applied to the target device to assist in execution of the speech-facilitated transaction.

6. The computationally-implemented method of claim 1, wherein said facilitating transmission of the adaptation data to a target device when there is an indication of a speech-facilitated transaction between the target device and the particular party, wherein the adaptation data is configured to be applied to the target device to assist in execution of the speech-facilitated transaction comprises: facilitating transmission of the adaptation data to the target device upon receipt of an indication from the target device that a speech recognition component of the target device is processing speech of the particular party below a particular success rate.

7. The computationally-implemented method of claim 1, wherein said facilitating transmission of the adaptation data to a target device when there is an indication of a speech-facilitated transaction between the target device and the particular party, wherein the adaptation data is configured to be applied to the target device to assist in execution of the speech-facilitated transaction comprises: facilitating transmission of the adaptation data to the target device upon receipt of an indication from the target device that a speech recognition component of the target device has a confidence rate below a particular threshold.

8. The computationally-implemented method of claim 1, wherein said facilitating transmission of the adaptation data to a target device when there is an indication of a speech-facilitated transaction between the target device and the particular party, wherein the adaptation data is configured to be applied to the target device to assist in execution of the speech-facilitated transaction comprises: facilitating transmission of the adaptation data to a target device when there is an indication of a speech-facilitated transaction between the target device and the particular party, wherein the adaptation data is configured to be applied to the target device to improve performance in processing speech received during execution of the speech-facilitated transaction.

9. The computationally-implemented method of claim 8, wherein said facilitating transmission of the adaptation data to a target device when there is an indication of a speech-facilitated transaction between the target device and the particular party, wherein the adaptation data is configured to be applied to the target device to improve performance in processing speech received during execution of the speech-facilitated transaction comprises: facilitating transmission of the adaptation data to a target device when there is an indication of a speech-facilitated transaction between the target device and the particular party, wherein the adaptation data is configured to be applied to the target device to improve speed in processing speech received during execution of the speech-facilitated transaction.

10. The computationally-implemented method of claim 1, wherein said facilitating acquisition of adaptation result data that is based on at least one aspect of the speech-facilitated transaction and configured to be used in determining whether to modify the adaptation data comprises: generating adaptation result data that is based on at least one aspect of the speech-facilitated transaction and configured to be used in determining whether to modify the adaptation data.
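Two parts of the claims lend themselves to a small sketch: the availability check in claim 1 (comparing a property of the stored adaptation data with an expected value) and the transmission triggers in claims 5 to 7 (a number of attempts, a success rate, and a confidence rate). The Python below is only one possible reading. The claims do not say which property is compared, so the SHA-256 digest is an assumption, and the function names, the `DeviceStatus` fields, and all threshold values are hypothetical.

```python
import hashlib
from dataclasses import dataclass


def is_available(stored_bytes, expected_sha256):
    """Compare a property of the stored data with its expected value.

    Assumption: the compared property is a SHA-256 digest of the bytes
    found at the reference location; the claims only say "a property".
    """
    if stored_bytes is None:
        return False
    return hashlib.sha256(stored_bytes).hexdigest() == expected_sha256


@dataclass
class DeviceStatus:
    """Hypothetical status report sent by a target device."""
    attempts: int = 0          # attempts to get a usable response (claim 5)
    success_rate: float = 1.0  # recognition success rate (claim 6)
    confidence: float = 1.0    # recognizer confidence rate (claim 7)


def should_transmit(status, max_attempts=3, min_success_rate=0.8,
                    min_confidence=0.6):
    """Return True if any trigger modeled on claims 5 to 7 fires.

    The threshold defaults are illustrative, not taken from the patent.
    """
    return (
        status.attempts >= max_attempts
        or status.success_rate < min_success_rate
        or status.confidence < min_confidence
    )
```

Combining the three triggers with `or` is a design choice for the sketch; in the patent they are separate dependent claims, each describing its own condition.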

Assignees

Inventors

Classifications

  • G10L15/07 (Primary)

    to the speaker · CPC title

  • Distributed recognition, e.g. in client-server systems, for mobile phones or network applications · CPC title

  • Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice (G10L15/14 takes precedence) · CPC title

  • G10L19/00 (Primary)

    Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis (in musical instruments G10H) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9620128B2 cover?
The instant application includes computationally-implemented systems and methods that include managing adaptation data, the adaptation data is at least partly based on at least one speech interaction of a particular party, facilitating transmission of the adaptation data to a target device when there is an indication of a speech-facilitated transaction between the target device and the particul…
Who is the assignee on this patent?
Levien Royce A, Lord Richard T, Lord Robert W, and 2 more
What technology area does this patent fall under?
Primary CPC classification G10L15/07. Mapped technology areas include Physics.
When was this patent published?
Publication date: Apr 11, 2017 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).