Methods and apparatus for interpreting received speech data using speech recognition

US9418679B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9418679-B2
Application numberUS-201414457809-A
CountryUS
Kind codeB2
Filing dateAug 12, 2014
Priority dateAug 12, 2014
Publication dateAug 16, 2016
Grant dateAug 16, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for processing a received set of speech data, wherein the received set of speech data comprises an utterance, is provided. The method executes a process to generate a plurality of confidence scores, wherein each of the plurality of confidence scores is associated with one of a plurality of candidate utterances; determines a plurality of difference values, each of the plurality of difference values comprising a difference between two of the plurality of confidence scores; and compares the plurality of difference values to determine at least one disparity.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for processing a received set of speech data, wherein the received set of speech data comprises an utterance, the method comprising: receiving a set of speech data; executing a process to calculate a plurality of probabilities based on the received set of speech data, wherein each of the plurality of probabilities indicates a statistical likelihood that the set of speech data comprises one of a plurality of candidate utterances; determining a plurality of difference values, each of the plurality of difference values comprising a difference between two of the plurality of probabilities; and comparing the plurality of difference values to determine at least one disparity; wherein the determining step further comprises: calculating a first difference between a first probability and a second probability; calculating a second difference between a third probability and a fourth probability; and comparing the first difference to the second difference to determine the at least one disparity; wherein the plurality of probabilities comprises the first probability, the second probability, the third probability, and the fourth probability; and wherein the plurality of difference values comprises the first difference and the second difference. 2. The method of claim 1 , further comprising: when the disparity is greater than a disparity threshold, identifying a correct candidate utterance associated with one of the plurality of probabilities, wherein the correct candidate utterance comprises one of the plurality of candidate utterances; and completing a task associated with the correct candidate utterance. 3. The method of claim 1 , further comprising: when the disparity is less than a disparity threshold, present a user with a request to repeat the utterance. 4. The method of claim 1 , further comprising: when the disparity comprises a value within a range of predetermined values, present a user with at least one of the plurality of candidate utterances for verification. 5. The method of claim 1 , wherein each of the plurality of probabilities comprises a calculated probability that a particular string of phonemes corresponds to the received set of speech data. 6. The method of claim 1 , further comprising: identifying one or more probabilities above a probability threshold; and determining the plurality of difference values using the identified one or more probabilities. 7. A system for processing a received set of speech data, wherein the received set of speech data comprises an utterance, the system comprising: a user interface module, configured to receive the set of speech data; a probability module, configured to calculate a plurality of probabilities based on the received set of speech data, each of the calculated plurality of probabilities indicating a statistical likelihood that the set of speech data comprises one of a plurality of candidate utterances; and a data analysis module, configured to: calculate a plurality of difference values, each of the plurality of difference values comprising a difference between two of the plurality of probabilities; and compare the plurality of difference values to determine at least one disparity; wherein the data analysis module is further configured to: calculate a first difference between a first probability and a second probability; calculate a second difference between a third probability and a fourth probability; and compare the first difference to the second difference to determine the at least one disparity: wherein the plurality of probabilities comprises the first probability, the second probability, the third probability, and the fourth probability; and wherein the plurality of difference values comprises the first difference and the second difference. 8. The system of claim 7 , wherein, when the disparity is greater than a disparity threshold, the data analysis module is further configured to: identify a correct candidate utterance associated with one of the plurality of probabilities, wherein the correct candidate utterance comprises one of the plurality of candidate utterances; and complete a task associated with the correct candidate utterance. 9. The system of claim 7 , wherein, when the disparity is less than a disparity threshold, the user interface module is further configured to present a user with a request to repeat the utterance. 10. The system of claim 7 , wherein, when the at least one disparity comprises a value within a range of predetermined values, the user interface module is further configured to present a user with at least one of a plurality of candidate utterances for verification. 11. The system of claim 7 , wherein the data analysis module is further configured to: identify one or more probabilities above a probability threshold; and determine the plurality of difference values using the identified one or more probabilities. 12. A non-transitory, computer-readable medium containing instructions thereon, which, when executed by a processor, perform a method comprising: receiving a set of speech data, wherein an utterance comprises the set of speech data; calculating a plurality of probabilities, wherein each of the plurality of probabilities indicates a statistical likelihood that the set of speech data comprises one of a plurality of candidate utterances; determining a plurality of difference values, each of the plurality of difference values comprising a difference between two of the plurality of probabilities; and comparing the plurality of difference values to determine at least one disparity; wherein the method further comprises: calculating a first difference between a first probability and a second probability; calculating a second difference between a third probability and a fourth probability; and comparing the first difference to the second difference to determine the at least one disparity: wherein the plurality of probabilities comprises the first probability, the second probability, the third probability, and the fourth probability; and wherein the plurality of difference values comprises the first difference and the second difference. 13. The non-transitory, computer-readable medium of claim 12 , wherein the method further comprises: when the disparity is greater than a disparity threshold, identifying a correct candidate utterance associated with one of the plurality of probabilities, wherein the candidate utterance comprises one of the plurality of candidate utterances; and completing a task associated with the correct candidate utterance. 14. The non-transitory, computer-readable medium of claim 12 , wherein the method further comprises: when the disparity is less than a disparity threshold, presenting a user with a request to repeat the utterance. 15. The non-transitory, computer-readable medium of claim 12 , wherein the method further comprises: when the disparity comprises a value within a range of predetermined values, presenting a user with at least one of the plurality of candidate utterances for verification. 16. The non-transitory, computer-readable medium of claim 12 , wherein the method further comprises: identifying one or more probabilities above a probability threshold; and determining the plurality of difference values using the identified one or more probabilities. 17. The non-transitory, computer-readable medium of claim 12 , wherein each of the plurality of probabilities comprises a calculated probability that a particular string of codewords corresponds to the received set

Assignees

Inventors

Classifications

  • G10L15/08Primary

    Speech classification or search · CPC title

  • Feature extraction for speech recognition; Selection of recognition unit · CPC title

  • Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • G10L25/51Primary

    for comparison or discrimination · CPC title

  • Hidden Markov Models [HMMs] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9418679B2 cover?
A method for processing a received set of speech data, wherein the received set of speech data comprises an utterance, is provided. The method executes a process to generate a plurality of confidence scores, wherein each of the plurality of confidence scores is associated with one of a plurality of candidate utterances; determines a plurality of difference values, each of the plurality of diffe…
Who is the assignee on this patent?
Honeywell Int Inc
What technology area does this patent fall under?
Primary CPC classification G10L15/08. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 16 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).