Methods and apparatus for correcting recognition errors

US10522133B2 · US · B2

Patent metadata
Publication number: US-10522133-B2
Application number: US-201213479010-A
Country: US
Kind code: B2
Filing date: May 23, 2012
Priority date: May 23, 2011
Publication date: Dec 31, 2019
Grant date: Dec 31, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques for error correction using a history list comprising at least one misrecognition and correction information associated with each of the at least one misrecognitions indicating how a user corrected the associated misrecognition. The techniques include converting data input from a user to generate a text segment, determining whether at least a portion of the text segment appears in the history list as one of the at least one misrecognitions, if the at least a portion of the text segment appears in the history list as one of the at least one misrecognitions, obtaining the correction information associated with the at least one misrecognition, and correcting the at least a portion of the text segment based, at least in part, on the correction information.
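The flow described in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented implementation: the HistoryList class, the process function, the auto_threshold parameter, and the policy of auto-correcting only after the user has made the same correction several times are all hypothetical. Matching is plain substring search, which ignores word boundaries.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Tuple


@dataclass
class HistoryList:
    """Hypothetical history list: misrecognition -> (correction, times corrected)."""
    entries: Dict[str, Tuple[str, int]] = field(default_factory=dict)

    def record(self, misrecognition: str, correction: str) -> None:
        """Store how the user corrected a misrecognition, counting repeats."""
        previous, count = self.entries.get(misrecognition, (correction, 0))
        # Restart the count if the user now corrects to something different.
        count = count + 1 if previous == correction else 1
        self.entries[misrecognition] = (correction, count)

    def lookup(self, text_segment: str) -> Optional[Tuple[str, str, int]]:
        """Return (misrecognition, correction, count) for the first entry
        that appears in the text segment, or None if nothing matches."""
        for wrong, (right, count) in self.entries.items():
            if wrong in text_segment:
                return wrong, right, count
        return None


def process(text_segment: str, history: HistoryList,
            auto_threshold: int = 2) -> Tuple[str, Optional[str]]:
    """Return (output_text, suggestion).

    suggestion is None unless a known correction is offered without
    being applied automatically.
    """
    hit = history.lookup(text_segment)
    if hit is None:
        return text_segment, None
    wrong, right, count = hit
    if count >= auto_threshold:  # assumed policy, not taken from the patent
        return text_segment.replace(wrong, right), None
    return text_segment, right  # output uncorrected, with a suggestion


history = HistoryList()
history.record("wreck a nice beach", "recognize speech")
first = process("please wreck a nice beach", history)
history.record("wreck a nice beach", "recognize speech")
second = process("please wreck a nice beach", history)
```

After one recorded correction the segment is returned unchanged with the correction as a suggestion; after the second it is corrected automatically. A real system would likely match on tokens rather than raw substrings.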

First claim

Opening claim text (preview).

What is claimed is:

1. A method of correcting speech recognition errors, the method comprising: converting, using at least one automatic speech recognition engine, speech data input by a first user to generate a text segment; determining whether at least a portion of the text segment appears as a misrecognition in a history list, the history list comprising: a plurality of misrecognitions that resulted from errors in automated speech recognition previously performed on speech input received from the first user; and correction information associated with each of the plurality of misrecognitions indicating how the first user corrected the associated misrecognition; when the at least a portion of the text segment appears in the history list as one of the plurality of misrecognitions, obtaining, from the history list, the correction information associated with the one misrecognition; determining whether to automatically correct the text segment based on the correction information obtained from the history list; when it is determined not to automatically correct the text segment based on the correction information obtained from the history list, outputting the text segment without correction for presentation to the first user; and when it is determined to automatically correct the text segment based on the correction information obtained from the history list, automatically correcting the at least a portion of the text segment using the correction information obtained from the history list; when it is determined that the first user edited the text segment to produce corrected text, determining whether to insert the text segment and the corrected text into the history list at least in part by determining whether the first user is replacing the text segment which was correctly recognized as presented; and inserting the text segment and the corrected text to the history list when it is determined that the text segment and the corrected text is to be inserted into the history list.

2. The method of claim 1, wherein when it is determined to automatically correct the text segment based on the correction information obtained from the history list, presenting the text segment with the corrected at least a portion of the text segment to the first user.

3. The method of claim 1, wherein outputting the text segment without correction for presentation to the first user comprises outputting the text segment to the first user along with the correction information as a suggested correction for the at least a portion of the text segment.

4. The method of claim 1, wherein determining whether to insert the text segment and the corrected text into the history list is performed using a confidence value generated by the at least one automatic speech recognition engine when converting the speech data input by the first user to generate the text segment.

5. The method of claim 1, wherein determining whether to insert the text segment and the corrected text into the history list comprises determining whether the corrected text is in an N-best list generated by the at least one automatic speech recognition engine when converting the speech data input by the first user to generate the text segment.

6. The method of claim 1, wherein the history list comprises a plurality of previous recognized text segments, and wherein at least one of the plurality of previously recognized text segments includes an expiration after which the at least one of the plurality of previously recognized text segments is removed from the history list, the expiration being associated with at least one duration and/or at least one event.

7. The method of claim 1, wherein determining whether to insert the text segment and the corrected text into the history list comprises determining whether the first user is correcting the misrecognition in the text segment or is replacing the text segment which was correctly recognized as presented.

8. The method of claim 7, wherein determining whether the first user is correcting the misrecognition in the text segment or is replacing the text segment which was correctly recognized as presented is based, at least in part, on: at least one alternative recognition result for the speech data, a confidence score for the text segment generated by the at least one automatic speech recognition engine when converting the speech data input by the first user to generate the text segment, or both the at least one alternative recognition result and the confidence score.

9. The method of claim 7, further comprising: in response to a determination that the first user is replacing the text segment which was correctly recognized as presented, refraining from inserting the text segment and the corrected text to the history list.

10. The method of claim 7, further comprising: in response to a determination that the first user is correcting the misrecognition in the text segment, inserting the text segment and the corrected text to the history list.

11. The method of claim 1, wherein when it is determined not to automatically correct the text segment based on the correction information obtained from the history list, alerting the first user to possible misrecognition.

12. The method of claim 11, wherein alerting the first user to possible misrecognition comprises highlighting the text segment to the first user.

13. At least one non-transitory computer readable medium storing instructions that, when executed on at least one processor, perform a method of correcting speech recognition errors, the method comprising: converting, using at least one automatic speech recognition engine, speech data input by a first user to generate a text segment; determining whether at least a portion of the text segment appears as a misrecognition in a history list, the history list comprising: a plurality of misrecognitions that resulted from errors in automated speech recognition previously performed on speech input received from the first user; and correction information associated with each of the plurality of misrecognitions indicating how the first user corrected the associated misrecognition; when the at least a portion of the text segment appears in the history list as one of the plurality of misrecognitions, obtaining, from the history list, the correction information associated with the one misrecognition; determining whether to automatically correct the text segment based on the correction information obtained from the history list; when it is determined not to automatically correct the text segment based on the correction information obtained from the history list, outputting the text segment without correction for presentation to the first user; and when it is determined to automatically correct the text segment based on the correction information obtained from the history list, automatically correcting the at least a portion of the text segment using the correction information obtained from the history list; when it is determined that the first user edited the text segment to produce corrected text, determining whether to insert the text segment and the corrected text into the history list at least in part by determining whether the first user is replacing the text segment which was correctly recognized as presented; and inserting the text segment and the corrected text to the history list when it is determined that the text segment and the corrected text is to be inserted into the history list.

14. The at least one non-transitory computer readable medium of claim 13, wherein when it is determined to automatically correct the text segment based
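Claims 4 through 10 turn on one decision: whether a user edit fixes a recognition error (and so belongs in the history list) or merely replaces text that was recognized correctly. Claim 6 adds an expiration on entries. The following Python sketch illustrates one way these pieces could fit together. Everything in it is an assumption for illustration: the Entry type, the function names, the low_confidence threshold of 0.6, the choice to combine the N-best and confidence signals with a logical OR, and the time-based expiration (the claims also allow event-based expiration, which is not shown).

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class Entry:
    """Hypothetical history-list entry."""
    correction: str
    expires_at: Optional[float]  # None means no time-based expiration


def is_misrecognition_fix(corrected_text: str, n_best: List[str],
                          confidence: float,
                          low_confidence: float = 0.6) -> bool:
    """Assumed heuristic: treat the edit as a recognition fix if the
    corrected text was one of the engine's alternatives, or if the
    engine's confidence in its output was low."""
    return corrected_text in n_best or confidence < low_confidence


def maybe_insert(history: Dict[str, Entry], text_segment: str,
                 corrected_text: str, n_best: List[str], confidence: float,
                 now: float, ttl_seconds: Optional[float] = None) -> bool:
    """Insert (text_segment -> corrected_text) only for recognition fixes.

    Returns True if the pair was inserted, False if the edit looked like
    a replacement of correctly recognized text.
    """
    if not is_misrecognition_fix(corrected_text, n_best, confidence):
        return False
    expires_at = None if ttl_seconds is None else now + ttl_seconds
    history[text_segment] = Entry(corrected_text, expires_at)
    return True


def purge_expired(history: Dict[str, Entry], now: float) -> None:
    """Remove entries whose expiration time has passed."""
    expired = [key for key, entry in history.items()
               if entry.expires_at is not None and entry.expires_at <= now]
    for key in expired:
        del history[key]


history: Dict[str, Entry] = {}
# Corrected text appears in the N-best list: treated as a recognition fix.
inserted = maybe_insert(history, "wreck a nice beach", "recognize speech",
                        ["wreck a nice beach", "recognize speech"],
                        confidence=0.9, now=0.0, ttl_seconds=10.0)
# High confidence and not in the N-best list: treated as a change of mind.
skipped = maybe_insert(history, "see you Monday", "see you Tuesday",
                       ["see you Monday", "see you one day"],
                       confidence=0.95, now=0.0)
```

The current time is passed in explicitly so the example is deterministic; a production system would read a clock and would likely tune the threshold per engine.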

Assignees

Inventors

Classifications

  • Speech synthesis; Text to speech systems

  • Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice (G10L15/14 takes precedence)

  • Text processing (natural language analysis G06F40/20; semantic analysis G06F40/30; processing or translation of natural language G06F40/40)

  • using statistical models, e.g. Hidden Markov Models [HMMs] (G10L15/18 takes precedence)

  • Semantic analysis

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10522133B2 cover?
Techniques for error correction using a history list comprising at least one misrecognition and correction information associated with each of the at least one misrecognitions indicating how a user corrected the associated misrecognition. The techniques include converting data input from a user to generate a text segment, determining whether at least a portion of the text segment appears in the…
Who is the assignee on this patent?
Labsky Martin, Kleindienst Jan, Macek Tomas, and 4 more
What technology area does this patent fall under?
Primary CPC classification G10L13/08. Mapped technology areas include Physics.
When was this patent published?
Publication date: Dec 31, 2019 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).