Who is the assignee on this patent?

Labsky Martin, Kleindienst Jan, Macek Tomas, and 4 more

What technology area does this patent fall under?

Primary CPC classification G10L13/08. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Dec 31 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Methods and apparatus for correcting recognition errors

US10522133B2 · US · B2

Patent metadata
Field	Value
Publication number	US-10522133-B2
Application number	US-201213479010-A
Country	US
Kind code	B2
Filing date	May 23, 2012
Priority date	May 23, 2011
Publication date	Dec 31, 2019
Grant date	Dec 31, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques for error correction using a history list comprising at least one misrecognition and correction information associated with each of the at least one misrecognitions indicating how a user corrected the associated misrecognition. The techniques include converting data input from a user to generate a text segment, determining whether at least a portion of the text segment appears in the history list as one of the at least one misrecognitions, if the at least a portion of the text segment appears in the history list as one of the at least one misrecognitions, obtaining the correction information associated with the at least one misrecognition, and correcting the at least a portion of the text segment based, at least in part, on the correction information.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of correcting speech recognition errors, the method comprising: converting, using at least one automatic speech recognition engine, speech data input by a first user to generate a text segment; determining whether at least a portion of the text segment appears as a misrecognition in a history list, the history list comprising: a plurality of misrecognitions that resulted from errors in automated speech recognition previously performed on speech input received from the first user; and correction information associated with each of the plurality of misrecognitions indicating how the first user corrected the associated misrecognition; when the at least a portion of the text segment appears in the history list as one of the plurality of misrecognitions, obtaining, from the history list, the correction information associated with the one misrecognition; determining whether to automatically correct the text segment based on the correction information obtained from the history list; when it is determined not to automatically correct the text segment based on the correction information obtained from the history list, outputting the text segment without correction for presentation to the first user; and when it is determined to automatically correct the text segment based on the correction information obtained from the history list, automatically correcting the at least a portion of the text segment using the correction information obtained from the history list; when it is determined that the first user edited the text segment to produce corrected text, determining whether to insert the text segment and the corrected text into the history list at least in part by determining whether the first user is replacing the text segment which was correctly recognized as presented; and inserting the text segment and the corrected text to the history list when it is determined that the text segment and the corrected text is to be inserted into the history list. 2. The method of claim 1 , wherein when it is determined to automatically correct the text segment based on the correction information obtained from the history list, presenting the text segment with the corrected at least a portion of the text segment to the first user. 3. The method of claim 1 , wherein outputting the text segment without correction for presentation to the first user comprises outputting the text segment to the first user along with the correction information as a suggested correction for the at least a portion of the text segment. 4. The method of claim 1 , wherein determining whether to insert the text segment and the corrected text into the history list is performed using a confidence value generated by the at least one automatic speech recognition engine when converting the speech data input by the first user to generate the text segment. 5. The method of claim 1 , wherein determining whether to insert the text segment and the corrected text into the history list comprises determining whether the corrected text is in an N-best list generated by the at least one automatic speech recognition engine when converting the speech data input by the first user to generate the text segment. 6. The method of claim 1 , wherein the history list comprises a plurality of previous recognized text segments, and wherein at least one of the plurality of previously recognized text segments includes an expiration after which the at least one of the plurality of previously recognized text segments is removed from the history list, the expiration being associated with at least one duration and/or at least one event. 7. The method of claim 1 , wherein determining whether to insert the text segment and the corrected text into the history list comprises determining whether the first user is correcting the misrecognition in the text segment or is replacing the text segment which was correctly recognized as presented. 8. The method of claim 7 , wherein determining whether the first user is correcting the misrecognition in the text segment or is replacing the text segment which was correctly recognized as presented is based, at least in part, on: at least one alternative recognition result for the speech data, a confidence score for the text segment generated by the at least one automatic speech recognition engine when converting the speech data input by the first user to generate the text segment, or both the at least one alternative recognition result and the confidence score. 9. The method of claim 7 , further comprising: in response to a determination that the first user is replacing the text segment which was correctly recognized as presented, refraining from inserting the text segment and the corrected text to the history list. 10. The method of claim 7 , further comprising: in response to a determination that the first user is correcting the misrecognition in the text segment, inserting the text segment and the corrected text to the history list. 11. The method of claim 1 , wherein when it is determined not to automatically correct the text segment based on the correction information obtained from the history list, alerting the first user to possible misrecognition. 12. The method of claim 11 , wherein alerting the first user to possible misrecognition comprises highlighting the text segment to the first user. 13. At least one non-transitory computer readable medium storing instructions that, when executed on at least one processor, perform a method of correcting speech recognition errors, the method comprising: converting, using at least one automatic speech recognition engine, speech data input by a first user to generate a text segment; determining whether at least a portion of the text segment appears as a misrecognition in a history list, the history list comprising: a plurality of misrecognitions that resulted from errors in automated speech recognition previously performed on speech input received from the first user; and correction information associated with each of the plurality of misrecognitions indicating how the first user corrected the associated misrecognition; when the at least a portion of the text segment appears in the history list as one of the plurality of misrecognitions, obtaining, from the history list, the correction information associated with the one misrecognition; determining whether to automatically correct the text segment based on the correction information obtained from the history list; when it is determined not to automatically correct the text segment based on the correction information obtained from the history list, outputting the text segment without correction for presentation to the first user; and when it is determined to automatically correct the text segment based on the correction information obtained from the history list, automatically correcting the at least a portion of the text segment using the correction information obtained from the history list; when it is determined that the first user edited the text segment to produce corrected text, determining whether to insert the text segment and the corrected text into the history list at least in part by determining whether the first user is replacing the text segment which was correctly recognized as presented; and inserting the text segment and the corrected text to the history list when it is determined that the text segment and the corrected text is to be inserted into the history list. 14. The at least one non-transitory computer readable medium of claim 13 , wherein when it is determined to automatically correct the text segment based

Assignees

Inventors

Classifications

G10L13/00
Speech synthesis; Text to speech systems · CPC title
G10L15/06
Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice (G10L15/14 takes precedence) · CPC title
G06F40/10
Text processing (natural language analysis G06F40/20; semantic analysis G06F40/30; processing or translation of natural language G06F40/40) · CPC title
G10L15/14
using statistical models, e.g. Hidden Markov Models [HMMs] (G10L15/18 takes precedence) · CPC title
G06F40/30
Semantic analysis · CPC title

Patent family

Related publications grouped by family.

View patent family 47219819

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10522133B2 cover?: Techniques for error correction using a history list comprising at least one misrecognition and correction information associated with each of the at least one misrecognitions indicating how a user corrected the associated misrecognition. The techniques include converting data input from a user to generate a text segment, determining whether at least a portion of the text segment appears in the…
Who is the assignee on this patent?: Labsky Martin, Kleindienst Jan, Macek Tomas, and 4 more
What technology area does this patent fall under?: Primary CPC classification G10L13/08. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Dec 31 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).