Data processing method, and storage medium and electronic device thereof
US-2024339107-A1 · Oct 10, 2024 · US
US10522133B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10522133-B2 |
| Application number | US-201213479010-A |
| Country | US |
| Kind code | B2 |
| Filing date | May 23, 2012 |
| Priority date | May 23, 2011 |
| Publication date | Dec 31, 2019 |
| Grant date | Dec 31, 2019 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Techniques for error correction using a history list comprising at least one misrecognition and correction information associated with each of the at least one misrecognitions indicating how a user corrected the associated misrecognition. The techniques include converting data input from a user to generate a text segment, determining whether at least a portion of the text segment appears in the history list as one of the at least one misrecognitions, if the at least a portion of the text segment appears in the history list as one of the at least one misrecognitions, obtaining the correction information associated with the at least one misrecognition, and correcting the at least a portion of the text segment based, at least in part, on the correction information.
Opening claim text (preview).
What is claimed is: 1. A method of correcting speech recognition errors, the method comprising: converting, using at least one automatic speech recognition engine, speech data input by a first user to generate a text segment; determining whether at least a portion of the text segment appears as a misrecognition in a history list, the history list comprising: a plurality of misrecognitions that resulted from errors in automated speech recognition previously performed on speech input received from the first user; and correction information associated with each of the plurality of misrecognitions indicating how the first user corrected the associated misrecognition; when the at least a portion of the text segment appears in the history list as one of the plurality of misrecognitions, obtaining, from the history list, the correction information associated with the one misrecognition; determining whether to automatically correct the text segment based on the correction information obtained from the history list; when it is determined not to automatically correct the text segment based on the correction information obtained from the history list, outputting the text segment without correction for presentation to the first user; and when it is determined to automatically correct the text segment based on the correction information obtained from the history list, automatically correcting the at least a portion of the text segment using the correction information obtained from the history list; when it is determined that the first user edited the text segment to produce corrected text, determining whether to insert the text segment and the corrected text into the history list at least in part by determining whether the first user is replacing the text segment which was correctly recognized as presented; and inserting the text segment and the corrected text to the history list when it is determined that the text segment and the corrected text is to be inserted into the history list. 2. The method of claim 1 , wherein when it is determined to automatically correct the text segment based on the correction information obtained from the history list, presenting the text segment with the corrected at least a portion of the text segment to the first user. 3. The method of claim 1 , wherein outputting the text segment without correction for presentation to the first user comprises outputting the text segment to the first user along with the correction information as a suggested correction for the at least a portion of the text segment. 4. The method of claim 1 , wherein determining whether to insert the text segment and the corrected text into the history list is performed using a confidence value generated by the at least one automatic speech recognition engine when converting the speech data input by the first user to generate the text segment. 5. The method of claim 1 , wherein determining whether to insert the text segment and the corrected text into the history list comprises determining whether the corrected text is in an N-best list generated by the at least one automatic speech recognition engine when converting the speech data input by the first user to generate the text segment. 6. The method of claim 1 , wherein the history list comprises a plurality of previous recognized text segments, and wherein at least one of the plurality of previously recognized text segments includes an expiration after which the at least one of the plurality of previously recognized text segments is removed from the history list, the expiration being associated with at least one duration and/or at least one event. 7. The method of claim 1 , wherein determining whether to insert the text segment and the corrected text into the history list comprises determining whether the first user is correcting the misrecognition in the text segment or is replacing the text segment which was correctly recognized as presented. 8. The method of claim 7 , wherein determining whether the first user is correcting the misrecognition in the text segment or is replacing the text segment which was correctly recognized as presented is based, at least in part, on: at least one alternative recognition result for the speech data, a confidence score for the text segment generated by the at least one automatic speech recognition engine when converting the speech data input by the first user to generate the text segment, or both the at least one alternative recognition result and the confidence score. 9. The method of claim 7 , further comprising: in response to a determination that the first user is replacing the text segment which was correctly recognized as presented, refraining from inserting the text segment and the corrected text to the history list. 10. The method of claim 7 , further comprising: in response to a determination that the first user is correcting the misrecognition in the text segment, inserting the text segment and the corrected text to the history list. 11. The method of claim 1 , wherein when it is determined not to automatically correct the text segment based on the correction information obtained from the history list, alerting the first user to possible misrecognition. 12. The method of claim 11 , wherein alerting the first user to possible misrecognition comprises highlighting the text segment to the first user. 13. At least one non-transitory computer readable medium storing instructions that, when executed on at least one processor, perform a method of correcting speech recognition errors, the method comprising: converting, using at least one automatic speech recognition engine, speech data input by a first user to generate a text segment; determining whether at least a portion of the text segment appears as a misrecognition in a history list, the history list comprising: a plurality of misrecognitions that resulted from errors in automated speech recognition previously performed on speech input received from the first user; and correction information associated with each of the plurality of misrecognitions indicating how the first user corrected the associated misrecognition; when the at least a portion of the text segment appears in the history list as one of the plurality of misrecognitions, obtaining, from the history list, the correction information associated with the one misrecognition; determining whether to automatically correct the text segment based on the correction information obtained from the history list; when it is determined not to automatically correct the text segment based on the correction information obtained from the history list, outputting the text segment without correction for presentation to the first user; and when it is determined to automatically correct the text segment based on the correction information obtained from the history list, automatically correcting the at least a portion of the text segment using the correction information obtained from the history list; when it is determined that the first user edited the text segment to produce corrected text, determining whether to insert the text segment and the corrected text into the history list at least in part by determining whether the first user is replacing the text segment which was correctly recognized as presented; and inserting the text segment and the corrected text to the history list when it is determined that the text segment and the corrected text is to be inserted into the history list. 14. The at least one non-transitory computer readable medium of claim 13 , wherein when it is determined to automatically correct the text segment based
Speech synthesis; Text to speech systems · CPC title
Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice (G10L15/14 takes precedence) · CPC title
Text processing (natural language analysis G06F40/20; semantic analysis G06F40/30; processing or translation of natural language G06F40/40) · CPC title
using statistical models, e.g. Hidden Markov Models [HMMs] (G10L15/18 takes precedence) · CPC title
Semantic analysis · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.