Data processing method, and storage medium and electronic device thereof
US-2024339107-A1 · Oct 10, 2024 · US
US9236045B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9236045-B2 |
| Application number | US-201213478930-A |
| Country | US |
| Kind code | B2 |
| Filing date | May 23, 2012 |
| Priority date | May 23, 2011 |
| Publication date | Jan 12, 2016 |
| Grant date | Jan 12, 2016 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Techniques for presenting data input as a plurality of data chunks including a first data chunk and a second data chunk. The techniques include converting the plurality of data chunks to a textual representation comprising a plurality of text chunks including a first text chunk corresponding to the first data chunk and a second text chunk corresponding to the second data chunk, respectively, and providing a presentation of at least part of the textual representation such that the first text chunk is presented differently than the second text chunk to, when presented, assist a user in proofing the textual representation.
Opening claim text (preview).
What is claimed is: 1. A method for assisting a user verify accuracy of and/or correct text obtained by performing automatic speech recognition on speech input by the user, the method comprising: using at least one computer hardware processor to perform: receiving speech input by the user over a course of multiple user turns as a plurality of speech chunks, each of the plurality of speech chunks comprising speech spoken by the user during a respective single user turn, the plurality of speech chunks including a first speech chunk comprising data corresponding to at least two words spoken by the user; converting, by performing automatic speech recognition, the plurality of speech chunks to a textual representation comprising a plurality of text chunks, each of the plurality of speech chunks corresponding to a respective one of the plurality of text chunks, the plurality of text chunks comprising a first text chunk corresponding to the first speech chunk and comprising at least two recognized words corresponding to the at least two words; and for each text chunk in the plurality of text chunks: automatically designating the text chunk of the plurality of text chunks as an active text chunk, whenever the text chunk corresponds to a last speech chunk input by the user; and providing a visual presentation of the active text chunk and at least one other text chunk in the plurality of text chunks such that the active text chunk is visually presented differently than the at least one other text chunk to assist the user, when presented, in proofing the textual representation. 2. The method of claim 1 , further comprising: designating another of the plurality of text chunks as the active text chunk in response to user input indicating that the user would like to select a different one of the plurality of text chunks to be the active text chunk; and modifying the visual presentation to highlight the newly designated active text chunk. 3. The method of claim 1 , further comprising deleting at least a portion of the active text chunk from the textual representation in response to receiving user input to delete the at least a portion of the active text chunk. 4. The method of claim 1 , further comprising replacing at least a portion of the active text chunk with different text converted from further speech input from the user in response to receiving user input to replace the at least a portion of the active text chunk. 5. The method of claim 1 , wherein the visual presentation includes a visual presentation of each of the plurality of text chunks. 6. The method of claim 5 , further comprising visually rendering the visual presentation to the user via a display. 7. The method of claim 1 , wherein the textual representation is formed, at least in part, of a plurality of words, the method further comprising: designating one of the plurality of words as an active word in response to a user selecting a word mode; designating another of the plurality of words as the active word in response to user input indicating that the user would like to select a different one of the plurality of words to be the active word; and modifying the visual presentation to highlight the newly designated active word. 8. The method of claim 1 , wherein the textual representation is formed, at least in part, of a plurality of characters, the method further comprising: designating one of the plurality of characters as an active character in response to a user selecting a character mode; designating another of the plurality of characters as the active character in response to user input indicating that the user would like to select a different one of the plurality of characters to be the active character; and modifying the visual presentation to highlight the newly designated active character. 9. The method of claim 1 , wherein the active text chunk comprises at least two words. 10. A system for assisting a user verify accuracy of and/or correct text obtained by performing automatic speech recognition on speech input by the user, the system comprising: at least one computer hardware processor configured to perform: receiving speech input by the user over a course of multiple user turns as a plurality of speech chunks, each of the plurality of speech chunks comprising speech spoken by the user during a respective single user turn, the plurality of speech chunks including a first speech chunk comprising data corresponding to at least two words spoken by the user; converting, by performing automatic speech recognition, the plurality of speech chunks to a textual representation comprising a plurality of text chunks, each of the plurality of speech chunks corresponding to a respective one of the plurality of text chunks, the plurality of text chunks comprising a first text chunk corresponding to the first speech chunk and comprising at least two recognized words corresponding to the at least two words; and for each text chunk in the plurality of text chunks: automatically designating the text chunk of the plurality of text chunks as an active text chunk, whenever the text chunk corresponds to a last speech chunk input by the user; and providing a visual presentation of the active text chunk and at least one other text chunk in the plurality of text chunks such that the active text chunk is visually presented differently than the at least one other text chunk to assist the user, when presented, in proofing the textual representation. 11. The system of claim 10 , wherein the at least one computer hardware processor is configured to designate another of the plurality of text chunks as the active text chunk in response to user input indicating that the user would like to select a different one of the plurality of text chunks to be the active text chunk, and modifying the visual presentation to highlight the newly designated active text chunk. 12. The system of claim 10 , wherein the at least one computer hardware processor is configured to remove at least a portion of the active text chunk from the textual representation in response to receiving an indication from the user to delete the at least a portion of the active text chunk. 13. The system of claim 10 , wherein the at least one computer hardware processor is configured to replace at least a portion of the active text chunk in response to receiving user input to replace at least a portion of the active text chunk with different text converted from further data input from the user. 14. The system of claim 10 , wherein the at least one computer hardware processor is configured to generate a visual presentation of each of the plurality of text chunks. 15. The system of claim 14 , further comprising at least one display coupled to the at least one computer hardware processor to display the visual presentation to the user. 16. The system of claim 10 , wherein the textual representation is formed, at least in part, of a plurality of words, and wherein the at least one hardware processor is configured to designate one of the plurality of words as an active word in response to a user selecting a word mode, designate another of the plurality of words as the active word in response to user input indicating that the user would like to select a different one of the plurality of words to be the active word, and modify the visual presentation to highlight the newly designated active word. 17. The system of claim 10 , wherein the textual representation is formed, at least in part, of a plurality of characters, and wherein the at least one hardware processor is configured
Constructional details of speech recognition systems · CPC title
Assessment or evaluation of speech recognition systems · CPC title
using statistical models, e.g. Hidden Markov Models [HMMs] (G10L15/18 takes precedence) · CPC title
Parsing for meaning understanding · CPC title
Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids (G10L15/26 takes precedence) · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.