Methods and apparatus for proofing of a text input

US9236045B2 · US · B2

Patent metadata
Field: Value
Publication number: US-9236045-B2
Application number: US-201213478930-A
Country: US
Kind code: B2
Filing date: May 23, 2012
Priority date: May 23, 2011
Publication date: Jan 12, 2016
Grant date: Jan 12, 2016

Abstract

Official abstract text for this publication.

Techniques for presenting data input as a plurality of data chunks including a first data chunk and a second data chunk. The techniques include converting the plurality of data chunks to a textual representation comprising a plurality of text chunks including a first text chunk corresponding to the first data chunk and a second text chunk corresponding to the second data chunk, respectively, and providing a presentation of at least part of the textual representation such that the first text chunk is presented differently than the second text chunk to, when presented, assist a user in proofing the textual representation.
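The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `TextChunk` class, the `convert_chunks` and `render` functions, the stand-in recognizer, and the square-bracket highlighting are all assumptions chosen for illustration. It only shows one data chunk mapping to one text chunk, with one chunk presented differently from the rest.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class TextChunk:
    """Text converted from one data chunk (hypothetical structure)."""
    text: str


def convert_chunks(data_chunks: List[bytes],
                   recognize: Callable[[bytes], str]) -> List[TextChunk]:
    """Convert each data chunk to exactly one text chunk, preserving order."""
    return [TextChunk(recognize(chunk)) for chunk in data_chunks]


def render(chunks: List[TextChunk], active_index: int) -> str:
    """Present one chunk differently from the others (here: square brackets)."""
    parts = []
    for i, chunk in enumerate(chunks):
        parts.append(f"[{chunk.text}]" if i == active_index else chunk.text)
    return " ".join(parts)


def fake_recognizer(data: bytes) -> str:
    """Stand-in for a recognizer; a real system would run speech recognition."""
    return data.decode("utf-8")


chunks = convert_chunks([b"please set up", b"a meeting tomorrow"], fake_recognizer)
# Highlight the most recently entered chunk.
presentation = render(chunks, active_index=len(chunks) - 1)
print(presentation)  # please set up [a meeting tomorrow]
```

In a display-based system the brackets would likely be replaced by a visual cue such as color or underlining; the abstract only requires that the chunks be presented differently.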

First claim

Opening claim text (preview).

What is claimed is:

1. A method for assisting a user verify accuracy of and/or correct text obtained by performing automatic speech recognition on speech input by the user, the method comprising: using at least one computer hardware processor to perform: receiving speech input by the user over a course of multiple user turns as a plurality of speech chunks, each of the plurality of speech chunks comprising speech spoken by the user during a respective single user turn, the plurality of speech chunks including a first speech chunk comprising data corresponding to at least two words spoken by the user; converting, by performing automatic speech recognition, the plurality of speech chunks to a textual representation comprising a plurality of text chunks, each of the plurality of speech chunks corresponding to a respective one of the plurality of text chunks, the plurality of text chunks comprising a first text chunk corresponding to the first speech chunk and comprising at least two recognized words corresponding to the at least two words; and for each text chunk in the plurality of text chunks: automatically designating the text chunk of the plurality of text chunks as an active text chunk, whenever the text chunk corresponds to a last speech chunk input by the user; and providing a visual presentation of the active text chunk and at least one other text chunk in the plurality of text chunks such that the active text chunk is visually presented differently than the at least one other text chunk to assist the user, when presented, in proofing the textual representation.

2. The method of claim 1, further comprising: designating another of the plurality of text chunks as the active text chunk in response to user input indicating that the user would like to select a different one of the plurality of text chunks to be the active text chunk; and modifying the visual presentation to highlight the newly designated active text chunk.

3. The method of claim 1, further comprising deleting at least a portion of the active text chunk from the textual representation in response to receiving user input to delete the at least a portion of the active text chunk.

4. The method of claim 1, further comprising replacing at least a portion of the active text chunk with different text converted from further speech input from the user in response to receiving user input to replace the at least a portion of the active text chunk.

5. The method of claim 1, wherein the visual presentation includes a visual presentation of each of the plurality of text chunks.

6. The method of claim 5, further comprising visually rendering the visual presentation to the user via a display.

7. The method of claim 1, wherein the textual representation is formed, at least in part, of a plurality of words, the method further comprising: designating one of the plurality of words as an active word in response to a user selecting a word mode; designating another of the plurality of words as the active word in response to user input indicating that the user would like to select a different one of the plurality of words to be the active word; and modifying the visual presentation to highlight the newly designated active word.

8. The method of claim 1, wherein the textual representation is formed, at least in part, of a plurality of characters, the method further comprising: designating one of the plurality of characters as an active character in response to a user selecting a character mode; designating another of the plurality of characters as the active character in response to user input indicating that the user would like to select a different one of the plurality of characters to be the active character; and modifying the visual presentation to highlight the newly designated active character.

9. The method of claim 1, wherein the active text chunk comprises at least two words.

10. A system for assisting a user verify accuracy of and/or correct text obtained by performing automatic speech recognition on speech input by the user, the system comprising: at least one computer hardware processor configured to perform: receiving speech input by the user over a course of multiple user turns as a plurality of speech chunks, each of the plurality of speech chunks comprising speech spoken by the user during a respective single user turn, the plurality of speech chunks including a first speech chunk comprising data corresponding to at least two words spoken by the user; converting, by performing automatic speech recognition, the plurality of speech chunks to a textual representation comprising a plurality of text chunks, each of the plurality of speech chunks corresponding to a respective one of the plurality of text chunks, the plurality of text chunks comprising a first text chunk corresponding to the first speech chunk and comprising at least two recognized words corresponding to the at least two words; and for each text chunk in the plurality of text chunks: automatically designating the text chunk of the plurality of text chunks as an active text chunk, whenever the text chunk corresponds to a last speech chunk input by the user; and providing a visual presentation of the active text chunk and at least one other text chunk in the plurality of text chunks such that the active text chunk is visually presented differently than the at least one other text chunk to assist the user, when presented, in proofing the textual representation.

11. The system of claim 10, wherein the at least one computer hardware processor is configured to designate another of the plurality of text chunks as the active text chunk in response to user input indicating that the user would like to select a different one of the plurality of text chunks to be the active text chunk, and modifying the visual presentation to highlight the newly designated active text chunk.

12. The system of claim 10, wherein the at least one computer hardware processor is configured to remove at least a portion of the active text chunk from the textual representation in response to receiving an indication from the user to delete the at least a portion of the active text chunk.

13. The system of claim 10, wherein the at least one computer hardware processor is configured to replace at least a portion of the active text chunk in response to receiving user input to replace at least a portion of the active text chunk with different text converted from further data input from the user.

14. The system of claim 10, wherein the at least one computer hardware processor is configured to generate a visual presentation of each of the plurality of text chunks.

15. The system of claim 14, further comprising at least one display coupled to the at least one computer hardware processor to display the visual presentation to the user.

16. The system of claim 10, wherein the textual representation is formed, at least in part, of a plurality of words, and wherein the at least one hardware processor is configured to designate one of the plurality of words as an active word in response to a user selecting a word mode, designate another of the plurality of words as the active word in response to user input indicating that the user would like to select a different one of the plurality of words to be the active word, and modify the visual presentation to highlight the newly designated active word.

17. The system of claim 10, wherein the textual representation is formed, at least in part, of a plurality of characters, and wherein the at least one hardware processor is configured
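As a rough illustration of how the active-chunk behavior in claims 1 through 4 might fit together, here is a small Python sketch. The `ProofingSession` class and its method names are hypothetical and do not come from the patent; the sketch handles only whole-chunk deletion and replacement (the claims also cover portions of a chunk), takes already-recognized text as input instead of speech, and uses square brackets as a stand-in for visual highlighting.

```python
from typing import List


class ProofingSession:
    """Hypothetical model of the claimed proofing flow (illustration only)."""

    def __init__(self) -> None:
        self.chunks: List[str] = []  # one text chunk per user turn
        self.active = -1             # index of the active chunk, -1 if none

    def add_turn(self, recognized_text: str) -> None:
        # Claim 1: the chunk for the last speech input becomes active.
        self.chunks.append(recognized_text)
        self.active = len(self.chunks) - 1

    def select(self, index: int) -> None:
        # Claim 2: the user designates a different chunk as active.
        if not 0 <= index < len(self.chunks):
            raise IndexError("no such chunk")
        self.active = index

    def delete_active(self) -> None:
        # Claim 3, whole-chunk case: remove the active chunk.
        if self.active < 0:
            return
        del self.chunks[self.active]
        self.active = min(self.active, len(self.chunks) - 1)

    def replace_active(self, new_text: str) -> None:
        # Claim 4, whole-chunk case: swap in text from a further user turn.
        if self.active < 0:
            raise IndexError("no active chunk")
        self.chunks[self.active] = new_text

    def render(self) -> str:
        # The active chunk is presented differently (brackets) from the rest.
        return " ".join(
            f"[{c}]" if i == self.active else c
            for i, c in enumerate(self.chunks)
        )


session = ProofingSession()
session.add_turn("send a message")
session.add_turn("to john smith")
after_turns = session.render()    # send a message [to john smith]

session.select(0)
session.replace_active("send an email")
after_edit = session.render()     # [send an email] to john smith

session.delete_active()
after_delete = session.render()   # [to john smith]
print(after_turns, after_edit, after_delete, sep="\n")
```

The word and character modes of claims 7, 8, and 16 could be modeled the same way by tracking a second index into the words or characters of the text, which this sketch leaves out.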

Classifications

  • Constructional details of speech recognition systems · CPC title

  • Assessment or evaluation of speech recognition systems · CPC title

  • using statistical models, e.g. Hidden Markov Models [HMMs] (G10L15/18 takes precedence) · CPC title

  • Parsing for meaning understanding · CPC title

  • Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids (G10L15/26 takes precedence) · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9236045B2 cover?
Techniques for presenting data input as a plurality of data chunks including a first data chunk and a second data chunk. The techniques include converting the plurality of data chunks to a textual representation comprising a plurality of text chunks including a first text chunk corresponding to the first data chunk and a second text chunk corresponding to the second data chunk, respectively, an…
Who is the assignee on this patent?
Labsky Martin, Kleindienst Jan, Macek Tomas, and 5 more
What technology area does this patent fall under?
Primary CPC classification G10L13/08. Mapped technology areas include Physics.
When was this patent published?
Publication date Jan 12, 2016 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).