Method for providing context-based correction of voice recognition results

US9448991B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9448991-B2
Application numberUS-201414218608-A
CountryUS
Kind codeB2
Filing dateMar 18, 2014
Priority dateMar 18, 2014
Publication dateSep 20, 2016
Grant dateSep 20, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Context-based corrections of voice recognition results are provided by displaying text-based result from a speech-to-text conversion operation on a display screen of an electronic client device. One or more element categories associated with corresponding portions of the text-based result are identified. Graphical icons corresponding to the element categories are also displayed on the display in areas where the corresponding portions of the text-based result are also displayed. A user selection of one of the graphical icons is then detected, and an edit operation is enabled for the portion of the text-based result associated with the selected graphical icon. An updated version of the text-based results is then displayed on the display.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for providing context-based corrections of voice recognition results using an electronic client device, the method comprising the acts of: storing a plurality of element categories in a memory of the electronic client device; storing, in the memory, a plurality of recognized words and phrases corresponding to the plurality of element categories; displaying, by the electronic client device on a display, a text-based result from a speech-to-text conversion operation, wherein the result comprises a plurality of words; identifying, by the electronic client device, an element category associated with a portion of the text-based result, wherein said identifying the element category comprises the acts of: parsing the plurality of words to determine if at least one of the plurality of recognized words and phrases is contained in the text-based result, performing a lookup operation of the plurality of element categories using an identified one of the plurality of recognized words and phrases from said parsing, and identifying said element category from among the plurality of element categories in response to the lookup operation producing a match between one of the plurality of element categories and the identified one of the plurality of recognized words and phrases; displaying, by the electronic client device on the display, a graphical icon corresponding to the element category in an area of the display corresponding to where the portion of the text-based result is also displayed; detecting a user selection, by the electronic client device, of the graphical icon; enabling, by the electronic client device and in response to the user selection, an edit operation to be performed on the portion of the text-based result associated with the graphical icon; and displaying, by the electronic client device on the display, an updated version of the text-based result based on said edit operation. 2. The method of claim 1 , wherein displaying, by the client device on the display, the graphical icon corresponding to the element category comprises displaying the graphical icon and the portion of the text-based result in an at least partially overlapping manner on the display of the client device. 3. The method of claim 1 , wherein detecting the user selection comprises detecting a finger gesture on the display in an area corresponding to where the graphical icon is displayed. 4. The method of claim 1 , wherein enabling the edit operation comprises activating an area of the display for text entry, wherein the area corresponds to where the portion of the text-based result is displayed. 5. The method of claim 4 , wherein during the edit operation, the plurality of words of the text-based results that are outside the portion of the text-based result are uneditable. 6. The method of claim 1 , further comprising transmitting, by the client device over a network, an electronic message comprising the updated version of the text-based result. 7. An electronic client device configured to enable context-based corrections of voice recognition results, the electronic device comprising: a display; a memory containing processor-executable instructions for enabling context-based corrections of voice recognition results; and a processor electrically coupled to the display and to the memory, the processor configured to execute the processor-executable instructions to: store, in the memory, a plurality of element categories; store, in the memory, a plurality of recognized words and phrases corresponding to the plurality of element categories; display, on the display, a text-based result from a speech-to-text conversion operation, wherein the result comprises a plurality of words, identify an element category associated with a portion of the text-based result, display, on the display, a graphical icon corresponding to the element category in an area of the display corresponding to where the portion of the text-based result is also displayed, detect a user selection of the graphical icon, enable, in response to the user selection, an edit operation to be performed on the portion of the text-based result associated with the graphical icon, and display, on the display, an updated version of the text-based based on said edit operation, wherein the processor is configured to execute the processor-executable instructions to identify the element category by further executing processor-executable instructions to: parse the plurality of words to determine if at least one of the plurality of recognized words and phrases is contained in the text-based result, perform a lookup operation of the plurality of element categories using an identified one of the plurality of recognized words and phrases from said parsing, and identify said element category from among the plurality of element categories in response to the lookup operation producing a match between one of the plurality of element categories and the identified one of the plurality of recognized words and phrases. 8. The electronic client device of claim 7 , wherein the processor is configured to execute the processor-executable instructions to display the graphical icon by displaying the graphical icon and the portion of the text-based result in an at least partially overlapping manner on the display. 9. The electronic client device of claim 7 , wherein the display is a touchscreen display and the processor is further configured to execute the processor-executable instructions to detect the user selection by detecting a finger gesture on the touchscreen display in an area corresponding to where the graphical icon is displayed. 10. The electronic client device of claim 7 , wherein the display is a touchscreen display and the processor is further configured to execute the processor-executable instructions to enable the edit operation by activating an area of the touchscreen display for text entry, wherein the area corresponds to where the portion of the text-based result is displayed. 11. The electronic client device of claim 10 , wherein during the edit operation, the plurality of words of the text-based results that are outside the portion of the text-based result are uneditable. 12. The electronic client device of claim 7 , wherein the processor is further configured to execute the processor-executable instructions to transmit, over a network, an electronic message comprising the updated version of the text-based result. 13. A computer program product, comprising: a processor readable medium having processor executable code embodied therein to enable context-based corrections of voice recognition results using an electronic client device, the processor readable medium having: processor executable program code to store, in a memory of the electronic client device, a plurality of element categories; processor executable program code to store, in the memory, a plurality of recognized words and phrases corresponding to the plurality of element categories; processor executable program code to display, on a display of the electronic client device, a text-based result from a speech-to-text conversion operation, wherein the result comprises a plurality of words, processor executable program code to identify an element category associated with a portion of the text-based result, processor executable program code to display, on the display, a graphical icon corresponding to the element category in an area of the display corresponding to where the portion of the text-based result is also displayed, processor executable program code to detect a user selection of the graphical icon, processor e

Assignees

Inventors

Classifications

  • Speech recognition (G10L17/00 takes precedence) · CPC title

  • Parsing for meaning understanding · CPC title

  • Speech to text systems (G10L15/08 takes precedence) · CPC title

  • G10L15/22Primary

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • G06F40/232Primary

    Orthographic correction, e.g. spell checking or vowelisation · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9448991B2 cover?
Context-based corrections of voice recognition results are provided by displaying text-based result from a speech-to-text conversion operation on a display screen of an electronic client device. One or more element categories associated with corresponding portions of the text-based result are identified. Graphical icons corresponding to the element categories are also displayed on the display i…
Who is the assignee on this patent?
Bayerische Motoren Werke Ag
What technology area does this patent fall under?
Primary CPC classification G10L15/22. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 20 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).