Speech-Controlled Actions Based on Keywords and Context Thereof

US2016379633A1 · US · A1

Patent metadata
Field: Value
Publication number: US-2016379633-A1
Application number: US-201514754457-A
Country: US
Kind code: A1
Filing date: Jun 29, 2015
Priority date: Jun 29, 2015
Publication date: Dec 29, 2016
Grant date: (none listed)

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A device includes a plurality of components, a memory having a keyword recognition module and a context recognition module, a microphone configured to receive an input speech spoken by a user, an analog-to-digital converter configured to convert the input speech from an analog form to a digital form and generate a digitized speech, and a processor. The processor is configured to detect, using the keyword recognition module, a keyword in the digitized speech, initiate, in response to detecting the keyword by the keyword recognition module, an action to be taken by one of the plurality of components, wherein the keyword is associated with the action, determine, using the context recognition module, a context for the keyword, and execute the action if the context determined by the context recognition module indicates that the keyword is a command.
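The control flow in the abstract (detect a keyword, initiate the associated action, determine context, execute only if the keyword is a command) can be sketched as follows. This is a minimal illustration, not the patent's implementation: it works on a text transcript instead of digitized audio, and the `ACTIONS` table, the function names, and the "keyword is the whole utterance" context rule are all hypothetical assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional


@dataclass
class PendingAction:
    """An action that has been initiated but not yet executed."""
    keyword: str
    run: Callable[[], str]


# Hypothetical keyword-to-action table; each action stands in for one component.
ACTIONS: Dict[str, Callable[[], str]] = {
    "lights on": lambda: "lamp: on",
    "play music": lambda: "speaker: playing",
}


def detect_keyword(transcript: str) -> Optional[str]:
    """Stand-in for the keyword recognition module: substring spotting."""
    text = transcript.lower()
    for keyword in ACTIONS:
        if keyword in text:
            return keyword
    return None


def keyword_is_command(transcript: str, keyword: str) -> bool:
    """Stand-in for the context recognition module.

    Toy rule (an assumption): the keyword counts as a command only when it
    is the whole utterance, ignoring punctuation and the word "please".
    """
    words = [w.strip(".,!?") for w in transcript.lower().split()]
    words = [w for w in words if w and w != "please"]
    return " ".join(words) == keyword


def handle_utterance(transcript: str) -> Optional[str]:
    keyword = detect_keyword(transcript)
    if keyword is None:
        return None
    # Initiate the action as soon as the keyword is detected...
    pending = PendingAction(keyword, ACTIONS[keyword])
    # ...but execute it only if the context says the keyword is a command.
    if keyword_is_command(transcript, keyword):
        return pending.run()
    return None
```

With these assumptions, `handle_utterance("Lights on, please")` runs the lamp action, while `handle_utterance("I left the lights on again")` detects the same keyword but discards the pending action because the surrounding words indicate ordinary conversation.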

First claim

Opening claim text (preview).

What is claimed is:
1. A device comprising: a plurality of components; a memory including a keyword recognition module and a context recognition module; a microphone configured to receive an input speech spoken by a user; an analog-to-digital converter configured to convert the input speech from an analog form to a digital form and generate a digitized speech; a processor configured to: detect, using the keyword recognition module, a keyword in the digitized speech; initiate, in response to detecting the keyword by the keyword recognition module, an action to be taken by one of the plurality of components, wherein the keyword is associated with the action; determine, using the context recognition module, a context for the keyword; and execute the action if the context determined by the context recognition module indicates that the keyword is a command.
2. The device of claim 1, wherein the context recognition module utilizes a voice activity detector to determine the context.
3. The device of claim 1, wherein the keyword recognition module is continuously listening for the keyword and the context recognition module is configured to begin listening when the input speech is received from the microphone.
4. The device of claim 1, wherein the processor is configured to: prior to determining the context of the keyword, receive one or more second inputs from the user; and analyze the context of the keyword based on the one or more second inputs.
5. The device of claim 4, wherein the one or more second inputs is a non-verbal input including a physical gesture.
6. The device of claim 4, wherein the one or more second inputs are received from one of a motion sensor and a video camera.
7. The device of claim 1, wherein the context recognition module determines that the keyword is in the digitized speech based on an indication received from the keyword recognition module.
8. The device of claim 1, wherein the context of the keyword includes a location of the user.
9. The device of claim 1, wherein the processor is further configured to: display a result of executing the action on a display.
10. The device of claim 1, wherein executing the action includes operating an appliance.
11. A method for speech recognition by a device having a microphone, a processor, and a memory including a keyword recognition module and a context recognition module, the method comprising: detecting, using the keyword recognition module, a keyword in a digitized speech; initiating, in response to detecting the keyword by the keyword recognition module, an action to be taken by one of the plurality of components, wherein the keyword is associated with the action; determining, using the context recognition module, a context for the keyword; and executing the action if the context determined by the context recognition module indicates that the keyword is a command.
12. The method of claim 11, wherein the context recognition module utilizes a voice activity detector to determine the context.
13. The method of claim 11, wherein the keyword recognition module is continuously listening for the keyword and the context recognition module is configured to begin listening when the input speech is received from the microphone.
14. The method of claim 11, further comprising: prior to determining the context of the keyword, receiving one or more second inputs from the user; and analyzing the context of the keyword based on the one or more second inputs.
15. The method of claim 14, wherein the one or more second inputs include a non-verbal input including a physical gesture.
16. The method of claim 14, wherein the one or more second inputs are received from one of a motion sensor and a video camera.
17. The method of claim 11, wherein the context recognition module determines that the keyword is in the digitized speech based on an indication received from the keyword recognition module.
18. The method of claim 11, wherein the context of the keyword includes a location of a user.
19. The method of claim 11, further comprising: displaying a result of executing the action on a display.
20. The method of claim 11, wherein executing the action includes operating an appliance.
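The dependent claims name several context sources: a voice activity detector (claims 2 and 12), second inputs such as a physical gesture from a motion sensor or video camera (claims 4 to 6 and 14 to 16), and the user's location (claims 8 and 18). The claims do not say how these signals are combined, so the sketch below is only one possible reading. The energy threshold, the "keyword surrounded by silence" rule, the gesture label `point_at_device`, the allowed-location list, and the priority order are all assumptions introduced for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence, Tuple


def frame_is_speech(frame: Sequence[float], threshold: float = 0.01) -> bool:
    """Simple energy-based voice activity check (assumed, not from the claims)."""
    if not frame:
        return False
    energy = sum(s * s for s in frame) / len(frame)
    return energy > threshold


@dataclass
class ContextSignals:
    frames: List[List[float]]       # digitized speech split into frames
    keyword_start: int              # index of the first keyword frame
    keyword_end: int                # index one past the last keyword frame
    gesture: Optional[str] = None   # hypothetical label from a motion sensor or camera
    location: Optional[str] = None  # hypothetical label for where the user is


def is_command(
    sig: ContextSignals,
    guard_frames: int = 2,
    allowed_locations: Tuple[str, ...] = ("living room",),
) -> bool:
    """Decide whether a detected keyword should be treated as a command."""
    # Assumed rule: a known location outside the allowed list vetoes the command.
    if sig.location is not None and sig.location not in allowed_locations:
        return False
    # Assumed rule: an explicit gesture toward the device confirms the command.
    if sig.gesture == "point_at_device":
        return True
    # Otherwise use the VAD: a keyword with no speech in the frames just
    # before and after it is treated as an isolated command.
    before = sig.frames[max(0, sig.keyword_start - guard_frames):sig.keyword_start]
    after = sig.frames[sig.keyword_end:sig.keyword_end + guard_frames]
    return not any(frame_is_speech(f) for f in before + after)
```

The ordering here (location veto, then gesture, then voice activity) is a design choice for the example only; a real context recognition module could weight or combine these inputs differently.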

Assignees

Disney Entpr Inc

Inventors

Classifications

  • of application context · CPC title

  • Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules · CPC title

  • G10L15/22 (Primary)

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • of the speaker; Human-factor methodology · CPC title

  • Execution procedure of a spoken command · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016379633A1 cover?
A device includes a plurality of components, a memory having a keyword recognition module and a context recognition module, a microphone configured to receive an input speech spoken by a user, an analog-to-digital converter configured to convert the input speech from an analog form to a digital form and generate a digitized speech, and a processor. The processor is configured to detect, using t…
Who is the assignee on this patent?
Disney Entpr Inc
What technology area does this patent fall under?
Primary CPC classification G10L15/22. Mapped technology areas include Physics.
When was this patent published?
Publication date: Dec 29, 2016 (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).