Methods and apparatus for detecting a voice command

US9361885B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9361885-B2
Application numberUS-201313795960-A
CountryUS
Kind codeB2
Filing dateMar 12, 2013
Priority dateMar 12, 2013
Publication dateJun 7, 2016
Grant dateJun 7, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Some aspects include a method of monitoring an acoustic environment of a mobile device operating in a low power mode, the mobile device having a first and second processor, the method comprises receiving acoustic input while the mobile device is operating in the low power mode, performing at least one first processing stage on the acoustic input using the first processor, prior to engaging the second processor, to evaluate whether the acoustic input includes a voice command, performing at least one second processing stage on the acoustic input using the second processor to evaluate whether the acoustic input includes a voice command if further processing is needed to determine whether the acoustic input includes a voice command, and initiating responding to the voice command when either the at least one first processing stage or the at least one second processing stage determines that the acoustic input includes a voice command.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of monitoring an acoustic environment of a mobile device for voice commands when the mobile device is operating in an idle mode, the mobile device having a first processor and a second processor, the method comprising: receiving acoustic input while the mobile device is operating in the idle mode; performing at least one first processing stage on the acoustic input using the first processor, prior to engaging the second processor to process the acoustic input, to evaluate whether the acoustic input includes a voice command; performing at least one second processing stage on the acoustic input using the second processor to evaluate whether the acoustic input includes a voice command if further processing is needed to determine whether the acoustic input includes a voice command; wherein performing the at least one first processing stage or the at least one second processing stage includes, while the mobile device is operating in the idle mode, transmitting at least a portion of the acoustic input to at least one server via a network for processing by the at least one server at least to evaluate whether the acoustic input includes a voice command; and initiating responding to the voice command when either the at least one first processing stage or the at least one second processing stage determines that the acoustic input includes a voice command. 2. The method of claim 1 , wherein the first processor or the second processor is located at the at least one server. 3. The method of claim 1 , wherein the first processor is a digital signal processor configured to operate at a lower power than the second processor. 4. The method of claim 1 , wherein performing the at least one first processing stage includes performing at least one voice activity detection processing stage including performing at least one of spectral analysis on the acoustic input to evaluate whether the spectrum of the acoustic input is indicative of voice activity, periodicity analysis to evaluate whether the signal periodicity is indicative of voice activity, and phone loop analysis to evaluate whether the acoustic input includes speech. 5. The method of claim 1 , wherein performing the at least one first processing stage comprises performing limited vocabulary automatic speech recognition on the acoustic input to detect whether the acoustic input includes an explicit voice trigger. 6. The method of claim 1 , wherein performing the at least one second processing stage comprises performing at least one speech processing stage including at least one of automatic speech recognition on the acoustic input, at least one classification processing stage, and at least one natural language processing stage to understand the meaning of the voice command. 7. The method of claim 6 , wherein performing the at least one second processing stage comprises performing limited vocabulary automatic speech recognition on the acoustic input to detect whether the acoustic input includes an explicit voice trigger. 8. The method of claim 1 , further comprising providing, prior to activating the second processor or turning on a display of the mobile device, at least one indication to the user to confirm that the acoustic input was received and is being processed. 9. The method of claim 1 , further comprising transitioning the mobile device to an active mode to respond to the voice command when either the at least one first processing stage or the at least one second processing stage determines that the acoustic input includes a voice command. 10. The method of claim 1 , wherein at least one task specified by the voice command is performed with the mobile device remaining in the idle mode. 11. The method of claim 1 , further comprising performing one or more default actions when a specific task could not be ascertained from the acoustic input. 12. The method of claim 1 , wherein performing the at least one first processing stage comprises evaluating at least one contextual cue selected from a group consisting of: time of day, motion of the mobile device, location of the mobile device, an occurrence of an event on the mobile device, recent activity on the mobile device and state of a battery powering the mobile device. 13. The method of claim 1 , wherein performing the at least one second processing stage comprises evaluating at least one contextual cue selected from a group consisting of: time of day, motion of the mobile device, location of the mobile device, an occurrence of an event on the mobile device, recent activity on the mobile device and state of a battery powering the mobile device. 14. A mobile device comprising: at least one input to receive acoustic input from an environment of the mobile device while the mobile device is operating in an idle mode; a first processor to perform at least one first processing stage on the acoustic input to evaluate whether the acoustic input includes a voice command; a second processor to perform at least one second processing stage on the acoustic input to evaluate whether the acoustic input includes a voice command, and wherein the mobile device is configured to provide, prior to waking up the mobile device, at least one indication to the user to confirm that the acoustic input was received and is being processed, wherein the at least one first processing stage is performed prior to engaging the second processor, wherein the at least one second processing stage is performed if further processing is needed to determine whether the acoustic input includes a voice command, and wherein response to the voice command is initiated when either the at least one first processing stage or the at least one second processing stage determines that the acoustic input includes a voice command. 15. The mobile device of claim 14 , wherein the at least one indication to the user includes illuminating one or more light emitting diodes of the mobile device. 16. The mobile device of claim 14 , wherein the at least one low power indication includes at least one audio signal indicating that the acoustic input was received and is being processed. 17. The mobile device of claim 14 , wherein performing the at least one first processing stage or the at least one second processing stage includes, while the mobile device is operating in the idle mode, transmitting at least a portion of the acoustic input to at least one server via a network for processing by the at least one server at least to evaluate whether the acoustic input includes a voice command.

Assignees

Inventors

Classifications

  • G10L15/22Primary

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems · CPC title

  • Execution procedure of a spoken command · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9361885B2 cover?
Some aspects include a method of monitoring an acoustic environment of a mobile device operating in a low power mode, the mobile device having a first and second processor, the method comprises receiving acoustic input while the mobile device is operating in the low power mode, performing at least one first processing stage on the acoustic input using the first processor, prior to engaging the …
Who is the assignee on this patent?
Nuance Communications Inc
What technology area does this patent fall under?
Primary CPC classification G10L15/22. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 07 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).