Method and system for operating a multi-function portable electronic device using voice-activation
US-8977255-B2 · Mar 10, 2015 · US
US9361885B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9361885-B2 |
| Application number | US-201313795960-A |
| Country | US |
| Kind code | B2 |
| Filing date | Mar 12, 2013 |
| Priority date | Mar 12, 2013 |
| Publication date | Jun 7, 2016 |
| Grant date | Jun 7, 2016 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Some aspects include a method of monitoring an acoustic environment of a mobile device operating in a low power mode, the mobile device having a first and second processor, the method comprises receiving acoustic input while the mobile device is operating in the low power mode, performing at least one first processing stage on the acoustic input using the first processor, prior to engaging the second processor, to evaluate whether the acoustic input includes a voice command, performing at least one second processing stage on the acoustic input using the second processor to evaluate whether the acoustic input includes a voice command if further processing is needed to determine whether the acoustic input includes a voice command, and initiating responding to the voice command when either the at least one first processing stage or the at least one second processing stage determines that the acoustic input includes a voice command.
Opening claim text (preview).
What is claimed is: 1. A method of monitoring an acoustic environment of a mobile device for voice commands when the mobile device is operating in an idle mode, the mobile device having a first processor and a second processor, the method comprising: receiving acoustic input while the mobile device is operating in the idle mode; performing at least one first processing stage on the acoustic input using the first processor, prior to engaging the second processor to process the acoustic input, to evaluate whether the acoustic input includes a voice command; performing at least one second processing stage on the acoustic input using the second processor to evaluate whether the acoustic input includes a voice command if further processing is needed to determine whether the acoustic input includes a voice command; wherein performing the at least one first processing stage or the at least one second processing stage includes, while the mobile device is operating in the idle mode, transmitting at least a portion of the acoustic input to at least one server via a network for processing by the at least one server at least to evaluate whether the acoustic input includes a voice command; and initiating responding to the voice command when either the at least one first processing stage or the at least one second processing stage determines that the acoustic input includes a voice command. 2. The method of claim 1 , wherein the first processor or the second processor is located at the at least one server. 3. The method of claim 1 , wherein the first processor is a digital signal processor configured to operate at a lower power than the second processor. 4. The method of claim 1 , wherein performing the at least one first processing stage includes performing at least one voice activity detection processing stage including performing at least one of spectral analysis on the acoustic input to evaluate whether the spectrum of the acoustic input is indicative of voice activity, periodicity analysis to evaluate whether the signal periodicity is indicative of voice activity, and phone loop analysis to evaluate whether the acoustic input includes speech. 5. The method of claim 1 , wherein performing the at least one first processing stage comprises performing limited vocabulary automatic speech recognition on the acoustic input to detect whether the acoustic input includes an explicit voice trigger. 6. The method of claim 1 , wherein performing the at least one second processing stage comprises performing at least one speech processing stage including at least one of automatic speech recognition on the acoustic input, at least one classification processing stage, and at least one natural language processing stage to understand the meaning of the voice command. 7. The method of claim 6 , wherein performing the at least one second processing stage comprises performing limited vocabulary automatic speech recognition on the acoustic input to detect whether the acoustic input includes an explicit voice trigger. 8. The method of claim 1 , further comprising providing, prior to activating the second processor or turning on a display of the mobile device, at least one indication to the user to confirm that the acoustic input was received and is being processed. 9. The method of claim 1 , further comprising transitioning the mobile device to an active mode to respond to the voice command when either the at least one first processing stage or the at least one second processing stage determines that the acoustic input includes a voice command. 10. The method of claim 1 , wherein at least one task specified by the voice command is performed with the mobile device remaining in the idle mode. 11. The method of claim 1 , further comprising performing one or more default actions when a specific task could not be ascertained from the acoustic input. 12. The method of claim 1 , wherein performing the at least one first processing stage comprises evaluating at least one contextual cue selected from a group consisting of: time of day, motion of the mobile device, location of the mobile device, an occurrence of an event on the mobile device, recent activity on the mobile device and state of a battery powering the mobile device. 13. The method of claim 1 , wherein performing the at least one second processing stage comprises evaluating at least one contextual cue selected from a group consisting of: time of day, motion of the mobile device, location of the mobile device, an occurrence of an event on the mobile device, recent activity on the mobile device and state of a battery powering the mobile device. 14. A mobile device comprising: at least one input to receive acoustic input from an environment of the mobile device while the mobile device is operating in an idle mode; a first processor to perform at least one first processing stage on the acoustic input to evaluate whether the acoustic input includes a voice command; a second processor to perform at least one second processing stage on the acoustic input to evaluate whether the acoustic input includes a voice command, and wherein the mobile device is configured to provide, prior to waking up the mobile device, at least one indication to the user to confirm that the acoustic input was received and is being processed, wherein the at least one first processing stage is performed prior to engaging the second processor, wherein the at least one second processing stage is performed if further processing is needed to determine whether the acoustic input includes a voice command, and wherein response to the voice command is initiated when either the at least one first processing stage or the at least one second processing stage determines that the acoustic input includes a voice command. 15. The mobile device of claim 14 , wherein the at least one indication to the user includes illuminating one or more light emitting diodes of the mobile device. 16. The mobile device of claim 14 , wherein the at least one low power indication includes at least one audio signal indicating that the acoustic input was received and is being processed. 17. The mobile device of claim 14 , wherein performing the at least one first processing stage or the at least one second processing stage includes, while the mobile device is operating in the idle mode, transmitting at least a portion of the acoustic input to at least one server via a network for processing by the at least one server at least to evaluate whether the acoustic input includes a voice command.
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title
Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems · CPC title
Execution procedure of a spoken command · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.