Task flow identification based on user intent
US-10706841-B2 · Jul 7, 2020 · US
US11626116B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11626116-B2 |
| Application number | US-202016775228-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jan 28, 2020 |
| Priority date | Dec 17, 2013 |
| Publication date | Apr 11, 2023 |
| Grant date | Apr 11, 2023 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A speech-based system includes a local device in a user premises and a network-based control service that directs the local device to perform actions for a user. The control service may specify a first action that is to be performed upon detection by the local device of a stimulus. In some cases, performing the first action may rely on the availability of network communications with the control service or with another service. In these cases, the control service also specifies a second, fallback action that does not rely upon network communications. Upon detecting the stimulus, the local device performs the first action if network communications are available. If network communications are not available, the local device performs the second, fallback action.
Opening claim text (preview).
What is claimed is: 1. A device comprising: one or more network communications interfaces; one or more microphones; one or more processors; and one or more non-transitory computer-readable media storing computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform acts comprising: generating, using the one or more microphones, audio data based at least on speech captured by the one or more microphones; transmitting, via the one or more network communications interfaces, the audio data to one or more network-based computing devices for automated speech recognition of the speech; and receiving, via the one or more network communications interfaces and from the one or more network-based computing devices, a command causing the device to perform one or more actions at a specified time, wherein the one or more actions comprise transmitting a request for content to a content provider. 2. The device of claim 1 , wherein the transmitting comprises transmitting the audio data to the one or more network-based computing devices for causing the network-based computing devices to perform the automated speech recognition by: generating first text data corresponding to the audio data; and analyzing the first text data to determine a voice command included in the speech. 3. The device of claim 1 , wherein the content provider comprises a network source. 4. The device of claim 1 , wherein the command includes a specified uniform resource locator (URL) associated with the content provider. 5. The device of claim 1 , wherein the one or more non-transitory computer-readable media further store computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform acts comprising: determining occurrence of the specified time; and performing the one or more actions. 6. The device of claim 1 , wherein the audio data specifies the specified time. 7. The device of claim 1 , further comprising a speaker, and wherein the content comprises at least one of an audible alert, music, or synthesized speech. 8. The device of claim 1 , further comprising a speaker, wherein the one or more actions further comprise outputting the content. 9. The device of claim 1 , the acts further comprising: receiving, from the one or more network-based computing devices, data generated by the one or more network-based computing devices via the automated speech recognition of the audio data; receiving user input indicating that the data accurately represents the speech; and transmitting, via the one or more network communications interfaces, a confirmation to the one or more network-based computing devices that the data accurately represents the speech. 10. The device as claim 1 recites, wherein the specified time comprises a specific point in time. 11. The device as claim 1 recites, wherein the content comprises dynamically updated content based at least in part on the specified time. 12. The device as claim 1 recites, wherein the content comprises at least one of audio content or visual content. 13. A device comprising: one or more network communications interfaces; one or more microphones; one or more processors; and one or more non-transitory computer-readable media storing computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform acts comprising: generating, using the one or more microphones, audio data based at least on speech captured by the one or more microphones; transmitting, via the one or more network communications interfaces, the audio data to one or more network-based computing devices for automated speech recognition of the speech; and receiving, via the one or more network communications interfaces and from the one or more network-based computing devices, a command causing the device to perform an action in response to detecting a specified condition, wherein the action comprises transmitting a request for content to a content provider. 14. The device of claim 13 , wherein the one or more non-transitory computer-readable media further store computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform acts comprising: determining that the specified condition is met; and performing the action. 15. The device of claim 13 , wherein the command further instructs the device to perform an alternate action based at least in part on one or more of the device being unable to perform the action or the device being unable to connect to another device. 16. The device of claim 13 , wherein the specified condition comprises at least one of occurrence of a time of day, occurrence of an environmental condition, or receipt of an error message. 17. The device of claim 13 , wherein the command further instructs the device to output an audible alert, send an email, send a text message, or initiate a telephone call in response to detecting the specified condition. 18. The device of claim 13 , wherein the audio data comprises first audio data and wherein the command further instructs the device to perform an alternate action based at least in part on the device being unable to perform the action, the alternate action comprising outputting locally available second audio data comprising at least one of music, synthesized speech, a cached report, or an error message. 19. The device of claim 13 , wherein the audio data comprises first audio data and wherein the command further instructs the device to output locally stored second audio data corresponding to an error message based at least in part on the device being unable to perform the action, the error message being at least one of specific to a current error associated with the device being unable to perform the action or general to multiple errors. 20. The device of claim 13 , wherein the command further instructs the device to perform an alternate action based at least in part on the device being unable to perform the action, the alternate action comprising at least one of monitoring local sensor data, caching monitored data, controlling a local device, or outputting a notification. 21. A method comprising: generating, using one or more microphones of an audio device, audio data based at least on speech captured by the one or more microphones; transmitting, via one or more network communications interfaces of the audio device, the audio data to one or more network-based computing devices for automated speech recognition of the speech; and receiving, via the one or more network communications interfaces and from the one or more network-based computing devices, a command instructing the audio device to perform an action at a specified time, wherein the action comprises transmitting a request for content to a content provider. 22. A method comprising: generating, using one or more microphones of an audio device, first audio data based at least on speech captured by the one or more microphones and corresponding to the speech; transmitting, via one or more network communications interfaces of the audio device, the first audio data to one or more network-based computing devices for automated speech recognition of the speech; and receiving, via the one or more network communications interfaces and from the one or more network-based computing devices, a command instructing the audio device to pe
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title
Execution procedure of a spoken command · CPC title
based on user interaction within the home (receiver circuitry for displaying additional information being controlled by a remote control apparatus H04N21/42204) · CPC title
Distributed recognition, e.g. in client-server systems, for mobile phones or network applications · CPC title
Reporting to a device located outside the home and the home network (access arrangements H04L12/2856; for remote control or remote monitoring of applications H04L67/025; telephonic communication systems adapted for combination with telemetering systems H04M11/002) · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.