Context aware voice interface for computing devices
US-2015089373-A1 · Mar 26, 2015 · US
US2017256256A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2017256256-A1 |
| Application number | US-201615057453-A |
| Country | US |
| Kind code | A1 |
| Filing date | Mar 1, 2016 |
| Priority date | Mar 1, 2016 |
| Publication date | Sep 7, 2017 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Methods, systems, and apparatus for receiving, by a voice action system, data specifying a new voice action for an application different from the voice action system. A voice action intent for the application is generated based at least on the data, wherein the voice action intent comprises data that, when received by the application, requests that the application perform one or more operations specified for the new voice action. The voice action intent is associated with trigger terms specified for the new voice action. The voice action system is configured to receive an indication of a user utterance obtained by a device having the application installed, and determines that a transcription of the user utterance corresponds to the trigger terms associated with the voice action intent. In response to the determination, the voice action system provides the voice action intent to the device.
Opening claim text (preview).
What is claimed is: 1 . A computer-implemented method comprising: receiving, by a voice action system, data defining a new voice action that does not currently exist for a software application installed on one or more devices, the software application being different from said voice action system, the data indicating one or more operations for the software application to perform the new voice action and one or more trigger terms for triggering the new voice action; generating, by the voice action system, a voice action passive data structure based at least on the data defining the new voice action, wherein the voice action passive data structure comprises data that, when received by the software application, causes the software application to perform the one or more operations to perform the new voice action; associating, by the voice action system, the voice action passive data structure with the one or more trigger terms for triggering the new voice action; receiving, by the voice action system, a user command utterance obtained by a user device, the user device having the software application installed; determining, by the voice action system, that a transcription of the user command utterance corresponds to the one or more trigger terms associated with the voice action passive data structure; and in response to the determination, providing, by the voice action system, the voice action passive data structure to the user device which is remote from the voice action system, thereby causing the software application installed on the user device to perform the one or more operations to perform the new voice action. 2 . The computer-implemented method of claim 1 , wherein the data defining the new voice action specifies a context, the context specifying a status of a user device or of the software application when the new voice action is enabled. 3 . The computer-implemented method of claim 1 , wherein the new voice action is a voice-enabled command that the software application is not programmed to support. 4 . The computer-implemented method of claim 1 , wherein receiving the data defining the new voice action comprises receiving the data from a developer who published the software application. 5 . The computer-implemented method of claim 1 , wherein the voice action system does not receive the data defining the new voice action from the software application installed on the user device. 6 . The computer-implemented method of claim 2 , wherein the context specifies that a specific activity that the software application is performing is in a particular activity state. 7 . The computer-implemented method of claim 2 , comprising: associating, by the voice action system, the voice action passive data structure with the context; receiving, by the voice action system, context information indicating a status of the user device or of the software application installed on the user device; determining, by the voice action system, that the context information satisfies the context; and in response to determining that the transcription of the user command utterance corresponds to the one or more trigger terms associated with the voice action passive data structure and that the context information satisfies the context, providing, by the voice action system, the voice action passive data structure to the user device, thereby causing the software application installed on the user device to perform the one or more operations to perform the new voice action. 8 . The computer-implemented method of claim 7 , wherein receiving the context information indicating the status of the user device or of the software application installed on the user device comprises: providing, by the voice action system to the user device, a request for particular context information; and receiving, by the voice action system, the particular context information in response to the request. 9 . The computer-implemented method of claim 7 , comprising: determining, by the voice action system, that the context information satisfies a context for a second voice action, and that the transcription of the user command utterance corresponds to one or more trigger terms associated with a voice action passive data structure for the second voice action, wherein the voice action passive data structure for the second voice action comprises data that, when received by a software application associated with the second voice action, causes the software application associated with the second voice action to perform one or more operations to perform the second voice action; in response to the determination, selecting, by the voice action system, a voice action from among the new voice action and the second voice action; and providing, by the voice action system, the voice action passive data structure associated with the selected voice action to the user device, thereby causing the software application installed on the user device to perform the one or more operations to perform the selected voice action. 10 . The computer-implemented method of claim 9 , wherein selecting the selected voice action from among the new voice action and the second voice action comprises selecting the selected voice action in response to receiving data indicating a user selection of one of the new voice action or the second voice action. 11 . The computer-implemented method of claim 9 , wherein selecting the selected voice action from among the new voice action and the second voice action comprises: assigning a score to each of the new voice action and the second voice action; and selecting the selected voice action based at least on the score assigned to each of the new voice action and the second voice action. 12 . The computer-implemented method of claim 9 , wherein selecting the selected voice action from among the new voice action and the second voice action comprises selecting the selected voice action in response to determining that the software application associated with the selected voice action is operating in the foreground. 13 . The computer-implemented method of claim 1 , wherein generating the voice action passive data structure comprises determining that the one or more operations to perform the new voice action are capable of being performed by the software application. 14 . The computer-implemented method of claim 1 , comprising: determining, by the voice action system, that the transcription of the user command utterance is similar to the one or more trigger terms associated with the voice action passive data structure; in response to the determination, providing, by the voice action system to the user device, data indicating a request for user input that confirms whether the user command utterance corresponds to the one or more trigger terms associated with the voice action passive data structure or was intended to cause the software application to perform the new voice action; in response to the request, receiving, by the voice action system and from the user device, data indicating a confirmation; and in response to receiving the data indicating the confirmation, providing, by the voice action system, the voice action passive data structure to the user device, thereby causing the software application installed on the user device to perform the one or more operations to perform the new voice action. 15 . The computer-implemented method of claim 1 , comprising: receiving, by the voice action system, a request to deploy the new voice action; and deploying, by the voice action system, the new voice action in response to the request,
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title
Physics · mapped topic
of application context · CPC title
Execution procedure of a spoken command · CPC title
of the speaker; Human-factor methodology · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.