Task initiation using long-tail voice commands
US-2018096681-A1 · Apr 5, 2018 · US
US12315510B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12315510-B2 |
| Application number | US-202418777134-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jul 18, 2024 |
| Priority date | Oct 3, 2017 |
| Publication date | May 27, 2025 |
| Grant date | May 27, 2025 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
The present disclosure is generally related to a data processing system to selectively invoke applications for execution. A data processing system can receive an input audio signal and can parse the input audio signal to identify a command. The data processing system can identify a first functionality of a first digital assistant application hosted on the data processing system in the vehicle and a second functionality of a second digital assistant application accessible via a client device. The data processing system can determine that one of the first functionality or the second functionality supports the command. The data processing system can select one of the first digital assistant application or the second digital assistant application based on the determination. The data processing system invoke one of the first digital assistant application or the second digital assistant application based on the selection.
Opening claim text (preview).
What is claimed: 1. A method implemented by one or more processors of a computing device, the method comprising: receiving, via one or more microphones of the computing device, an input audio signal that includes: (i) a first portion capturing a given trigger word or phrase, and (ii) a second portion capturing a request; determining, based on processing the first portion of the input audio signal that includes the given trigger word or phrase, a given digital assistant application, from among a plurality of digital assistant applications that are accessible by the computing device, that is associated with the given trigger word or phrase, wherein the plurality of digital assistant applications that are accessible by the computing device include at least a first digital assistant application and a second digital assistant application, determining, based on processing the second portion of the input audio signal that includes the request, an action to be performed responsive to the request; generating, based on the action to be performed responsive to the request, an action data structure including instructions that, when executed, cause the action to be performed responsive to the request; in response to determining that the given trigger word or phrase is a first trigger word or phrase that is associated with the first digital assistant application: causing the first digital assistant application to perform the action to satisfy the request, wherein causing the first digital assistant application to perform the action to satisfy the request comprises causing the first digital assistant application to: receive the action data structure via a first application programming interface (API) associated with the first digital assistant application; and execute the instructions, included in the action data structure, to cause the action to be performed responsive to the request; and in response to determining that the given trigger word or phrase is a second trigger word or phrase that is associated with the second digital assistant application: causing the second digital assistant application to perform the action to satisfy the request, wherein causing the second digital assistant application to perform the action to satisfy the request comprises causing the second digital assistant application to: receive the action data structure via a second API associated with the second digital assistant application; and execute the instructions, included in the action data structure, to cause the action to be performed responsive to the request. 2. The method of claim 1 , wherein the plurality of digital assistant applications that are accessible by the computing device further include a third digital assistant application. 3. The method of claim 2 , further comprising: in response to determining that the given trigger word or phrase is a third trigger word or phrase that is associated with the third digital assistant application: causing the third digital assistant application to perform the action to satisfy the request. 4. The method of claim 1 , wherein an indication of the given digital assistant application is stored in association with the given trigger word or phrase in one or more databases that are accessible to the computing device. 5. The method of claim 1 , wherein determining the given digital assistant application that is associated with the given trigger word or phrase and based on processing the first portion of the input audio signal that includes the given trigger word or phrase comprises: parsing, using a natural language processor component of the computing device, the first portion of the input audio signal to identify the given trigger word or phrase; and determining, based on parsing the input audio signal to identify the given trigger word or phrase, that the given trigger word or phrase is associated with the given digital assistant application. 6. The method of claim 5 , wherein determining the action to be performed responsive to the request and based on processing the second portion of the input audio signal that includes the request comprises: parsing, using the natural language processor component of the computing device, the second portion of the input audio signal to identify the request; and determining, based on parsing the input audio signal to identify the request, the action to be performed responsive to the request. 7. The method of claim 1 , wherein the computing device is a standalone computing device. 8. A system comprising: at least one processor; and memory storing instructions that, when executed, cause the at least one processor to be operable to: receive, via one or more microphones of a computing device, an input audio signal that includes: (i) a first portion capturing a given trigger word or phrase, and (ii) a second portion capturing a request; determine, based on processing the first portion of the input audio signal that includes the given trigger word or phrase, a given digital assistant application, from among a plurality of digital assistant applications that are accessible by the computing device, that is associated with the given trigger word or phrase, wherein the plurality of digital assistant applications that are accessible by the computing device include at least a first digital assistant application and a second digital assistant application, determine, based on processing the second portion of the input audio signal that includes the request, an action to be performed responsive to the request; generate, based on the action to be performed responsive to the request, an action data structure including instructions that, when executed, cause the action to be performed responsive to the request; in response to determining that the given trigger word or phrase is a first trigger word or phrase that is associated with the first digital assistant application: cause the first digital assistant application to perform the action to satisfy the request, wherein the instructions to cause the first digital assistant application to perform the action to satisfy the request comprise instructions to cause the first digital assistant application to: receive the action data structure via a first application programming interface (API) associated with the first digital assistant application; and execute the instructions, included in the action data structure, to cause the action to be performed responsive to the request; and in response to determining that the given trigger word or phrase is a second trigger word or phrase that is associated with the second digital assistant application: cause the second digital assistant application to perform the action to satisfy the request, wherein the instructions to cause the second digital assistant application to perform the action to satisfy the request comprise instructions to cause the second digital assistant application to: receive the action data structure via a second API associated with the second digital assistant application; and execute the instructions, included in the action data structure, to cause the action to be performed responsive to the request. 9. The system of claim 8 , wherein the plurality of digital assistant applications that are accessible by the computing device further include a third digital assistant application. 10. The system of claim 9 , wherein the instructions are further operable to: in response to determining that the given trigger word or phrase is a third trigger word or phrase that is associated with the third digital assistant application: cause the third digital assistant application to perform the action to satisfy the request.
Execution procedure of a spoken command · CPC title
Word spotting · CPC title
Distributed recognition, e.g. in client-server systems, for mobile phones or network applications · CPC title
Parsing for meaning understanding · CPC title
Selecting among different versions · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.