Theme detection for object-recognition-based notifications
US-12183330-B2 · Dec 31, 2024 · US
US9330659B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9330659-B2 |
| Application number | US-201313775643-A |
| Country | US |
| Kind code | B2 |
| Filing date | Feb 25, 2013 |
| Priority date | Feb 25, 2013 |
| Publication date | May 3, 2016 |
| Grant date | May 3, 2016 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A development system is described for facilitating the development of a spoken natural language (SNL) interface. The development system receives seed templates from a developer, each of which provides a command phrasing that can be used to invoke a function, when spoken by an end user. The development system then uses one or more development resources, such as a crowdsourcing system and a paraphrasing system, to provide additional templates. This yields an extended set of templates. A generation system then generates one or more models based on the extended set of templates. A user device may install the model(s) for use in interpreting commands spoken by an end user. When the user device recognizes a command, it may automatically invoke a function associated with that command. Overall, the development system provides an easy-to-use tool for producing an SNL interface.
Opening claim text (preview).
What is claimed is: 1. A computer-readable storage device encoded with computer executable instructions, said instructions causing a computing device to execute a development framework, comprising: a developer interface module configured to provide a development interface, the developer interface module comprising: logic configured to receive a set of seed templates, each seed template identifying a command phrasing for use in invoking a particular function performed by a program, when spoken; and logic configured to collect a set of added templates, each added template identifying another command phrasing for use in invoking the particular function, the set of seed templates and the set of added templates forming an extended set of templates with all of the templates in the extended set providing a different command phrasing matched to a single user intent relative to the particular function; a resource interface module configured to interact with one or more development resources to provide the set of added templates; and a data store for storing the extended set of templates associated with the particular function, the extended set of templates being for use in training one or more models for use on a user device, and said one or more models being applied to determine whether any phrase spoken by an end user is a member of the extended set so as to invoke the particular function. 2. The development framework of claim 1 , wherein the developer interface module further includes logic configured to receive description information, the description information providing a description of the particular function, and wherein the data store also stores the description information. 3. The development framework of claim 2 , wherein the description information includes one or more of: a natural language description of the particular function; and an identification of a handler function that is used to invoke the particular function. 4. The development framework of claim 1 , wherein, for one type of function, each command phrasing includes at least one slot for specifying variable slot information, wherein the developer interface module further includes logic configured to identify at least one grammar, each grammar providing a mechanism for processing the variable slot information associated with a slot, and wherein the data store also stores an identification of said at least one grammar. 5. The development framework of claim 1 , wherein said at least one development resource is a crowdsourcing system, and wherein the crowdsourcing system is configured to: present solicitation information to at least one crowdsourcing participant; and receive one or more added templates from said at least one crowdsourcing participant, said at least one crowdsourcing participant providing said one or more added templates in response to the solicitation information. 6. The development framework of claim 1 , wherein said at least one development resource is a paraphrasing system, and wherein the paraphrasing system is configured to: receive at least one seed template; and provide one or more added templates that are translations of said at least one seed template, said one or more added templates being expressed in a same language as said at least one seed template. 7. The development framework of claim 1 , wherein the developer interface module further comprises logic configured to edit the extending set of templates. 8. The development framework of claim 1 , further comprising a generation system comprising at least one training system configured to: receive the extended set of templates; and generate one or more models based on the extended set of templates. 9. The development framework of claim 8 , wherein the generation system is configured to combine the extended set of templates associated with the particular function with other extended sets of templates associated with other functions, to provide a master set of templates, and wherein the generation system generates said one of more models based on the master set of templates. 10. The development framework of claim 8 , wherein one training system comprises a statistical language model training system that is configured to provide a statistical language model for use by a speech recognition engine. 11. The development framework of claim 10 , wherein the statistical language training system operates on the extended set of templates in a form in which slots are expressed as slot tokens. 12. The development framework of claim 8 , wherein one training system comprises a vector space model training system that is configured to provide a vector space model for use by an intent determination engine. 13. The development framework of claim 12 , wherein the vector space model training system determines a weight for each token in the extended set of templates by treating each template in the extended set as an expression of the single user intent. 14. The development framework of claim 8 , wherein the development framework is configured to update the extended set of templates by adding one or more user-provided templates provided by an end user, to provide an updated set of templates, and wherein the generation system is configured to regenerate said one or more models based on the updated set of templates. 15. A method, implemented by one or more computer devices, comprising: receiving description information, the description information providing a description of a particular function performed by a program resource; receiving a set of seed templates, each seed template identifying a command phrasing for use in invoking the particular function, when the command phrasing is spoken; providing a set of added templates using at least one development resource, each added template identifying another command phrasing for use in invoking the particular function, the set of seed templates and the set of added templates comprising an extended set of templates with all of the templates in the extended set providing a different command phrasing matched to a single user intent relative to the particular function; storing the description information and the extended set of templates in a data store, said at least one development resource including at least a crowdsourcing system and a paraphrasing system; and applying the extended set of templates to train one or more models for use in interpreting and responding to one or more spoken user commands when any phrase spoken by the user is a member of the extended set so as to invoke the particular function. 16. The method of claim 15 , wherein each command phrasing includes at least one slot for specifying variable slot information, the method further comprising: receiving an identification of at least one grammar, each grammar providing a mechanism for processing slot information associated with a slot; and storing the identification of said at least one grammar. 17. A device, comprising: a voice processing module for receiving a voice signal in response to a command spoken by a user, the voice processing module comprising: a speech recognition engine configured to interpret the voice signal using a statistical language model, to provide a speech recognition result; an intent determination engine configured to interpret the speech recognition result using an intent-matching model, to provide an intent determination result, the intent determination result identifying a particular function to be performed by a program in response to the command spoken by the user; l
Adaptation · CPC title
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title
Feedback of the input speech · CPC title
Execution procedure of a spoken command · CPC title
Semantic analysis · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.