Method and apparatus for building an intelligent automated assistant
US-9501741-B2 · Nov 22, 2016 · US
US10379712B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10379712-B2 |
| Application number | US-201514980388-A |
| Country | US |
| Kind code | B2 |
| Filing date | Dec 28, 2015 |
| Priority date | Apr 18, 2012 |
| Publication date | Aug 13, 2019 |
| Grant date | Aug 13, 2019 |
Official abstract text for this publication.
A conversation user interface enables users to better understand their interactions with computing devices, particularly when speech input is involved. The conversation user interface conveys a visual representation of a conversation between the computing device, or virtual assistant thereon, and a user. The conversation user interface presents a series of dialog representations that show input from a user (verbal or otherwise) and responses from the device or virtual assistant. Associated with one or more of the dialog representations are one or more graphical elements to convey assumptions made to interpret the user input and derive an associated response. The conversation user interface enables the user to see the assumptions upon which the response was based, and to optionally change the assumption(s). Upon change of an assumption, the conversation GUI is refreshed to present a modified dialog representation of a new response derived from the altered set of assumptions.
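The flow described in the abstract (dialog representations, assumptions that can be shown or hidden, and a refreshed response when an assumption changes) can be illustrated with a minimal Python sketch. This is not the patented implementation. The names `DialogRepresentation`, `Conversation`, `toy_respond`, and the `location` assumption are all hypothetical, introduced here only for illustration.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass(eq=False)  # compare by identity so list.index finds this exact entry
class DialogRepresentation:
    """One entry in the conversation GUI (hypothetical model)."""
    speaker: str                                   # "user" or "assistant"
    text: str
    assumptions: Dict[str, str] = field(default_factory=dict)
    show_assumptions: bool = False                 # state of the show/hide control

    def toggle_assumptions(self) -> None:
        """Flip the control between displaying and hiding the assumptions."""
        self.show_assumptions = not self.show_assumptions

    def render(self) -> str:
        """Return the text shown for this entry, with assumptions if visible."""
        line = f"{self.speaker}: {self.text}"
        if self.show_assumptions and self.assumptions:
            details = ", ".join(f"{k}={v}" for k, v in self.assumptions.items())
            line += f"  [assumed: {details}]"
        return line


class Conversation:
    """Holds the dialog and re-derives a response when an assumption changes."""

    def __init__(self, respond: Callable[[str, Dict[str, str]], str]):
        self.respond = respond                     # hypothetical assistant backend
        self.dialog: List[DialogRepresentation] = []

    def submit(self, user_text: str, assumptions: Dict[str, str]) -> DialogRepresentation:
        """Record the user input and the response derived from the assumptions."""
        self.dialog.append(DialogRepresentation("user", user_text))
        reply = DialogRepresentation(
            "assistant", self.respond(user_text, assumptions), dict(assumptions)
        )
        self.dialog.append(reply)
        return reply

    def change_assumption(
        self, reply: DialogRepresentation, key: str, value: str
    ) -> DialogRepresentation:
        """Replace `reply` with a response derived from the altered assumptions."""
        index = self.dialog.index(reply)
        user_text = self.dialog[index - 1].text    # the input that produced `reply`
        revised_assumptions = {**reply.assumptions, key: value}
        revised = DialogRepresentation(
            "assistant",
            self.respond(user_text, revised_assumptions),
            revised_assumptions,
            reply.show_assumptions,
        )
        self.dialog[index] = revised               # refresh the conversation view
        return revised


def toy_respond(user_text: str, assumptions: Dict[str, str]) -> str:
    """Stand-in for the virtual assistant; assumes a 'location' assumption."""
    return f"Showing results for '{user_text}' near {assumptions['location']}."


conv = Conversation(toy_respond)
reply = conv.submit("coffee shops", {"location": "Seattle"})
reply.toggle_assumptions()                         # user reveals the assumption
print(reply.render())
revised = conv.change_assumption(reply, "location", "Portland")
print(revised.render())
```

In this sketch the revised entry replaces the original response in place, which roughly mirrors the refresh behavior the abstract describes. A real system would derive its assumptions from speech recognition, language models, or user profiles rather than from a hand-written dictionary.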
Opening claim text (preview).
What is claimed is:
1. A computing device, comprising: one or more processors; a display communicatively coupled to the one or more processors; and memory storing computer-executable instructions that, when executed by the one or more processors, cause the one or more processors to perform acts comprising: causing display of a conversation graphical user interface (GUI) via the display, the conversation GUI including: an entry portion to receive user input; and an area to present a user dialog representation associated with the user input and a device dialog representation associated with a response from a virtual assistant for the user input; mapping the user input to a context to determine an intent associated with the user input; selecting and utilizing a communication channel between the display and the virtual assistant based on the context; causing display of, via the conversation GUI, a control for at least one of the user dialog representation or the device dialog representation, wherein the control includes options for displaying and hiding an assumption used to determine the response; receiving a selection of the control; and based at least in part on the selection of the control, causing the conversation GUI to either display or hide the assumption used to determine the response, wherein the assumption includes at least some content that is different than content of the response, and wherein causing display of the control comprises causing display of the control in an overlaid manner on at least one of the user dialog representation and the device dialog representation.
2. The computing device of claim 1, wherein the user input comprises audio input and the assumption comprises a parameter used to process the audio input during speech recognition.
3. The computing device of claim 1, wherein the assumption comprises a language model that is employed by natural language processing to interpret the user input.
4. The computing device of claim 1, wherein the assumption comprises logic used by natural language processing to interpret the user input.
5. The computing device of claim 1, wherein the assumption comprises a profile for a user that is associated with the computing device.
6. The computing device of claim 1, wherein the assumption comprises a learned behavior of a user that is associated with the computing device.
7. The computing device of claim 1, wherein the causing display of the control comprises causing display of the control in an overlaid manner on at least one of the user dialog representation or the device dialog representation.
8. The computing device of claim 1, wherein the acts further comprise: receiving a modification to the assumption; determining a revised response based at least in part on the modification to the assumption; and causing display of the revised response in the conversation GUI.
9. A method comprising: displaying a conversation graphical user interface (GUI) associated with a virtual assistant; receiving user input via the conversation GUI; mapping the user input to a context to determine an intent associated with the user input; selecting and utilizing a communication channel between the display and the virtual assistant based on the context as mapped; displaying, via the conversation GUI, a first visual representation for the user input; identifying a response to the user input; displaying, via the conversation GUI, a second visual representation for the response, the response indicating that an action was performed at least in part by the virtual assistant; displaying, via the conversation GUI, a control for at least one of the first visual representation or and the second visual representation, wherein the control includes options for displaying and hiding an assumption used to determine the response; receiving a selection of the control; and based at least in part on the selection of the control, causing the conversation GUI to either display or hide an assumption used to determine the response, wherein the assumption includes at least some content that is different than content of the response, and wherein displaying the control comprises displaying the control in an overlaid manner on at least one of the user dialog representation and the device dialog representation.
10. The method of claim 9, wherein the acts further comprise: displaying, via the conversation GUI, an alternative assumption to the assumption; receiving a selection of the alternative assumption; and displaying a revised response in the conversation GUI.
11. The method of claim 9, wherein the acts further comprise: receiving a modification to the assumption; and based at least in part on receiving the modification to the assumption: removing the second visual representation from the conversation GUI; and displaying, via the conversation GUI, a third visual representation for a revised response from the virtual assistant, the revised response being based at least in part on the modification to the assumption.
12. The method of claim 9, wherein the user input comprises audio input and the assumption comprises a parameter used to process the audio input during speech recognition.
13. The method of claim 9, wherein the assumption comprises at least one of a language model or logic that is employed by natural language processing to interpret the user input.
14. One or more non-transitory computer-readable media storing computer-executable instructions that, when executed by one or more processors, cause the one or more processors to perform acts comprising: presenting a conversation graphical user interface (GUI) associated with a virtual assistant to facilitate a conversation between a virtual assistant and a user; presenting, within the conversation GUI, dialog representations that are each associated with at least one of user input from the user or and a response from the virtual assistant; mapping parsed portions of the conversation to a context to determine an intent associated with the conversation; selecting and utilizing a communication channel between the display and the virtual assistant based on the context; enabling the user to view data that was used to formulate any response from the virtual assistant; causing display of, via the conversation GUI, a control for at least one of a user dialog representation or a device dialog representation, wherein the control includes options for displaying and hiding an assumption used to determine the response; receiving a selection of the control; and based at least in part on the selection of the control, causing the conversation GUI to either display or hide the assumption used to determine the response, wherein the assumption includes at least some content that is different than content of the response, and wherein displaying the control comprises displaying the control in an overlaid manner on at least one of the user dialog representation and the device dialog representation.
15. The one or more non-transitory computer-readable media of claim 14, wherein the enabling comprises providing a control to reveal, upon actuation, how the response from the virtual assistant was determined.
16. The one or more non-transitory computer-readable media of claim 14, wherein the acts further comprise: learning a characteristic of the user, the characteristic comprising at least one of a speech feature, a preference of the user, a behavior of the user, a pattern associated with the user, or a navigation history associated with the user.
17. The one or mo
Interaction techniques based on cursor appearance or behaviour, e.g. being affected by the presence of displayed objects · CPC title
Selection of displayed objects or displayed text elements (G06F3/0482 takes precedence) · CPC title
Real-time or near real-time messaging, e.g. instant messaging [IM] · CPC title
Management of the audio stream, e.g. setting of volume, audio stream path · CPC title
Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title