What technology area does this patent fall under?

The primary CPC classification is G10L15/22 (procedures used during a speech recognition process, e.g. man-machine dialogue), which falls under CPC section G (Physics).

When was this patent published?

Publication date: Mar 20, 2018 (kind code B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Training an at least partial voice command system

US9922642B2 · US · B2

Patent metadata
Field	Value
Publication number	US-9922642-B2
Application number	US-201414213878-A
Country	US
Kind code	B2
Filing date	Mar 14, 2014
Priority date	Mar 15, 2013
Publication date	Mar 20, 2018
Grant date	Mar 20, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An electronic device with one or more processors and memory includes a procedure for training a digital assistant. In some embodiments, the device detects an impasse in a dialog between the digital assistant and a user including a speech input. During a learning session, the device utilizes a subsequent clarification input from the user to adjust intent inference or task execution associated with the speech input to produce a satisfactory response. In some embodiments, the device identifies a pattern of success or failure associated with an aspect previously used to complete a task and generates a hypothesis regarding a parameter used in speech recognition, intent inference or task execution as a cause for the pattern. Then, the device tests the hypothesis by altering the parameter for a subsequent completion of the task and adopts or rejects the hypothesis based on feedback information collected from the subsequent completion.
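The hypothesis-testing loop described in the second half of the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation: the `Hypothesis` class, the parameter name `asr_confidence_threshold`, the `min_gain` cutoff, and the use of simple success rates as "feedback information" are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Hypothesis:
    """A guess that one parameter is the cause of a failure pattern."""
    parameter: str         # hypothetical parameter name
    baseline_value: float  # value in use when the pattern was observed
    trial_value: float     # altered value used for subsequent task completions


def evaluate_hypothesis(baseline_outcomes, trial_outcomes, min_gain=0.1):
    """Compare success rates before and after altering the parameter.

    Outcomes are lists of booleans (True = task completed satisfactorily).
    min_gain is an assumed cutoff; the abstract does not specify one.
    """
    if not baseline_outcomes or not trial_outcomes:
        return "undecided"
    base_rate = sum(baseline_outcomes) / len(baseline_outcomes)
    trial_rate = sum(trial_outcomes) / len(trial_outcomes)
    return "adopt" if trial_rate - base_rate >= min_gain else "reject"


def apply_decision(params, hypothesis, decision):
    """Return a copy of params with the trial value kept only if adopted."""
    updated = dict(params)
    if decision == "adopt":
        updated[hypothesis.parameter] = hypothesis.trial_value
    else:
        # Rejected or undecided: fall back to the baseline value.
        updated[hypothesis.parameter] = hypothesis.baseline_value
    return updated
```

For example, with one success in four baseline attempts and three in four trial attempts, `evaluate_hypothesis` returns `"adopt"` and `apply_decision` keeps the trial value.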

First claim

Opening claim text (preview).

What is claimed is:

1. A computer-implemented method for training a digital assistant, performed at an electronic device including one or more processors and memory storing instructions for execution by the one or more processors, the method comprising: detecting an impasse during a dialogue between the digital assistant and a user, wherein the dialogue includes at least one speech input from the user, wherein the at least one speech input includes a plurality of words; in response to detecting the impasse, establishing a learning session associated with the at least one speech input; during the learning session: receiving one or more subsequent clarification inputs from the user; based at least in part on the one or more subsequent clarification inputs, adjusting at least one of intent inference and task execution associated with the at least one speech input to produce a satisfactory response to the at least one speech input; and associating the satisfactory response with the entirety of the at least one speech input for processing future occurrences of the at least one speech input, wherein the associating comprises replacing a respective initial response shared by a set of users in a community of users with the satisfactory response to the at least one speech input from the user.

2. The method of claim 1, wherein detecting the impasse comprises: during the dialogue between the digital assistant and the user: receiving at least one speech input from the user; inferring an initial intent based on the at least one speech input; providing an initial response to fulfill the initial intent that has been inferred; and receiving a follow-up speech input from the user rejecting the initial response.

3. The method of claim 2, wherein the initial intent is a best guess; and during the learning session, the method further comprising: prior to receiving the one or more subsequent clarification inputs from the user, inferring a second intent based on the at least one speech input, wherein the second intent is a second best guess and the second intent is distinct from the initial intent; and providing a second response to fulfill the second intent that has been inferred.

4. The method of claim 1, wherein the impasse comprises one of a set consisting of: a single user rejection of an initial response distinct from the satisfactory response; two or more user rejections of the initial response; and a user command ending the dialogue.

5. The method of claim 1, further comprising: during the learning session: prior to receiving the one or more subsequent clarification inputs from the user, providing two or more alternative responses to the at least one speech input from the user.

6. The method of claim 1, further comprising: during the learning session: reducing a respective intent inference or speech recognition threshold so as to generate the two or more alternative responses to the at least one speech input from the user.

7. The method of claim 1, further comprising: during the learning session: prior to receiving the one or more subsequent clarification inputs from the user, rephrasing at least a portion of the at least one speech input from the user to elicit one or more subsequent clarification inputs from the user.

8. The method of claim 1, wherein detecting an impasse during a dialogue between the digital assistant and the user comprises at least one of: receiving a follow-up speech input from the user rejecting an initial response to the at least one speech input; and determining that an actionable intent cannot be inferred from the at least one speech input.

9. The method of claim 1, wherein the satisfactory response comprises at least one of: provision of an answer to requested information and performance of a requested task.

10. An electronic device, comprising: a sound receiving unit configured to receive sound input; a speaker unit configured to output sound; and a processing unit coupled to the sound receiving unit and the speaker unit, the processing unit configured to: detect an impasse during a dialogue between the digital assistant and a user, wherein the dialogue includes at least one speech input from the user, wherein the at least one speech includes a plurality of words; in response to detecting the impasse, establish a learning session associated with the at least one speech input; during the learning session: receive one or more subsequent clarification inputs from the user; based at least in part on the one or more subsequent clarification inputs, adjust at least one of intent inference and task execution associated with the at least one speech input to produce a satisfactory response to the at least one speech input; and associate the satisfactory response with the entirety of the at least one speech input for processing future occurrences of the at least one speech input, wherein the associating comprises replacing a respective initial response shared by a set of users in a community of users with the satisfactory response to the at least one speech input from the user.

11. The electronic device of claim 10, wherein detecting the impasse comprises: during the dialogue between the digital assistant and the user, the processing unit is further configured to: receive at least one speech input from the user; infer an initial intent based on the at least one speech input; provide an initial response to fulfill the initial intent that has been inferred; and receive a follow-up speech input from the user rejecting the initial response.

12. The electronic device of claim 10, wherein the initial intent is a best guess; and during the learning session, the processing unit is further configured to: prior to receiving the one or more subsequent clarification inputs from the user, infer a second intent based on the at least one speech input, wherein the second intent is a second best guess and the second intent is distinct from the initial intent; and provide a second response to fulfill the second intent that has been inferred.

13. The electronic device of claim 10, wherein the impasse comprises one of a set consisting of: a single user rejection of an initial response distinct from the satisfactory response; two or more user rejections of the initial response; and a user command ending the dialogue.

14. The electronic device of claim 10, wherein the processing unit is configured to detect an impasse during a dialogue between the digital assistant and the user comprises the processing unit being configured to perform at least one of: receiving a follow-up speech input from the user rejecting an initial response to the at least one speech input; and determining that an actionable intent cannot be inferred from the at least one speech input.

15. The electronic device of claim 10, wherein the satisfactory response comprises at least one of: provision of an answer to requested information and performance of a requested task.

16. A non-transitory computer-readable storage medium storing one or more programs, the one or more programs comprising instructions, which, when executed by an electronic device with one or more processors, cause the device to: detect an impasse during a dialogue between the digital assistant and a user, wherein the dialogue includes at least one speech input from the user, wherein the at least one speech input includes a plurality of words; in response to detecting the impasse, establish a learning session associated with the at least one speech input; during the learning sess
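To make the flow of the independent method claim easier to follow, here is a rough Python sketch of impasse detection followed by a learning session. It is an illustrative toy, not the claimed implementation: the `Assistant` class, the `REJECTION_PHRASES` list, the `resolve` callback, and the plain dictionary standing in for responses shared across a community of users are all hypothetical.

```python
from dataclasses import dataclass, field

# Hypothetical phrases treated as a rejection of the initial response.
REJECTION_PHRASES = ("no", "that's wrong", "not what i meant")


@dataclass
class Assistant:
    # Toy stand-in for responses shared by a community of users,
    # keyed by the entire speech input.
    shared_responses: dict = field(default_factory=dict)

    def detect_impasse(self, intent, follow_up):
        """Return True if no actionable intent was inferred, or if the
        user's follow-up rejects the initial response."""
        if intent is None:
            return True
        return (follow_up is not None
                and follow_up.strip().lower() in REJECTION_PHRASES)

    def learning_session(self, speech_input, clarifications, resolve):
        """Use clarification inputs to produce a satisfactory response,
        then associate it with the whole speech input for future use."""
        response = resolve(speech_input, clarifications)
        if response is not None:
            # Replaces any previously shared initial response.
            self.shared_responses[speech_input] = response
        return response
```

A caller would first check `detect_impasse(intent, follow_up)` and, only when it returns `True`, open a learning session with the user's clarification inputs. The `resolve` callback is where adjusted intent inference or task execution would plug in.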

Assignees

Apple Inc

Inventors

Classifications

G10L15/22 · Primary
Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title
G10L2015/223
Execution procedure of a spoken command · CPC title
G10L2015/227
of the speaker; Human-factor methodology · CPC title
G10L15/063 · Primary
Training · CPC title
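CPC symbols such as those listed here follow a fixed layout: a section letter, a two-digit class, a subclass letter, then a main group and subgroup separated by a slash. The sketch below splits a symbol into those parts. The function name `parse_cpc` and the exact regular expression are assumptions for illustration, not part of any official tooling.

```python
import re

# Section (A-H or Y), two-digit class, subclass letter, main group "/" subgroup.
CPC_PATTERN = re.compile(r"^([A-HY])(\d{2})([A-Z])(\d{1,4})/(\d{2,6})$")


def parse_cpc(symbol):
    """Split a CPC symbol like 'G10L15/22' into its hierarchical parts."""
    match = CPC_PATTERN.match(symbol.strip())
    if match is None:
        raise ValueError(f"not a recognizable CPC symbol: {symbol!r}")
    section, cls, subclass, main_group, subgroup = match.groups()
    return {
        "section": section,                    # e.g. 'G' (Physics)
        "class": section + cls,                # e.g. 'G10'
        "subclass": section + cls + subclass,  # e.g. 'G10L'
        "main_group": main_group,              # e.g. '15'
        "subgroup": subgroup,                  # e.g. '22'
    }
```

Calling `parse_cpc("G10L15/22")` yields section `G`, class `G10`, subclass `G10L`, main group `15`, and subgroup `22`. Subgroups are kept as strings so that leading zeros, as in `063`, are preserved.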

Patent family

Related publications grouped by family.

Patent family ID: 50729785

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9922642B2 cover?: An electronic device with one or more processors and memory includes a procedure for training a digital assistant. In some embodiments, the device detects an impasse in a dialog between the digital assistant and a user including a speech input. During a learning session, the device utilizes a subsequent clarification input from the user to adjust intent inference or task execution associated wi…
Who is the assignee on this patent?: Apple Inc