What technology area does this patent fall under?

Primary CPC classification G10L15/1822. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue May 08 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Multi-command single utterance input method

US9966065B2 · US · B2

Patent metadata
Field	Value
Publication number	US-9966065-B2
Application number	US-201514724623-A
Country	US
Kind code	B2
Filing date	May 28, 2015
Priority date	May 30, 2014
Publication date	May 8, 2018
Grant date	May 8, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and processes are disclosed for handling a multi-part voice command for a virtual assistant. Speech input can be received from a user that includes multiple actionable commands within a single utterance. A text string can be generated from the speech input using a speech transcription process. The text string can be parsed into multiple candidate substrings based on domain keywords, imperative verbs, predetermined substring lengths, or the like. For each candidate substring, a probability can be determined indicating whether the candidate substring corresponds to an actionable command. Such probabilities can be determined based on semantic coherence, similarity to user request templates, querying services to determine manageability, or the like. If the probabilities exceed a threshold, the user intent of each substring can be determined, processes associated with the user intents can be executed, and an acknowledgment can be provided to the user.

First claim

Opening claim text (preview).

What is claimed is: 1. A non-transitory computer readable storage medium storing one or more programs, the one or more programs comprising instructions, which when executed by one or more processors of an electronic device, cause the electronic device to: receive speech input from a user, wherein the speech input comprises a single utterance having one or more actionable commands; in direct response to receiving the speech input comprising the single utterance: generate a text string based on the speech input using a speech transcription process; parse the text string into at least a first candidate substring and a second candidate substring; determine a first probability that the first candidate substring corresponds to a first actionable command and a second probability that the second candidate sub string corresponds to a second actionable command; in response to the first probability and the second probability exceeding a threshold, determine a first intent associated with the first candidate substring and a second intent associated with the second candidate substring; execute a first process identified by the first intent and a second process identified by the second intent; and provide to the user an acknowledgment that the first process and the second process have at least begun execution. 2. The computer readable storage medium of claim 1 , wherein parsing the text string into at least the first candidate substring and the second candidate substring comprises: identifying a first keyword in the text string that corresponds to a first domain to determine the first candidate substring; and identifying a second keyword in the text string that corresponds to a second domain to determine the second candidate substring. 3. The computer readable storage medium of claim 1 , wherein parsing the text string into at least the first candidate substring and the second candidate substring comprises: identifying a first imperative verb in the text string to determine the first candidate substring; and identifying a second imperative verb in the text string to determine the second candidate substring. 4. The computer readable storage medium of claim 1 , wherein determining the first probability that the first candidate substring corresponds to the first actionable command and the second probability that the second candidate substring corresponds to the second actionable command comprises: determining a first semantic coherence of the first candidate substring and a second semantic coherence of the second candidate substring; and determining the first probability and the second probability based on the first semantic coherence and the second semantic coherence. 5. The computer readable storage medium of claim 1 , wherein determining the first probability that the first candidate substring corresponds to the first actionable command and the second probability that the second candidate substring corresponds to the second actionable command comprises: comparing the first candidate substring and the second candidate substring to one or more user request templates; and determining the first probability and the second probability based on the comparison. 6. The computer readable storage medium of claim 1 , wherein determining the first probability that the first candidate substring corresponds to the first actionable command and the second probability that the second candidate substring corresponds to the second actionable command comprises: submitting the first candidate substring and the second candidate substring to at least a first service and a second service; receiving a first likelihood that the first service can resolve the first actionable command and a second likelihood that the second service can resolve the second actionable command; and determining the first probability and the second probability based on the first likelihood and the second likelihood. 7. The computer readable storage medium of claim 1 , wherein determining the first intent associated with the first candidate substring and the second intent associated with the second candidate substring comprises: determining the second intent based on at least one word in the first candidate substring. 8. The computer readable storage medium of claim 1 , wherein determining the first intent associated with the first candidate substring and the second intent associated with the second candidate substring comprises: determining the first intent or the second intent based on information displayed on a display associated with the electronic device. 9. The computer readable storage medium of claim 8 , wherein the information comprises a list; and wherein determining the first intent associated with the first candidate substring and the second intent associated with the second candidate substring comprises: determining the first intent or the second intent based on an ordinal descriptor in the first candidate substring or the second candidate substring, wherein the ordinal descriptor is associated with one or more items in the list. 10. The computer readable storage medium of claim 8 , wherein the information comprises one or more notifications. 11. The computer readable storage medium of claim 8 , wherein the information comprises one or more emails. 12. The computer readable storage medium of claim 8 , wherein determining the first intent associated with the first candidate substring and the second intent associated with the second candidate substring comprises: determining one or more potential user requests based on the information displayed on the display; and determining the first intent or the second intent based on the one or more potential user requests. 13. The computer readable storage medium of claim 1 , wherein the acknowledgment comprises an audible confirmation. 14. The computer readable storage medium of claim 1 , wherein the acknowledgment comprises haptic feedback. 15. The computer readable storage medium of claim 1 , wherein providing to the user the acknowledgment associated with the first intent and the second intent comprises: providing a first task associated with the first intent and a second task associated with the second intent. 16. The computer readable storage medium of claim 15 , wherein providing the first task associated with the first intent and the second task associated with the second intent comprises: displaying the first task and the second task. 17. The computer readable storage medium of claim 15 , further comprising: in response to completing the first process, providing a first indicator associated with the first task; and in response to completing the second process, providing a second indicator associated with the second task. 18. The computer readable storage medium of claim 17 , wherein providing the first indicator associated with the first task comprises displaying the first indicator; and wherein providing the second indicator associated with the second task comprises displaying the second indicator. 19. The computer readable storage medium of claim 17 , further comprising: before completing the first process, providing a first processing status indicator associated with the first task; and before completing the second process, providing a second processing status indicator associated with the second task. 20. The computer readable storage medium of claim 19 , wherein providing the first processing status indicator associated with the first task comprises displaying the fi

Assignees

Apple Inc

Inventors

Classifications

H04M3/4936
Speech interaction details (speech recognition per se G10L15/00) · CPC title
H04M2203/355
Interactive dialogue design tools, features or methods · CPC title
G06F40/205
Parsing · CPC title
G10L2015/225
Feedback of the input speech · CPC title
G10L15/1822Primary
Parsing for meaning understanding · CPC title

Patent family

Related publications grouped by family.

View patent family 53284678

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9966065B2 cover?: Systems and processes are disclosed for handling a multi-part voice command for a virtual assistant. Speech input can be received from a user that includes multiple actionable commands within a single utterance. A text string can be generated from the speech input using a speech transcription process. The text string can be parsed into multiple candidate substrings based on domain keywords, imp…
Who is the assignee on this patent?: Apple Inc
What technology area does this patent fall under?: Primary CPC classification G10L15/1822. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue May 08 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).