Voice command processing for locked devices

US11862174B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11862174-B2
Application numberUS-202117209596-A
CountryUS
Kind codeB2
Filing dateMar 23, 2021
Priority dateMay 29, 2018
Publication dateJan 2, 2024
Grant dateJan 2, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques for processing voice commands from a locked device are described. A voice command received by a locked device is stored, a prompt requesting that the device be unlocked is generated, and the voice command is processed automatically after the device is unlocked. Thus, the system processes the voice command without the user repeating the voice command. In addition, the system may process certain voice commands even when the device is locked. For example, a whitelist filter compares an intent associated with the voice command to whitelisted intents from a whitelist database before the intent is dispatched to a speechlet, and intents included in the whitelist database are processed normally. Thus, the system performs certain voice commands while the device is locked, while other voice commands may be automatically processed after the device is unlocked without the user repeating the voice command.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method comprising: capturing, by at least one microphone of a first device in a locked state, audio representing an utterance; sending, from the first device to a remote system while the first device is in the locked state, audio data representing the utterance; determining, by the remote system, that the first device is in the locked state; performing, by the remote system, speech processing on the audio data to determine speech processing results data, wherein the speech processing includes natural language understanding (NLU) processing and the speech processing results data includes first NLU results data, wherein the first NLU results data includes a plurality of potential intents associated with a respective confidence score; determining that a first potential intent of the plurality of potential intents corresponds to an action that is not permitted to be performed while the first device is in the locked state, the first potential intent being associated with a first confidence score representing a highest confidence among the plurality of potential intents; determining that a second potential intent of the plurality of potential intents corresponds to an action that is permitted to be performed while the first device is in the locked state, the second potential intent being associated with a second confidence score representing a lower confidence than the first confidence score; sending, from the remote system to a speechlet component, data indicating the second potential intent; receiving, by the remote system from the speechlet component, response data; and sending, from the remote system to the first device, data corresponding to execution of the action, wherein the data corresponding to execution of the action is based, at least in part, on the response data. 2. The computer-implemented method of claim 1 , wherein the data corresponding to execution of the action includes a command for the first device to perform the action. 3. The computer-implemented method of claim 1 , further comprising: sending, from the first device to the remote system, state data indicating the first device is in the locked state. 4. The computer-implemented method of claim 3 , further comprising: sending, from the remote system to the speechlet component, the state data. 5. The computer-implemented method of claim 1 , wherein determining that the second potential intent of the plurality of intents corresponds to an action that is permitted to be performed while the first device is in the locked state is based at least in part on user profile data. 6. The computer-implemented method of claim 1 , further comprising: determining, by the first device while in the locked state, that the audio comprises a wakeword. 7. The computer implemented method of claim 1 , further comprising: determining, by the remote system, whether the speech processing results data refers to the speechlet component, wherein determining that the second potential intent of the plurality of intents corresponds to an action that is permitted to be performed while the first device is in the locked state is based on determining that the second potential intent refers to the speechlet component. 8. A system comprising: at least one processor; and memory comprising instructions that, when executed by the at least one processor, cause the system to: capture, by at least one microphone of a first device in a locked state, audio representing an utterance; send, from the first device to a remote system while the first device is in the locked state, audio data representing the utterance; determine, by the remote system, that the first device is in the locked state; perform, by the remote system, speech processing on the audio data to determine speech processing results data, wherein the speech processing includes natural language understanding (NLU) processing and the speech processing results data includes first NLU results data, wherein the first NLU results data includes a plurality of potential intents associated with a respective confidence score; determine that a first potential intent of the plurality of potential intents corresponds to an action that is not permitted to be performed while the first device is in the locked state, the first potential intent being associated with a first confidence score representing a highest confidence among the plurality of potential intents; determine that a second potential intent of the plurality of potential intents corresponds to an action that is permitted to be performed while the first device is in the locked state, the second potential intent being associated with a second confidence score representing a lower confidence than the first confidence score; send, from the remote system to a speechlet component, data, indicating the second potential intent; receive, by the remote system from the speechlet component, response data; and send, from the remote system to the first device, data corresponding to execution of the action, wherein the data corresponding to execution of the action is based, at least in part, on the response data. 9. The system of claim 8 , wherein the data corresponding to execution of the action includes a command for the first device to perform the action. 10. The system of claim 8 , wherein the memory further comprises instructions that, when executed by the at least one processor, further cause the system to: send, from the first device to the remote system, state data indicating the first device is in the locked state. 11. The system of claim 8 , wherein determination, that the second potential intent of the plurality of intents corresponds to an action that is permitted to be performed while the first device is in the locked state is based at least in part on user profile data. 12. The system of claim 8 , wherein the memory further comprises instructions that, when executed by the at least one processor, further cause the system to: determine, by the first device while in the locked state, that the audio comprises a wakeword. 13. A computer-implemented method comprising: capturing, by at least one microphone of a first device in a locked state, audio representing an utterance; sending, from the first device to a remote system while the first device is in the locked state, audio data representing the utterance; determining, by the remote system, that the first device is in the locked state; performing, by the remote system, speech processing on the audio data to determine speech processing results data, wherein the speech processing includes natural language understanding (NLU) processing and the speech processing results data includes first NLU results data, wherein the first NLU results data includes a plurality of potential intents associated with a respective confidence score; determining that a first potential intent of the plurality of potential intents corresponds to an action that is not permitted to be performed while the first device is in the locked state, the first potential intent being associated with a first confidence score representing a highest confidence among the plurality of potential intents; determining that a second potential intent of the plurality of potential intents corresponds to an action that is permitted to be performed while the first device is in the locked state, the second potential intent being associated with a second confidence score representing a lower confidence than the first confidence score; sending, from the remote system to a speechlet component, data indicating the second potential intent; rece

Assignees

Inventors

Classifications

  • G10L17/00Primary

    Speaker identification or verification techniques · CPC title

  • Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title

  • G06F21/31Primary

    User authentication · CPC title

  • Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Interactive procedures; Man-machine interfaces · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11862174B2 cover?
Techniques for processing voice commands from a locked device are described. A voice command received by a locked device is stored, a prompt requesting that the device be unlocked is generated, and the voice command is processed automatically after the device is unlocked. Thus, the system processes the voice command without the user repeating the voice command. In addition, the system may proce…
Who is the assignee on this patent?
Amazon Tech Inc
What technology area does this patent fall under?
Primary CPC classification G10L17/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 02 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).