Processing complex utterances for natural language understanding

US11410646B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-11410646-B1
Application numberUS-201916368399-A
CountryUS
Kind codeB1
Filing dateMar 28, 2019
Priority dateSep 29, 2016
Publication dateAug 9, 2022
Grant dateAug 9, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system capable of performing natural language understanding (NLU) on utterances including complex command structures such as sequential commands (e.g., multiple commands in a single utterance), conditional commands (e.g., commands that are only executed if a condition is satisfied), and/or repetitive commands (e.g., commands that are executed until a condition is satisfied). Audio data may be processed using automatic speech recognition (ASR) techniques to obtain text. The text may then be processed using machine learning models that are trained to parse text of incoming utterances. The models may identify complex utterance structures and may identify what command portions of an utterance go with what conditional statements. Machine learning models may also identify what data is needed to determine when the conditionals are true so the system may cause the commands to be executed (and stopped) at the appropriate times.

First claim

Opening claim text (preview).

What is claimed is: 1. A system, comprising: at least one processor; and at least one computer-readable medium encoded with instructions that, when executed by the at least one processor, cause the system to: receive audio data corresponding to a first utterance from a first device; process the audio data to determine that a first portion of the audio data corresponds to a command and that a second portion of the audio data corresponds to a conditional statement; determine that the conditional statement corresponds to a first data type; receive, from a second device, first data having the first data type; cause, at a first time and based at least in part on the command, at least one component to transition from a first state to a second state; determine, using the first data and at a second time after the first time, that the conditional statement is satisfied; and cause the at least one component to transition from the second state to the first state based at least in part on the conditional statement being satisfied. 2. The system of claim 1 , wherein the at least one computer-readable medium is encoded with additional instructions that, when executed by the at least one processor, further cause the system to: determine a user profile associated with the first device; and determine that the second device is associated with the user profile. 3. The system of claim 2 , wherein the at least one computer-readable medium is encoded with additional instructions that, when executed by the at least one processor, further cause the system to: determine that the first data type corresponds to a sensor type; and determine that the second device corresponds to the sensor type. 4. The system of claim 1 , wherein the at least one computer-readable medium is encoded with additional instructions that, when executed by the at least one processor, further cause the system to: determine text data associated with the first portion; and associate the text data with a first indicator corresponding to the second device and a second indicator corresponding to a value corresponding to the first data type. 5. The system of claim 1 , wherein the at least one computer-readable medium is encoded with additional instructions that, when executed by the at least one processor, further cause the system to: receive the first data during a same communication session as the audio data. 6. The system of claim 1 , wherein the at least one computer-readable medium is encoded with additional instructions that, when executed by the at least one processor, further cause the system to: determine that the first data type corresponds to information available from an internet data source; and determine that the second device corresponds to the internet data source. 7. The system of claim 1 , wherein the audio data further includes a third portion and the at least one computer-readable medium is encoded with additional instructions that, when executed by the at least one processor, further cause the system to: determine that the conditional statement corresponds to the command and not the third portion. 8. The system of claim 7 , wherein the at least one computer-readable medium is encoded with additional instructions that, when executed by the at least one processor, further cause the system to: determine that the third portion corresponds to a second command; and cause the second command to be executed regardless of the conditional statement. 9. A system, comprising: at least one processor; and at least one computer-readable medium encoded with instructions that, when executed by the at least one processor, cause the system to: receive audio data from a first device; process the audio data to determine that a first portion of the audio data corresponds to a request to repeat execution of a command and that a second portion of the audio data corresponds to a conditional statement; cause the command to be executed multiple times, beginning at a first time; determine, at a second time after the first time, that the conditional statement is satisfied; and ceasing from further executing the command based at least in part on the conditional statement being satisfied. 10. The system of claim 9 , wherein the at least one computer-readable medium is encoded with additional instructions that, when executed by the at least one processor, further cause the system to: determine that the conditional statement corresponds to a number of times to repeat execution of the command, wherein the number of times is greater than one; determine that the conditional statement is satisfied by determining that the command has been executed the number of times; and ceasing from further executing the command based at least in part on the command having been executed the number of times. 11. The system of claim 9 , wherein the at least one computer-readable medium is encoded with additional instructions that, when executed by the at least one processor, further cause the system to: determine that the conditional statement corresponds to a first time; determine that the conditional statement is satisfied at least in part by determining that a current time corresponds to the first time; and ceasing from further executing the command based at least in part on the current time corresponding to the first time. 12. The system of claim 9 , wherein the first time is a time of day. 13. The system of claim 9 , wherein the audio data further includes a third portion and the at least one computer-readable medium is encoded with additional instructions that, when executed by the at least one processor, further cause the system to: determine that the conditional statement corresponds to the command and not the third portion. 14. The system of claim 13 , wherein the at least one computer-readable medium is encoded with additional instructions that, when executed by the at least one processor, further cause the system to: determine that the third portion corresponds to a second command; and cause the second command to be executed regardless of the conditional statement. 15. A computer-implemented method, comprising: receiving audio data corresponding to a first utterance from a first device; processing the audio data to determine that a first portion of the audio data corresponds to a command and that a second portion of the audio data corresponds to a conditional statement; determining that the conditional statement corresponds to a first data type; receiving, from a second device, first data having the first data type; causing, at a first time and based at least in part on the command, at least one component to transition from a first state to a second state; determining, using the first data and at a second time after the first time, that the conditional statement is satisfied; and causing the at least one component to transition from the second state to the first state based at least in part on the conditional statement being satisfied. 16. The computer-implemented method of claim 15 , further comprising: determining a user profile associated with the first device; and determining that the second device is associated with the user profile. 17. The computer-implemented method of claim 16 , further comprising: determining that the first data type corresponds to a sensor type; and determining that the second device corresponds to the sensor type. 18. The computer-implemented method of claim 15 , further comprising: determining text data associated with the fir

Assignees

Inventors

Classifications

  • Parsing for meaning understanding · CPC title

  • Audio in a user interface, e.g. using voice commands for navigating, audio feedback · CPC title

  • Execution procedure of a spoken command · CPC title

  • G10L15/22Primary

    Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Training · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11410646B1 cover?
A system capable of performing natural language understanding (NLU) on utterances including complex command structures such as sequential commands (e.g., multiple commands in a single utterance), conditional commands (e.g., commands that are only executed if a condition is satisfied), and/or repetitive commands (e.g., commands that are executed until a condition is satisfied). Audio data may be…
Who is the assignee on this patent?
Amazon Tech Inc
What technology area does this patent fall under?
Primary CPC classification G10L15/22. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 09 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 10 related publications on this page (citations in our corpus or others sharing the same primary CPC).