Conversational recovery for voice user interface

US11380330B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11380330-B2
Application numberUS-202016997024-A
CountryUS
Kind codeB2
Filing dateAug 19, 2020
Priority dateNov 7, 2017
Publication dateJul 5, 2022
Grant dateJul 5, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A processing device executing a component of a conversational recovery system receives an intent data and a first entity data identified from user input data. The processing device determines that the first entity data is associated with first content associated with a first component. The processing device additionally receives a text data of the user input data. The processing device determines a word in the text data that matches a keyword associated with second content associated with a second component. The processing device ranks the first component and the second component. The processing device generates message data that comprises an inquiry with respect to choosing the first content or the second content.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method comprising: receiving input data representing a natural language input to a client device; processing the input data to determine first natural language understanding (NLU) data representing a first interpretation of the natural language input; determining the first NLU data corresponds to a potential failure response; based at least in part on determining the first NLU data corresponds to a potential failure response, determining first data representing a semantic similarity between the input data and previous input data representing a previous natural language input; determining, based at least in part on the first data, second NLU data corresponding to the previous input data and representing a second interpretation of the natural language input; determining output data using the second NLU data; and sending the output data to the client device. 2. The computer-implemented method of claim 1 , wherein processing the input data comprises: performing NLU processing to determine the first NLU data and the second NLU data. 3. The computer-implemented method of claim 1 , further comprising: performing NLU processing to determine the first NLU data and a first score corresponding to the first NLU data; and prior to determining the first NLU data corresponds to a potential failure response, determining the first score corresponds to a preferred rank among a plurality of interpretations of the natural language input. 4. The computer-implemented method of claim 1 , wherein the first NLU data includes first intent data different than second intent data included in the second NLU data. 5. The computer-implemented method of claim 4 , wherein the first NLU data and the second NLU data includes first entity data. 6. The computer-implemented method of claim 1 , wherein: the first NLU data corresponds to first content associated with a first component; the method further comprises determining a portion of the input data corresponds to a second component; and determining the first NLU data corresponds to a potential failure response is based at least in part on the portion of the input data corresponding to the second component. 7. The computer-implemented method of claim 1 , wherein determining the output data using the second NLU data is based at least in part on feedback data. 8. The computer-implemented method of claim 1 , further comprising: determining the natural language input lacks a verb, wherein determining the first NLU data corresponds to a potential failure response is based at least in part on the natural language input lacking a verb. 9. The computer-implemented method of claim 1 , wherein determining the output data using the second NLU data is performed at least in part to avoid presenting a failure response. 10. A system comprising: at least one processor; and at least one memory comprising instructions that, when executed by the at least one processor, cause the system to: receive input data representing a natural language input to a client device; process the input data to determine a first interpretation of the natural language input; determine the first interpretation corresponds to a potential failure response; send, to the client device, first data requesting selection of a second interpretation of the natural language input, the second interpretation being different from the first interpretation; determine output data based at least in part on the second interpretation; and send the output data to the client device. 11. The system of claim 10 , wherein the first data further includes an indication of the first interpretation. 12. The system of claim 10 , wherein the input data comprises input audio data and wherein the at least one memory further comprises instructions that, when executed by the at least one processor, further cause the system to: perform automatic speech recognition processing using the input audio data to determine automatic speech recognition (ASR) data; perform text-to-speech processing using the ASR data to determine output audio data; and include in the first data the output audio data. 13. The system of claim 12 , wherein the at least one memory further comprises instructions that, when executed by the at least one processor, further cause the system to: process the ASR data to determine natural language understanding data representing the first interpretation. 14. The system of claim 10 , wherein the at least one memory further comprises instructions that, when executed by the at least one processor, further cause the system to: determine second data representing a semantic similarity between the natural language input and previous user input data corresponding to the second interpretation, wherein the output data is determined based at least in part on the second data. 15. The system of claim 10 , wherein the at least one memory further comprises instructions that, when executed by the at least one processor, further cause the system to: determine the natural language input lacks a verb, wherein the instructions that cause the system to determine the first interpretation corresponds to a potential failure response are based at least in part on the natural language input lacking a verb. 16. The system of claim 10 , wherein the instructions that cause the system to determine the output data based at least in part on the second interpretation are performed based at least in part on feedback data. 17. The system of claim 10 , wherein the instructions that cause the system to process the input data comprise instructions that, when executed by the at least one processor, further cause the system to: performing natural language understanding (NLU) processing to determine first NLU data corresponding to the first interpretation and second NLU data corresponding to the second interpretation. 18. The system of claim 17 , wherein the instructions that determine the first interpretation corresponds to a potential failure response are based at least in part on the first NLU data. 19. The system of claim 10 , wherein: the first interpretation corresponds to first content associated with a first component; the at least one memory further comprises instructions that, when executed by the at least one processor, further cause the system to determine a portion of the input data corresponds to a second component; and determine the first interpretation corresponds to a potential failure response based at least in part on the portion of the input data corresponding to the second component.

Assignees

Inventors

Classifications

  • Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • G10L15/26Primary

    Speech to text systems (G10L15/08 takes precedence) · CPC title

  • Multiple recognisers used in sequence or in parallel; Score combination systems therefor, e.g. voting systems · CPC title

  • Parsing for meaning understanding · CPC title

  • Natural language query formulation · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11380330B2 cover?
A processing device executing a component of a conversational recovery system receives an intent data and a first entity data identified from user input data. The processing device determines that the first entity data is associated with first content associated with a first component. The processing device additionally receives a text data of the user input data. The processing device determin…
Who is the assignee on this patent?
Amazon Tech Inc
What technology area does this patent fall under?
Primary CPC classification G10L15/26. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 05 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).