Natural language processing using context

US12080282B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12080282-B2
Application numberUS-202217848901-A
CountryUS
Kind codeB2
Filing dateJun 24, 2022
Priority dateMar 23, 2020
Publication dateSep 3, 2024
Grant dateSep 3, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

This disclosure proposes systems and methods for processing natural language inputs using data associated with multiple language recognition contexts (LRC). A system using multiple LRCs can receive input data from a device, identify a first identifier associated with the device, and further identify second identifiers associated with the first identifier and representing candidate users of the device. The system can access language processing data used for natural language processing for the LRCs corresponding to each of the first and second identifiers, and process the input data using the language processing data at one or more stages of automatic speech recognition, natural language understanding, entity resolution, and/or command execution. User recognition can reduce the number of candidate users, and thus the amount of data used to process the input data. Dynamic arbitration can select from between competing hypotheses representing the first identifier and a second identifier, respectively.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method, comprising: receiving, by a language processing component from a first device, input data corresponding to a natural language input; receiving, from a second device, first context data corresponding to the input data, the first context data representing a device identifier corresponding to the second device; processing the first context data to determine a first identifier corresponding to a first possible user of the first device; and performing, by the language processing component based on to the first identifier, first language processing on the input data to generate output data. 2. The computer-implemented method of claim 1 , further comprising: determining a first automatic speech recognition (ASR) configuration corresponding to the first identifier, wherein performing the first language processing comprises using the first ASR configuration. 3. The computer-implemented method of claim 1 , further comprising: determining a first natural language understanding (NLU) configuration corresponding to the first identifier, wherein performing the first language processing comprises using the first NLU configuration. 4. The computer-implemented method of claim 1 , further comprising: determining first entity data corresponding to the first identifier, wherein performing the first language processing comprises using the first entity data. 5. The computer-implemented method of claim 1 , further comprising: determining, based at least in part on the first context data, that the first possible user is proximate to the first device. 6. The computer-implemented method of claim 1 , wherein performing the first language processing comprising: determining a first hypothesis corresponding to the natural language input; determining a second hypothesis corresponding to the natural language input; and based at least in part on the first identifier, selecting the first hypothesis. 7. The computer-implemented method of claim 1 , wherein the second device comprises a wearable device and receiving the first context data comprises receiving a signal corresponding to the wearable device. 8. The computer-implemented method of claim 1 , further comprising: receiving second context data corresponding to the input data; determining, based on the second context data, a second identifier corresponding to a second possible user of the first device; and based at least in part on the input data, determining the first possible user more likely provided the natural language input than the second possible user. 9. The computer-implemented method of claim 1 , further comprising: determining, based at least in part on the first context data, that the first device detects only a single user proximate to the first device. 10. The computer-implemented method of claim 1 , wherein the input data is audio data corresponding to an utterance and wherein the method comprising: processing the audio data to determine a voice identifier corresponding to the input data; and determining that the voice identifier corresponds to a second identifier different from the first identifier, wherein performing the first language processing is additionally based on the second identifier. 11. A system, comprising: at least one processor; and at least one memory comprising instructions that, when executed by the at least one processor, cause the system to: receive, by at least one language processing component corresponding to a first device, input data corresponding to a natural language input; receive, from a second device, first context data corresponding to the input data, the first context data representing a device identifier corresponding to the second device; processing the first context data to determine a first identifier corresponding to a first possible user of the first device; and perform, by the at least one language processing component based on to the first identifier, first language processing on the input data to generate output data. 12. The system of claim 11 , wherein the at least one memory further comprises instructions that, when executed by the at least one processor, further cause the system to: determine a first automatic speech recognition (ASR) configuration corresponding to the first identifier, wherein the instructions that cause the system to perform the first language processing comprise instructions that, when executed by the at least one processor, cause the system to use the first ASR configuration. 13. The system of claim 11 , wherein the at least one memory further comprises instructions that, when executed by the at least one processor, further cause the system to: determine a first natural language understanding (NLU) configuration corresponding to the first identifier, wherein the instructions that cause the system to perform the first language processing comprise instructions that, when executed by the at least one processor, cause the system to use the first NLU configuration. 14. The system of claim 11 , wherein the at least one memory further comprises instructions that, when executed by the at least one processor, further cause the system to: determine first entity data corresponding to the first identifier, wherein the instructions that cause the system to perform the first language processing comprise instructions that, when executed by the at least one processor, cause the system to use the first entity data. 15. The system of claim 11 , wherein the at least one memory further comprises instructions that, when executed by the at least one processor, further cause the system to: determine, based at least in part on the first context data, that the first possible user is proximate to the first device. 16. The system of claim 11 , wherein the instructions that cause the system to perform the first language processing comprise instructions that, when executed by the at least one processor, cause the system to: determine a first hypothesis corresponding to the natural language input; determine a second hypothesis corresponding to the natural language input; and based at least in part on the first identifier, select the first hypothesis. 17. The system of claim 11 , wherein the second device comprises a wearable device and the instructions that cause the system to receive the first context data comprise instructions that, when executed by the at least one processor, cause the system to receive a signal corresponding to the wearable device. 18. The system of claim 11 , wherein the at least one memory further comprises instructions that, when executed by the at least one processor, further cause the system to: receive second context data corresponding to the input data; determine, based on the second context data, a second identifier corresponding to a second possible user of the first device; and based at least in part on the input data, determine the first possible user more likely provided the natural language input than the second possible user. 19. The system of claim 11 , wherein the at least one memory further comprises instructions that, when executed by the at least one processor, further cause the system to: determine, based at least in part on the first context data, that the first device detects only a single user proximate to the first device. 20. The system of claim 11 , wherein the input data is audio data corresponding to an utterance and the at least one memory further comprises instructions that

Assignees

Inventors

Classifications

  • Interactive procedures; Man-machine interfaces · CPC title

  • of the speaker; Human-factor methodology · CPC title

  • Execution procedure of a spoken command · CPC title

  • of application context · CPC title

  • Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12080282B2 cover?
This disclosure proposes systems and methods for processing natural language inputs using data associated with multiple language recognition contexts (LRC). A system using multiple LRCs can receive input data from a device, identify a first identifier associated with the device, and further identify second identifiers associated with the first identifier and representing candidate users of the …
Who is the assignee on this patent?
Amazon Tech Inc
What technology area does this patent fall under?
Primary CPC classification G10L15/183. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 03 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).