Neural latent variable model for spoken language understanding

US9911413B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-9911413-B1
Application numberUS-201615392718-A
CountryUS
Kind codeB1
Filing dateDec 28, 2016
Priority dateDec 28, 2016
Publication dateMar 6, 2018
Grant dateMar 6, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A linguist classifier, for instance intent or slot classifier, is updated using data with only partial annotation indicating overall correctness rather that specific correct intent or slot values, which are treated as “latent” (i.e., unknown) variables. Full annotation of the data is not required. A small amount of fully annotated data may be combined with a substantially larger amount of partially annotated data to update the linguistic classifier. In a specific implementation, the linguistic classifier is a neural network and the weights are trained using a reinforcement learning approach.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for configuring a natural language (NL) understanding system, the NL understanding system including at least an intent classifier, the intent classifier comprising a neural network configurable with configuration data including neural network weights to distinguish a plurality of intents based on a linguistic input, the method comprising: configuring the intent classifier with first configuration data, the first configuration data having been determined from first collected data comprising input data items each annotated with a specific intent of a plurality of intents, the neural network weights of the first configuration data having been determined to best match the input data items and the annotated specific intents; processing a second plurality of user inputs to the system, wherein the processing includes, for each of the second plurality of user inputs, using the intent classifier configured with the first configuration data to determine a recognized intent corresponding to said input, and causing determination of a corresponding response to the input based on the recognized intent; storing second collected data, the second collected data comprising, for each of the second plurality of user inputs, a representation of the user input and a corresponding response; receiving manual annotation for the second collected data, an annotation for each item of the second collected data indicating whether the corresponding response is consistent with the user input, thereby forming second annotated data without requiring annotating a correct intent for user inputs for which the caused action does not sufficiently match the user input; determining second configuration data for the intent classifier, the second configuration data being determined to distinguish the plurality of intents by computing the second configuration data to match the second annotated data, including treating correct intents as latent variables that are not represented in the second annotated data, the determining including incrementally updating the neural network weights of the second configuration data; and configuring the intent classifier with the second configuration data. 2. A method for configuring a natural language (NL) understanding system, the NL understanding system including at least a linguistic classifier, the linguistic classifier being configurable with configuration data to distinguish a plurality of linguistic categories based on a linguistic input, the method comprising: configuring the linguistic classifier with first configuration data; processing a second plurality of user inputs to the system, wherein the processing includes, for each of the second plurality of user inputs, using the linguistic classifier configured with the first configuration data to determine a recognized linguistic category corresponding to said input, and causing determination of a corresponding response to the input based on the recognized linguistic category; storing second collected data, the second collected data comprising, for each of the second plurality of user inputs, a representation of the user input and the corresponding responses; receiving annotations for the second collected data, an annotation for each item of the second collected data indicating whether the corresponding response is consistent with the user input; determining second configuration data for the first linguistic classifier, the second configuration data being determined to distinguish the plurality of linguistic categories by computing the second configuration data to match the second annotated data; and configuring the linguistic classifier with the second configuration data. 3. The method of claim 2 wherein the linguistic classifier comprises an intent classifier, and the linguistic categories comprise user intents. 4. The method of claim 2 wherein the linguistic classifier comprises a slot recognizer, and the linguistic categories comprise slots. 5. The method of claim 2 further comprising determining the first configuration data including receiving first annotated data, the first annotated data associating each user input of a plurality of user input of first collected data with a corresponding linguistic category of the plurality of categories; and computing the first configuration data for the first linguistic classifier to distinguish the plurality of linguistic categories by selecting the first configuration data to match the first annotated data. 6. The method of claim 5 wherein determining the second configuration data further comprises selecting the second configuration data to further match the first annotated data. 7. The method of claim 2 wherein processing a second plurality of user inputs to the system further comprises: applying an automated speech recognition procedure to data representing speech inputs of the users to determine a text-based representations of the inputs; producing linguistic representations of the inputs from the text-based representations of the inputs; and providing the linguistic representations to the first linguistic classifier, which provides corresponding recognized linguistic categories as outputs. 8. The method of claim 2 wherein the linguistic classifier comprises an artificial neural network (ANN), and the first configuration data and the second configuration data comprise values of weights of the ANN. 9. The method of claim 8 wherein configuring the linguistic classifier with data comprises storing values of the weights of the ANN in a storage for access during execution of software instructions implementing the ANN. 10. The method of claim 8 wherein determining the second configuration data for the linguistic classifier comprises applying an incremental updating procedure of the weights of the ANN. 11. The method of claim 10 wherein applying the incremental updating procedure comprises applying a Back Propagation procedure. 12. The method of claim 10 wherein applying the incremental updating procedure includes treating correct linguistic categories as latent variables that are not represented in the second annotated data. 13. The method of claim 2 wherein the representations of the first plurality of user inputs comprise text-based representations of said inputs. 14. The method of claim 2 wherein the representations of the actions comprise text-based representations of said actions. 15. A natural language (NL) understanding system comprising: a speech recognizer configured to process a speech input to produce an output; a linguistic feature extractor configured to process the output of the speech recognizer to produce a linguistic representation of the speech input, the linguistic representation comprising a fixed-length numerical vector; a linguistic classifier, the linguistic classifier being configurable with configuration data to distinguish a plurality of linguistic categories based on a linguistic input; a classifier initializer for determining first configuration data for the first linguist classifier; a classifier updater for determining second configuration data for the first linguistic classifier based on a processing of second collected data including a second plurality of user inputs to the system and corresponding actions determined by the system for those inputs, and annotations for each item of the second collected data indicating whether the action matches the user input, the classifier updater being configured to determining second configuration data for the first linguistic classifier to distinguish the plurality of linguistic categories by computing

Assignees

Inventors

Classifications

  • Recurrent networks, e.g. Hopfield networks · CPC title

  • Probabilistic graphical models, e.g. probabilistic networks · CPC title

  • Training · CPC title

  • Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • Feature extraction for speech recognition; Selection of recognition unit · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9911413B1 cover?
A linguist classifier, for instance intent or slot classifier, is updated using data with only partial annotation indicating overall correctness rather that specific correct intent or slot values, which are treated as “latent” (i.e., unknown) variables. Full annotation of the data is not required. A small amount of fully annotated data may be combined with a substantially larger amount of parti…
Who is the assignee on this patent?
Amazon Tech Inc
What technology area does this patent fall under?
Primary CPC classification G10L15/1815. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 06 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).