Automatic post-editing model for neural machine translation
US-11295092-B2 · Apr 5, 2022 · US
US11508360B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11508360-B2 |
| Application number | US-202017021892-A |
| Country | US |
| Kind code | B2 |
| Filing date | Sep 15, 2020 |
| Priority date | Sep 15, 2020 |
| Publication date | Nov 22, 2022 |
| Grant date | Nov 22, 2022 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
This document relates to machine learning. One example includes a method or technique that can be performed on a computing device. The method or technique can include obtaining a task-adapted generative model that has been tuned using one or more task-specific seed examples. The method or technique can also include inputting dialog acts into the task-adapted generative model and obtaining synthetic utterances that are output by the task-adapted generative model. The method or technique can also include populating a synthetic training corpus with synthetic training examples that include the synthetic utterances. The synthetic training corpus may be suitable for training a natural language understanding model.
Opening claim text (preview).
The invention claimed is: 1. A method comprising: obtaining a task-adapted generative model that has been tuned using one or more task-specific seed examples; inputting dialog acts into the task-adapted generative model; obtaining synthetic utterances that are output by the task-adapted generative model; and populating a synthetic training corpus with synthetic training examples that include the synthetic utterances, the synthetic training corpus suitable for training a natural language understanding model. 2. The method of claim 1 , wherein each of the synthetic training examples comprise a particular synthetic utterance and a particular dialog act that was input to the task-adapted generative model to generate the particular synthetic utterance. 3. The method of claim 2 , wherein the dialog acts comprising intent values and slot values. 4. The method of claim 3 , wherein obtaining the synthetic utterances comprises sampling tokens from an output distribution of the task-adapted generative model. 5. The method of claim 1 , further comprising: training the natural language understanding model using the synthetic training corpus. 6. The method of claim 5 , wherein the task-adapted generative model comprises one or more transformer decoders and the natural language understanding model comprises one or more transformer encoders. 7. The method of claim 1 , further comprising: receiving a request to train the natural language understanding model; receiving the task-specific seed examples for generating the natural language understanding model; determining whether additional task-specific examples are appropriate for training the natural language understanding model; and populating the synthetic training corpus in an instance when additional task-specific examples are determined to be appropriate for generating the natural language understanding model. 8. The method of claim 7 , further comprising: outputting an offer to generate the synthetic training corpus responsive to a determination that additional task-specific examples are appropriate for generating the natural language understanding model; and populating the synthetic training corpus responsive to acceptance of the offer. 9. A system comprising: a processor; and a storage medium storing instructions which, when executed by the processor, cause the processor to: using a task-adapted generative model tuned for a particular task, generate synthetic training examples for the particular task; and populate a synthetic training corpus with the synthetic training examples. 10. The system of claim 9 , wherein the instructions, when executed by the processor, cause the processor to: sample predicted next tokens from an output distribution of the task-adapted generative model to provide a diverse set of synthetic training examples. 11. The system of claim 10 , wherein the instructions, when executed by the processor, cause the processor to: receive input designating a requested diversity of the synthetic training examples; and sample the output distribution based at least on the requested diversity. 12. The system of claim 11 , wherein the instructions, when executed by the processor, cause the processor to: select a specified number of predicted next tokens from the output distribution based at least on the requested diversity. 13. The system of claim 11 , wherein the instructions, when executed by the processor, cause the processor to: select predicted next tokens having respective probabilities above a probability threshold from the output distribution, the probability threshold corresponding to the requested diversity. 14. The system of claim 11 , wherein the instructions, when executed by the processor, cause the processor to: identify a defined set of slot values for the synthetic training corpus; and filter out synthetic training examples produced by the task-adapted generative model that lack corresponding slot values from the defined set. 15. The system of claim 11 , wherein the instructions, when executed by the processor, cause the processor to: train a natural language understanding model using the synthetic training corpus. 16. A method comprising: obtaining a pretrained generative model that has been pretrained using a first training data set having unlabeled training examples; semantically conditioning the pretrained generative model based at least on a second training data set having dialog act-labeled utterances to obtain a semantically-conditioned generative model; tuning the semantically-conditioned generative model using a third training data set having task-specific seed examples to obtain a task-adapted generative model; and outputting the task-adapted generative model. 17. The method of claim 16 , wherein the semantically conditioning comprises: inputting individual dialog acts from the second training data set to the pretrained generative model and training the pretrained generative model to generate corresponding utterances that are labeled with the individual dialog acts. 18. The method of claim 17 , wherein the tuning comprises: inputting individual task-specific dialog acts from the third training data set to the semantically-conditioned generative model and training the semantically-conditioned generative model to generate corresponding task-specific utterances that are labeled with the individual task-specific dialog acts. 19. The method of claim 18 , wherein the semantically conditioning and the tuning comprise performing next token prediction. 20. The method of claim 16 , wherein the third training data set includes slot labels that are not present in the second training data set.
Training · CPC title
Parsing for meaning understanding · CPC title
Semantic analysis · CPC title
Natural language generation · CPC title
Lexical analysis, e.g. tokenisation or collocates · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.