Cross Data Set Knowledge Distillation for Training Machine Learning Models
US-2021264106-A1 · Aug 26, 2021 · US
US11769007B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11769007-B2 |
| Application number | US-202117303349-A |
| Country | US |
| Kind code | B2 |
| Filing date | May 27, 2021 |
| Priority date | May 27, 2021 |
| Publication date | Sep 26, 2023 |
| Grant date | Sep 26, 2023 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
An approach for generating synthetic treebanks to be used in training a parser in a production system is provided. A processor receives a request to generate one or more synthetic treebanks from a production system, wherein the request indicates a language for the one or more synthetic treebanks. A processor retrieves at least one corpus of text in which the requested language is present. A processor provides the at least one corpus to a transformer enhanced parser neural network model. A processor generates at least one synthetic treebank associated with a string of text from the at least one corpus of text in which the requested language is present. A processor sends the at least one synthetic treebank to the production system, wherein the production system trains a parser utilized by the production system with the at least one synthetic treebank.
Opening claim text (preview).
What is claimed is: 1. A computer-implemented method for generating synthetic treebanks to be used in training a parser in a production system, the method comprising: receiving, by one or more processors, a request to generate one or more synthetic treebanks from a production system, wherein the request indicates a language for the one or more synthetic treebanks; retrieving, by the one or more processors, at least one corpus of text in which the requested language is present; providing, by the one or more processors, the at least one corpus to a transformer enhanced parser neural network model; generating, by the one or more processors, at least one synthetic treebank associated with a string of text from the at least one corpus of text in which the requested language is present, wherein the at least one synthetic treebank is generated with unsupervised training of the transformer enhanced parser neural network model; and sending, by the one or more processors, the at least one synthetic treebank to the production system, wherein the production system trains a parser utilized by the production system with the at least one synthetic treebank. 2. The computer-implemented method of claim 1 , wherein the at least one corpus of text includes a corpus directed towards a limited language or domain. 3. The computer-implemented method of claim 2 , wherein the transformer enhanced parser neural network model includes one of the following pretrained transformer models: a bidirectional encoder representations for transformers (BERT) model or a cross-lingual language model (XLM). 4. The computer-implemented method of claim 1 , the transformer enhanced parser neural network model includes a neural-network parser. 5. The computer-implemented method of claim 4 , wherein the parser utilized by the production system is of lower quality than the neural-network parser. 6. The computer-implemented method of claim 1 , wherein the transformer enhanced parser neural network model separates one or more words of the at least one corpus of text into subwords. 7. A computer program product for generating synthetic treebanks to be used in training of a parser in a production system, the computer program product comprising: one or more computer-readable storage media and program instructions stored on the one or more computer-readable storage media, the program instructions comprising: program instructions to receive a request to generate one or more synthetic treebanks from a production system, wherein the request indicates a language for the one or more synthetic treebanks; program instructions to retrieve at least one corpus of text in which the requested language is present; program instructions to provide the at least one corpus to a transformer enhanced parser neural network model; program instructions to generate at least one synthetic treebank associated with a string of text from the at least one corpus of text in which the requested language is present, wherein the at least one synthetic treebank is generated with unsupervised training of the transformer enhanced parser neural network model; and program instructions to send the at least one synthetic treebank to the production system, wherein the production system trains a parser utilized by the production system with the at least one synthetic treebank. 8. The computer program product of claim 7 , wherein the at least one corpus of text includes a corpus directed towards a limited language or domain. 9. The computer program product of claim 8 , wherein the transformer enhanced parser neural network model includes one of the following pretrained transformer models: a bidirectional encoder representations for transformers (BERT) model or a cross-lingual language model (XLM). 10. The computer program product of claim 7 , the transformer enhanced parser neural network model includes a neural-network parser. 11. The computer program product of claim 10 , wherein the parser utilized by the production system is of lower quality than the neural-network parser. 12. The computer program product of claim 7 , wherein the transformer enhanced parser neural network model separates one or more words of the at least one corpus of text into subwords. 13. A computer system for generating synthetic treebanks to be used in training of a parser in a production system, the computer system comprising: one or more computer processors; one or more computer readable storage media; and program instructions stored on the computer readable storage media for execution by at least one of the one or more processors, the program instructions comprising: program instructions to receive a request to generate one or more synthetic treebanks from a production system, wherein the request indicates a language for the one or more synthetic treebanks; program instructions to retrieve at least one corpus of text in which the requested language is present; program instructions to provide the at least one corpus to a transformer enhanced parser neural network model; program instructions to generate at least one synthetic treebank associated with a string of text from the at least one corpus of text in which the requested language is present, wherein the at least one synthetic treebank is generated with unsupervised training of the transformer enhanced parser neural network model; and program instructions to send the at least one synthetic treebank to the production system, wherein the production system trains a parser utilized by the production system with the at least one synthetic treebank. 14. The computer system of claim 13 , wherein the at least one corpus of text includes a corpus directed towards a limited language or domain. 15. The computer system of claim 14 , wherein the transformer enhanced parser neural network model includes one of the following pretrained transformer models: a bidirectional encoder representations for transformers (BERT) model or a cross-lingual language model (XLM). 16. The computer system of claim 13 , the transformer enhanced parser neural network model includes a neural-network parser. 17. The computer system of claim 16 , wherein the parser utilized by the production system is of lower quality than the neural-network parser. 18. The computer system of claim 13 , wherein the transformer enhanced parser neural network model separates one or more words of the at least one corpus of text into subwords.
Supervised learning · CPC title
Weakly supervised learning, e.g. semi-supervised or self-supervised learning · CPC title
Auto-encoder networks; Encoder-decoder networks · CPC title
Parsing · CPC title
Machine-assisted translation, e.g. using translation memory · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.