Meta cooperative training paradigms

US12293284B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12293284-B2
Application numberUS-202017136054-A
CountryUS
Kind codeB2
Filing dateDec 29, 2020
Priority dateFeb 5, 2020
Publication dateMay 6, 2025
Grant dateMay 6, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Generative adversarial models have several benefits; however, due to mode collapse, these generators face a quality-diversity trade-off (i.e., the generator models sacrifice generation diversity for increased generation quality). Presented herein are embodiments that improve the performance of adversarial content generation by decelerating mode collapse. In one or more embodiments, a cooperative training paradigm is employed where a second model is cooperatively trained with the generator and helps efficiently shape the data distribution of the generator against mode collapse. Moreover, embodiments of a meta learning mechanism may be used, where the cooperative update to the generator serves as a high-level meta task and which helps ensures the generator parameters after the adversarial update stay resistant against mode collapse. In experiments, tested employments demonstrated efficient slowdown of mode collapse for the adversarial text generators. Overall, embodiments outperformed the baseline approaches with significant margins in terms of both generation quality and diversity.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for training a generator comprising: responsive to a stop condition having not been reached, performing steps comprising: sampling a set of data from a training data; using a generator model, which comprises a set of generator parameter values, to generate a set of generated data; computing an adversarial loss for the generator model using an adversarial training loss function; determining a set of intermediate generator parameter values for the generator model using the adversarial loss and gradient descent; using a set of data sampled from the training data as inputs: into a second neural network model, which comprises a second neural network model set of parameter values, to obtain one or more output distributions from the second neural network model; and into the generator model comprising the set of intermediate generator parameter values to obtain one or more output distributions from the generator model; determining a meta gradient for a cooperate training loss that comprises comparing one or more output distributions from the second neural network model with one or more corresponding output distributions from the generator model; updating a set of generator parameter values using an adversarial gradient, which is obtained using the adversarial loss for the generator model, and the meta gradient; updating a set of discriminator parameter values for a discriminator model using an adversarial loss for the discriminator model; and updating the second neural network model set of parameter values of the second neural network model using a cooperative training loss for the second neural network model; and responsive to the stop condition having been reached, outputting the generator model, which comprises a final updated set of generator parameter values. 2. The computer-implemented method of claim 1 further comprising as initial steps: initializing at least the set of generator parameter values of the generator model and the set of discriminator parameter values of the discriminator model; and pretraining the generator model using training data, the generator model, and the discriminator model. 3. The computer-implemented method of claim 2 wherein the second neural network model and the generator model share a same neural network structure and the method further comprises: using at least some of the set of generator parameter values from the pretrained generator model as parameter values for the second neural network model. 4. The computer-implemented method of claim 1 wherein the step of updating the second neural network model set of parameter values of the second neural network model using a cooperative training loss comprises: using a maximum likelihood estimation (MLE) loss function. 5. The computer-implemented method of claim 4 wherein the step of updating the second neural network model set of parameter values of the second neural network model using a cooperative training loss comprises: minimizing Kullback-Leibler divergence between: one or more outputs from the second neural network model using a set of data sampled from the training data; and one or more outputs from the second neural network model using a mixture of data sampled from the training data and data sampled from data that were generated by the generator model. 6. The computer-implemented method of claim 5 wherein the mixture comprises an equal number or approximately equal number of data from the training data and data points which were generated by the generator model. 7. The computer-implemented method of claim 1 wherein the adversarial loss for the discriminator model and the adversarial loss for the generator model are obtained by using a min-max loss function. 8. A system comprising: one or more processors; and a non-transitory computer-readable medium or media comprising one or more sets of instructions which, when executed by at least one of the one or more processors, causes steps to be performed comprising: responsive to a stop condition having not been reached, performing steps comprising: sampling a set of data from a training data having a first distribution; using a generator model, which comprises a set of generator parameter values, to generate a set of generated data; computing an adversarial loss for the generator model using an adversarial training loss function; determining a set of intermediate generator parameter values for the generator model using the adversarial loss and gradient descent; using a set of data sampled from the training data as inputs: into a second neural network model, which comprises a second neural network model set of parameter values, to obtain one or more output distributions from the second neural network model; and into the generator model comprising the set of intermediate generator parameter values to obtain one or more output distributions from the generator model; determining a meta gradient for a cooperate training loss that comprises comparing one or more output distributions from the second neural network model with one or more corresponding output distributions from the generator model; updating a set of generator parameter values using an adversarial gradient, which is obtained using the adversarial loss for the generator model, and the meta gradient; updating a set of discriminator parameter values for a discriminator model using an adversarial loss for the discriminator model; and updating the second neural network model set of parameter values of the second neural network model using a cooperative training loss for the second neural network model; and responsive to the stop condition having been reached, outputting the generator model, which comprises a final updated set of generator parameter values. 9. The system of claim 8 wherein the non-transitory computer-readable medium or media further comprises one or more sets of instructions which, when executed by at least one of the one or more processors, causes steps to be performed comprising: initializing at least the set of generator parameter values of the generator model and the set of discriminator parameter values of the discriminator model; and pretraining the generator model using training data, the generator model, and the discriminator model. 10. The system of claim 9 wherein the second neural network model and the generator model share a same neural network structure and the non-transitory computer-readable medium or media further comprises one or more sets of instructions which, when executed by at least one of the one or more processors, causes steps to be performed comprising: using at least some of the set of generator parameter values from the pretrained generator model as parameter values for the second neural network model. 11. The system of claim 8 wherein the step of updating the second neural network model set of parameter values of the second neural network model using a cooperative training loss comprises: using a maximum likelihood estimation (MLE) loss function. 12. The system of claim 11 wherein the step of updating the second neural network model set of parameter values of the second neural network model using a cooperative training loss comprises: minimizing Kullback-Leibler divergence between: one or more outputs from the second neural network model using a set of data sampled from the training data; and one or more outputs from the second neural network model using a mixture of data sampled from the training data and data sampled from data that were generated by the generator model. 13. The system of claim 12 wherei

Assignees

Inventors

Classifications

  • Generative networks · CPC title

  • Adversarial learning · CPC title

  • characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title

  • Hyperparameter optimisation; Meta-learning; Learning-to-learn · CPC title

  • Supervised learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12293284B2 cover?
Generative adversarial models have several benefits; however, due to mode collapse, these generators face a quality-diversity trade-off (i.e., the generator models sacrifice generation diversity for increased generation quality). Presented herein are embodiments that improve the performance of adversarial content generation by decelerating mode collapse. In one or more embodiments, a cooperativ…
Who is the assignee on this patent?
Baidu Usa Llc
What technology area does this patent fall under?
Primary CPC classification G06N3/08. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 06 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).