Method for training image generation model, method for generating images, and devices thereof

US12475689B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12475689-B2
Application numberUS-202218263854-A
CountryUS
Kind codeB2
Filing dateJan 28, 2022
Priority dateFeb 2, 2021
Publication dateNov 18, 2025
Grant dateNov 18, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Provided is a method for training an image generation model, including: acquiring a first transformation model by training; acquiring a reconstruction model by training based on the first transformation model; acquiring a second transformation model by training; generating a grafted transformation model by grafting the first transformation model with the second transformation model; and generating the image generation model based on the reconstruction model and the grafted transformation model. The first transformation model is configured to generate a first training image according to a first noise sample. The first training image is an image of a first style. The reconstruction model is configured to map an original image sample to a latent variable corresponding to the original image sample. The second transformation model is configured to generate a second training image according to a second noise sample. The second training image is an image of a second style.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method for training an image generation model, comprising: acquiring a first transformation model by training, wherein the first transformation model is configured to generate a first training image according to a first noise sample, and the first training image is an image of a first style; acquiring a reconstruction model by training based on the first transformation model, wherein the reconstruction model is configured to map an original image sample to a latent variable corresponding to the original image sample; acquiring a second transformation model by training, wherein the second transformation model is configured to generate a second training image according to a second noise sample, and the second training image is an image of a second style; generating a grafted transformation model by grafting the first transformation model with the second transformation model; and generating the image generation model based on the reconstruction model and the grafted transformation model, wherein the image generation model is configured to transform an image to be transformed of the first style into a target image of the second style. 2 . The method according to claim 1 , wherein the first transformation model comprises a first mapping network and a first synthesis network; and acquiring the first transformation model by training comprises: acquiring a first training sample set, wherein the first training sample set comprises a plurality of first noise samples; acquiring latent variables respectively corresponding to the plurality of first noise samples by inputting the plurality of first noise samples into the first mapping network; acquiring first training images respectively corresponding to the plurality of first noise samples by inputting the latent variables respectively corresponding to the plurality of first noise samples into the first synthesis network; and adjusting a weight parameter of the first transformation model based on the first training images respectively corresponding to the plurality of first noise samples. 3 . The method according to claim 2 , wherein the first transformation model comprises a first discrimination network; and adjusting the weight parameter of the first transformation model based on the first training images respectively corresponding to the plurality of first noise samples comprises: acquiring first discrimination losses respectively corresponding to the plurality of first noise samples by inputting a plurality of first training images respectively corresponding to the plurality of first noise samples into the first discrimination network; and adjusting the weight parameter of the first transformation model based on the first discrimination losses respectively corresponding to the plurality of first noise samples. 4 . The method according to claim 1 , wherein acquiring the reconstruction model based on the first transformation model comprises: acquiring a second training sample set, wherein the second training sample set comprises a plurality of original image samples; generating latent variables respectively corresponding to the plurality of original image samples by inputting the plurality of original image samples into the reconstruction model; generating reconstructed images respectively corresponding to the plurality of original image samples by inputting the latent variables respectively corresponding to the plurality of original image samples into the first transformation model, wherein the plurality of original image samples and the reconstructed images respectively corresponding to the plurality of original image samples are images of the first style; determining losses of the reconstruction model respectively corresponding to the plurality of original image samples based on the plurality of original image samples and the reconstructed images respectively corresponding to the plurality of original image samples; and adjusting a weight parameter of the reconstruction model based on the losses of the reconstruction model respectively corresponding to the plurality of original image samples. 5 . The method according to claim 4 , wherein the first transformation model comprises a first discrimination network; and determining the losses of the reconstruction model respectively corresponding to the plurality of original image samples based on the plurality of original image samples and the reconstructed images respectively corresponding to the plurality of original image samples comprises: determining a first sub-loss based on an output result acquired by inputting each of the reconstructed images respectively corresponding to the plurality of original image samples into the first discrimination network, wherein the first sub-loss indicates a first characterization of the reconstructed image; determining a second sub-loss based on an output result acquired by inputting each of the plurality of original image samples and each of the reconstructed images respectively corresponding to the plurality of original image samples into a perceptual network, wherein the second sub-loss indicates a first degree of conformity of the original image sample to the reconstructed image corresponding to the original image sample with respect to a target feature; determining a third sub-loss based on an output result acquired by inputting each of the plurality of original image samples and each of the reconstructed images respectively corresponding to the plurality of original image samples into a regression function, wherein the third sub-loss indicates a second degree of conformity of the original image sample to the reconstructed image corresponding to the original image sample with respect to the target feature; and determining the losses of the reconstruction model based on the first sub-loss, the second sub-loss, and the third sub-loss. 6 . The method according to claim 1 , wherein during a training process, an initial weight parameter of the second transformation model is a weight parameter of the first transformation model. 7 . The method according to claim 1 , wherein generating the grafted transformation model by grafting the first transformation model with the second transformation model comprises: generating the grafted transformation model by grafting n layers of weight network in a plurality of layers of weight network of the first transformation model with m layers of weight network in a plurality of layers of weight network of the second transformation model; wherein layers of the n layers of weight network and the m layers of weight network are different, n is a positive integer, and m is a positive integer; or generating the grafted transformation model by performing a summing or averaging or difference operation on weight parameters of the plurality of layers of weight network of the first transformation model and corresponding weight parameters of the plurality of layers of weight network of the second transformation model. 8 . The method according to claim 1 , wherein generating the image generation model based on the reconstruction model and the grafted transformation model comprises: acquiring a combined transformation model by combining the reconstruction model and the grafted transformation model; acquiring a fourth training sample set, wherein the fourth training sample set comprises at least one original image sample and an image of the second style corresponding to the at least one original image sample; and generating the image generation model by fine-tuning the combined transformation model using the fourth training sample set. 9 . A computer device for training an image generation model, comprising: a pro

Assignees

Inventors

Classifications

  • G06T11/10Primary

    Texturing; Colouring; Generation of textures or colours (retouching, inpainting or scratch removal G06T5/77) · CPC title

  • Creating or editing images; Combining images with text · CPC title

  • Adversarial learning · CPC title

  • Generative networks · CPC title

  • Auto-encoder networks; Encoder-decoder networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12475689B2 cover?
Provided is a method for training an image generation model, including: acquiring a first transformation model by training; acquiring a reconstruction model by training based on the first transformation model; acquiring a second transformation model by training; generating a grafted transformation model by grafting the first transformation model with the second transformation model; and generat…
Who is the assignee on this patent?
Bigo Tech Pte Ltd
What technology area does this patent fall under?
Primary CPC classification G06T11/10. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 18 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).