Cascaded domain bridging for image generation

US12260485B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12260485-B2
Application numberUS-202218046077-A
CountryUS
Kind codeB2
Filing dateOct 12, 2022
Priority dateOct 12, 2022
Publication dateMar 25, 2025
Grant dateMar 25, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method of generating a style image is described. The method includes receiving an input image of a subject. The method further includes encoding the input image using a first encoder of a generative adversarial network (GAN) to obtain a first latent code. The method further includes decoding the first latent code using a first decoder of the GAN to obtain a normalized style image of the subject, wherein the GAN is trained using a loss function according to semantic regions of the input image and the normalized style image.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of generating a style image, the method comprising: receiving an input image of a subject; encoding the input image using a first encoder of a generative adversarial network (GAN) to obtain a first latent code; decoding the first latent code using a first decoder of the GAN to obtain a normalized style image of the subject, wherein: the GAN is trained using a loss function according to semantic regions of the input image and the normalized style image, and a distribution prior of a W+ space is modeled for training the GAN by inverting a dataset of real face images using a second encoder that is pre-trained. 2. The method of claim 1 , further comprising training the GAN by inverting the dataset of real face images to obtain a plurality of latent codes. 3. The method of claim 1 , the second encoder is different from the first decoder. 4. The method of claim 1 , wherein the second encoder is a pre-trained StyleGAN encoder. 5. The method of claim 1 , wherein training the GAN further comprises performing a W+ space transfer learning from the second encoder to the first encoder. 6. The method of claim 5 , wherein performing the W+ space transfer learning comprises using a normalized exemplar set with only neutral expressions of the subject. 7. The method of claim 5 , wherein performing the W+ space transfer learning comprises using a normalized exemplar set with only neutral poses of the subject. 8. The method of claim 5 , wherein performing the W+ space transfer learning comprises using a normalized exemplar set with only neutral lighting of the subject. 9. The method of claim 1 , further comprising training the GAN using a difference between a first face segmentation model trained using real face images and a second face segmentation model using style exemplars as the loss function. 10. The method of claim 9 , wherein the semantic regions include one or more of hair regions of the subject or skin regions of the subject. 11. A system for generating a style image, the system comprising: a processor; and memory storing instructions that, when executed by the processor, cause the system to perform a set of operations, the set of operations comprising: receiving an input image of a subject; encoding the input image using a first encoder of a generative adversarial network (GAN) to obtain a first latent code; decoding the first latent code using a first decoder of the GAN to obtain a normalized style image of the subject, wherein: the GAN is trained using a loss function according to semantic regions of the input image and the normalized style image, and a distribution prior of a W+ space is modeled for training the GAN by inverting a dataset of real face images using a second encoder that is pre-trained. 12. The method of claim 11 , wherein the set of operations further comprise training the GAN by inverting the dataset of real face images to obtain a plurality of latent codes. 13. The method of claim 11 , wherein the second encoder is different from the first decoder. 14. The method of claim 11 , wherein the second encoder is a pre-trained StyleGAN encoder. 15. The method of claim 11 , wherein the set of operations further comprise performing a W+ space transfer learning from the second encoder to the first encoder. 16. The method of claim 15 , wherein the set of operations further comprise using a normalized exemplar set with only neutral expressions of the subject. 17. The method of claim 15 , wherein the set of operations further comprise using a normalized exemplar set with only neutral poses of the subject. 18. The method of claim 15 , wherein the set of operations further comprise using a normalized exemplar set with only neutral lighting of the subject. 19. The method of claim 11 , wherein the set of operations further comprise training the GAN using a difference between a first face segmentation model trained using real face images and a second face segmentation model using style exemplars as the loss function. 20. The method of claim 19 , wherein the semantic regions include one or more of hair regions of the subject or skin regions of the subject.

Assignees

Inventors

Classifications

  • Texturing; Colouring; Generation of textures or colours (retouching, inpainting or scratch removal G06T5/77) · CPC title

  • Face · CPC title

  • Training; Learning · CPC title

  • Artificial neural networks [ANN] · CPC title

  • Region-based segmentation · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12260485B2 cover?
A method of generating a style image is described. The method includes receiving an input image of a subject. The method further includes encoding the input image using a first encoder of a generative adversarial network (GAN) to obtain a first latent code. The method further includes decoding the first latent code using a first decoder of the GAN to obtain a normalized style image of the subje…
Who is the assignee on this patent?
Lemon Inc
What technology area does this patent fall under?
Primary CPC classification G06T15/02. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 25 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).