Compressing image-to-image models with average smoothing

US11790565B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11790565-B2
Application numberUS-202117191970-A
CountryUS
Kind codeB2
Filing dateMar 4, 2021
Priority dateMar 4, 2021
Publication dateOct 17, 2023
Grant dateOct 17, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

System and methods for compressing image-to-image models. Generative Adversarial Networks (GANs) have achieved success in generating high-fidelity images. An image compression system and method adds a novel variant to class-dependent parameters (CLADE), referred to as CLADE-Avg, which recovers the image quality without introducing extra computational cost. An extra layer of average smoothing is performed between the parameter and normalization layers. Compared to CLADE, this image compression system and method smooths abrupt boundaries, and introduces more possible values for the scaling and shift. In addition, the kernel size for the average smoothing can be selected as a hyperparameter, such as a 3×3 kernel size. This method does not introduce extra multiplications but only addition, and thus does not introduce much computational overhead, as the division can be absorbed into the parameters after training.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of operating a generative adversarial network (GAN), comprising: receiving an image having learned parameters; using an input class of the image to determine scaling and shifting parameters in a normalization layer; and compressing the image using the determined scaling and shifting parameters by performing average smoothing between parameter layers and normalization layers to smooth abrupt boundaries where semantic information changes. 2. The method as specified in claim 1 wherein the learned parameters include spatial dependency. 3. The method as specified in claim 1 wherein the average smoothing generates a plurality of values for the scaling and shifting parameters. 4. The method as specified in claim 1 further comprising using an inception-based residual block containing a kernel. 5. The method as specified in claim 4 wherein the kernel has a kernel size selected from different kernel sizes. 6. The method as specified in claim 4 wherein the inception-based block incorporates depth-wise convolutional layers. 7. The method as specified in claim 1 , wherein the GAN is stored on a mobile computing device. 8. The method as specified in claim 1 , wherein the GAN is a pre-trained GAN. 9. A system comprising: a processor; and a memory storing computer readable instructions that, when executed by the processor, configure the system to perform operations comprising: receiving an image having learned parameters; using an input class of the image to determine scaling and shifting parameters in a normalization layer; and compressing the image using the determined scaling and shifting parameters by performing average smoothing between parameter layers and normalization layers to smooth abrupt boundaries where semantic information changes. 10. The system as specified in claim 9 wherein the learned parameters include spatial dependency. 11. The system as specified in claim 9 wherein the average smoothing generates a plurality of values for the scaling and shifting parameters. 12. The system as specified in claim 9 further comprising using an inception-based residual block containing a kernel. 13. The system as specified in claim 12 wherein the kernel has a kernel size selected from different kernel sizes. 14. The system as specified in claim 12 wherein the inception-based block incorporates depth-wise convolutional layers. 15. The system as specified in claim 9 , wherein a generative adversial network (“GAN”) is stored on a mobile computing device. 16. The system as specified in claim 15 , wherein the GAN is a pre-trained GAN. 17. A non-transitory computer-readable storage medium, the computer-readable storage medium including instructions that when executed by a computer, cause the computer to perform operations comprising: receiving an image having learned parameters; using an input class of the image to determine scaling and shifting parameters in a normalization layer; and compressing the image using the determined scaling and shifting parameters by performing average smoothing between parameter layers and normalization layers to smooth abrupt boundaries where semantic information changes. 18. The computer-readable storage medium of claim 17 , wherein the learned parameters include spatial dependency. 19. The computer-readable storage medium of claim 17 , wherein the average smoothing generates a plurality of values for the scaling and shifting parameters. 20. The computer-readable storage medium of claim 17 , further comprising instructions to use an inception-based residual block containing a kernel.

Assignees

Inventors

Classifications

  • Adversarial learning · CPC title

  • Transfer learning · CPC title

  • Hyperparameter optimisation; Meta-learning; Learning-to-learn · CPC title

  • modifying the architecture, e.g. adding, deleting or silencing nodes or connections · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11790565B2 cover?
System and methods for compressing image-to-image models. Generative Adversarial Networks (GANs) have achieved success in generating high-fidelity images. An image compression system and method adds a novel variant to class-dependent parameters (CLADE), referred to as CLADE-Avg, which recovers the image quality without introducing extra computational cost. An extra layer of average smoothing is…
Who is the assignee on this patent?
Ren Jian, Chai Menglei, Tulyakov Sergey, and 2 more
What technology area does this patent fall under?
Primary CPC classification G06N3/0495. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Oct 17 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).