Generating user-customized items using a visually-aware image generation network

US10970765B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10970765-B2
Application numberUS-201815897856-A
CountryUS
Kind codeB2
Filing dateFeb 15, 2018
Priority dateFeb 15, 2018
Publication dateApr 6, 2021
Grant dateApr 6, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure relates to a personalized fashion generation system that synthesizes user-customized images using deep learning techniques based on visually-aware user preferences. In particular, the personalized fashion generation system employs an image generative adversarial neural network and a personalized preference network to synthesize new fashion items that are individually customized for a user. Additionally, the personalized fashion generation system can modify existing fashion items to tailor the fashion items to a user's tastes and preferences.

First claim

Opening claim text (preview).

What is claimed is: 1. A system for synthesizing user-customized images based on latent user preferences comprising: a memory comprising: an image generative adversarial network trained to generate realistic images of items; and a preference predictor network trained to determine visual preferences of individual users; at least one processor; and at least one non-transitory computer-readable storage medium storing instructions that, when executed by the at least one processor, cause the system to: generate a first realistic synthesized image utilizing the image generative adversarial network based on a latent code identified from a plurality of latent codes; determine a first preference prediction score for the first realistic synthesized image utilizing the preference predictor network based on visual latent user features of a user; generate a second realistic synthesized image utilizing the image generative adversarial network that is customized for the user based on a modified latent code modified according to the first preference prediction score, wherein the preference predictor network generates a second preference prediction score for the second realistic synthesized image that is greater than the first preference prediction score; and provide the second realistic synthesized image customized for the user to a client device associated with the user. 2. The system of claim 1 , wherein: the image generative adversarial network is trained utilizing a corpus of images of fashion items corresponding to fashion categories to identify latent representations of visual fashion characteristics; and the realistic synthesized second image customized for the user comprises a synthesized image of a new fashion item generated for the user. 3. The system of claim 2 , further comprising instructions that, when executed by the at least one processor, cause the system to identify the plurality of latent codes within random latent space of the image generative adversarial network. 4. The system of claim 2 , wherein the image generative adversarial network comprises: a generator neural network trained on the corpus of fashion images to generate synthesized images of fashion items; and a discriminator neural network trained on the corpus of fashion images to determine when generated synthesized images of new fashion items resemble real images of fashion items. 5. The system of claim 4 , wherein the image generative adversarial network trains in an unsupervised manner using the corpus of fashion images; the image generative adversarial network alternates training the generator neural network and the discriminator neural network using objective functions via back propagation and least squares loss; and the trained generator neural network generates synthesized images of fashion items following a same distribution of images from the corpus of fashion items. 6. The system of claim 1 , further comprising instructions that, when executed by the at least one processor, cause the system to determine the modified latent code from the plurality of latent codes based on utilizing a hyper-parameter that controls a trade-off between preference prediction scores and image quality. 7. The system of claim 1 , further comprising utilizing the preference predictor network to determine the first preference prediction score for the first realistic synthesized image based on correlating visual latent user features of the user with visual latent item features generated from the first realistic synthesized image by the image generative adversarial network. 8. The system of claim 7 , further comprising instructions that, when executed by the at least one processor, cause the system to determine the modified latent code based on iteratively searching for a low-dimensional latent code within the plurality of latent codes that maximizes the preference prediction score for the user at the preference predictor network. 9. The system of claim 7 , further comprising instructions that, when executed by the at least one processor, cause the system to determine the modified latent code by utilizing gradient ascent within a constrained space to identify an optimized latent code that yields a higher preference prediction score from the preference predictor network than the latent code. 10. The system of claim 9 , further comprising instructions that, when executed by the at least one processor, cause the system to repeat identifying the modified latent code by: randomizing initial positions of latent codes within random latent space of the image generative adversarial network for a predetermined number of iterations; based on the visual latent user features of the user, identify one or more optimized latent codes within the random latent space that yield higher preference prediction score for the user than the first preference prediction score; and selecting the modified latent code from the one or more optimized latent codes based on the higher preference prediction scores for the user. 11. The system of claim 1 , wherein the plurality of latent codes comprises latent codes combined with random noise vectors within random latent space of the image generative adversarial network. 12. The system of claim 1 , further comprising instructions that, when executed by the at least one processor, cause the system to: generate a plurality of realistic synthesized images customized for the user; and provide the plurality of realistic synthesized images customized for the user based on employing a probabilistic selection algorithm to increase diversity among the provided plurality of realistic synthesized images customized for the user. 13. A non-transitory computer-readable medium storing instructions that, when executed by at least one processor, cause a computer system to: generate a first realistic synthesized image at an image generative adversarial network utilizing a latent code identified from a plurality of latent codes; utilize a preference predictor network to determine a first preference prediction score for the first realistic synthesized image based on correlating visual latent user features of a user with visual latent item features generated from the first realistic synthesized image; modify the latent code at the image generative adversarial network based on the first preference prediction score; generate a second realistic synthesized image customized for the user at the image generative adversarial network utilizing the modified latent code, wherein the preference predictor network generates a second preference prediction score for the second realistic synthesized image that is greater than the first preference prediction score; and provide the second realistic synthesized image customized for the user to a client device associated with the user. 14. The non-transitory computer-readable of claim 13 , further comprising instructions that, when executed by at least one processor, cause a computer system to modify the latent code by iteratively searching for one or more latent codes within random latent space of the image generative adversarial network that maximize preference a predictor score for the user at the preference predictor network. 15. The non-transitory computer-readable of claim 13 , wherein the image generative adversarial network comprises: a generator neural network trained on a corpus of fashion images to generate synthesized images of fashion items; and a discriminator neural network trained on the corpus of fashion images to determine when generated synthesized images of new fashion items resemble real images

Assignees

Inventors

Classifications

  • Probabilistic or stochastic networks · CPC title

  • Combinations of networks · CPC title

  • Generative networks · CPC title

  • Supervised learning · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10970765B2 cover?
The present disclosure relates to a personalized fashion generation system that synthesizes user-customized images using deep learning techniques based on visually-aware user preferences. In particular, the personalized fashion generation system employs an image generative adversarial neural network and a personalized preference network to synthesize new fashion items that are individually cust…
Who is the assignee on this patent?
Adobe Inc, Univ California
What technology area does this patent fall under?
Primary CPC classification G06Q30/0621. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 06 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 7 related publications on this page (citations in our corpus or others sharing the same primary CPC).