Generating stylized-stroke images from source images utilizing style-transfer-neural networks with non-photorealistic-rendering

US10748324B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10748324-B2
Application numberUS-201816184289-A
CountryUS
Kind codeB2
Filing dateNov 8, 2018
Priority dateNov 8, 2018
Publication dateAug 18, 2020
Grant dateAug 18, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

This disclosure relates to methods, non-transitory computer readable media, and systems that integrate (or embed) a non-photorealistic rendering (“NPR”) generator with a style-transfer-neural network to generate stylized images that both correspond to a source image and resemble a stroke style. By integrating an NPR generator with a style-transfer-neural network, the disclosed methods, non-transitory computer readable media, and systems can accurately capture a stroke style resembling one or both of stylized edges or stylized shadings. When training such a style-transfer-neural network, the integrated NPR generator can enable the disclosed methods, non-transitory computer readable media, and systems to use real-stroke drawings (instead of conventional paired-ground-truth drawings) for training the network to accurately portray a stroke style. In some implementations, the disclosed methods, non-transitory computer readable media, and systems can either train or apply a style-transfer-neural network that captures a variety of stroke styles, such as different edge-stroke styles or shading-stroke styles.

First claim

Opening claim text (preview).

We claim: 1. A non-transitory computer readable medium storing instructions thereon that, when executed by at least one processor, cause a computer system to: generate, utilizing a non-photorealistic-rendering (“NPR”) generator corresponding to a style-transfer-neural network, a simplified image of a source image, the simplified image depicting NPR features extracted from the source image and the source image exhibiting a stroke style; extract, utilizing an encoder of the style-transfer-neural network, a feature map from the simplified image depicting NPR features extracted from the source image; and generate, utilizing a decoder of the style-transfer-neural network, a stylized-stroke image corresponding to the source image and exhibiting the stroke style by decoding the feature map. 2. The non-transitory computer readable medium of claim 1 , further comprising instructions that, when executed by the at least one processor, cause the computer system to: receive an indication of a user selection of a stroke-style setting corresponding to the stroke style from among a first stroke-style setting and a second stroke-style setting; and based on receiving the indication of the user selection of the stroke-style setting, generate the stylized-stroke image exhibiting the stroke style. 3. The non-transitory computer readable medium of claim 1 , wherein the stoke style comprises one of pencil, woodcut, ink, crayon, or charcoal. 4. The non-transitory computer readable medium of claim 1 , wherein the instructions, when executed by the at least one processor, cause the computer system to: receive an indication of a user selection of a shading-style setting corresponding to a shading-stroke style from among a first shading-style setting and a second shading-style setting; and generate the stylized-stroke image exhibiting the stroke style by generating a stylized-shading image comprising shading exhibiting the shading-stroke style based on receiving the indication of the user selection of the shading-style setting. 5. The non-transitory computer readable medium of claim 4 , wherein the shading-stroke style comprise one of lines, crossed, smudge, or stippling. 6. The non-transitory computer readable medium of claim 1 , wherein: the style-transfer-neural network comprises an edge-style-transfer-neural network trained to generate stylized-edge images exhibiting an edge-stroke style; the decoder of the style-transfer-neural network comprises a decoder of the edge-style-transfer-neural network; the stroke style comprises the edge-stroke style; and the instructions, when executed by the at least one processor, cause the computer system to generate the stylized-stroke image by generating, utilizing the decoder of the edge-style-transfer-neural network, a stylized-edge image corresponding to the source image, the stylized-edge image comprising edges exhibiting the edge-stroke style. 7. The non-transitory computer readable medium of claim 1 , wherein: the style-transfer-neural network comprises a shading-style-transfer-neural network trained to generate stylized-shading images exhibiting a shading-stroke style; the decoder of the style-transfer-neural network comprises a decoder of the shading-style-transfer-neural network; the stroke style comprises the shading-stroke style; and the instructions, when executed by the at least one processor, cause the computer system to generate the stylized-stroke image by generating, utilizing the decoder of the shading-style-transfer-neural network, a stylized-shading image corresponding to the source image, the stylized-shading image comprising shading exhibiting the shading-stroke style. 8. The non-transitory computer readable medium of claim 7 , further comprising instructions that, when executed by the at least one processor, cause the computer system to: generate, utilizing an additional NPR generator, an additional simplified image of the source image; based on the additional simplified image, generate, utilizing an edge-style-transfer-neural network, a stylized-edge image corresponding to the source image and comprising edges exhibiting an edge-stroke style; generate, utilizing a style-fusion-neural network, a fusion map for synthesizing stroke styles from the stylized-edge image and the stylized-shading image; and based on the fusion map, generate a style-fusion image comprising the edges exhibiting the edge-stroke style and the shading exhibiting the shading-stroke style. 9. The non-transitory computer readable medium of claim 1 , further comprising instructions that, when executed by the at least one processor, cause the computer system to generate the simplified image depicting NPR features extracted from the source image by: generating the simplified image comprising edge depictions from the source image utilizing an extended difference-of-gaussians operator; or generating the simplified image comprising a contrast abstraction from the source image utilizing an objective-abstraction function. 10. The non-transitory computer readable medium of claim 1 , wherein the source image comprises a natural photograph. 11. The non-transitory computer readable medium of claim 1 , further comprising instructions that, when executed by the at least one processor, cause the computer system to train the style-transfer-neural network by: generating, utilizing the NPR generator, a simplified-training image of a stroke-training image exhibiting the stroke style; extracting, utilizing the encoder of the style-transfer-neural network, a training-feature map from the simplified-training image; based on the training-feature map, generating, utilizing the decoder of the style-transfer-neural network, a stylized-stroke-sample image exhibiting the stroke style; determining an adversarial loss using an adversarial-loss function based on a discriminator-neural network comparing the stylized-stroke-sample image and a real-stroke drawing; determining a reconstruction loss from a reconstruction-loss function based on a comparison of the stylized-stroke-sample image and the stroke-training image; and adjusting network parameters of the style-transfer-neural network based on the determined adversarial loss and the determined reconstruction loss. 12. The non-transitory computer readable medium of claim 11 , wherein: the style-transfer-neural network comprises an edge-style-transfer-neural network, the decoder of the style-transfer-neural network comprises a decoder of the edge-style-transfer-neural network, the stroke style comprises an edge-stroke style, and the stroke-training image comprises an edge-stroke-training image; and the instructions, when executed by the at least one processor, cause the computer system to: generate the simplified-training image of the stroke-training image by generating, utilizing the NPR generator, the simplified-training image comprising edge depictions from the edge-stroke-training image; and generate the stylized-stroke-sample image by generating, utilizing the decoder of the edge-style-transfer-neural network, a stylized-edge-sample image exhibiting the edge-stroke style. 13. The non-transitory computer readable medium of claim 11 , wherein: the style-transfer-neural network comprises a shading-style-transfer-neural network, the decoder of the style-transfer-neural network comprises a decoder of the shading-style-transfer-neural network, the stroke style comprises a shading-stroke style, and the stroke-training image comprises a shading-stroke-training image; and the instructions, when executed by the at least one processor, cause the computer system to: generate the simplified-training

Assignees

Inventors

Classifications

  • Recurrent networks, e.g. Hopfield networks · CPC title

  • Probabilistic or stochastic networks · CPC title

  • Combinations of networks · CPC title

  • Texturing; Colouring; Generation of textures or colours (retouching, inpainting or scratch removal G06T5/77) · CPC title

  • Generative networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10748324B2 cover?
This disclosure relates to methods, non-transitory computer readable media, and systems that integrate (or embed) a non-photorealistic rendering (“NPR”) generator with a style-transfer-neural network to generate stylized images that both correspond to a source image and resemble a stroke style. By integrating an NPR generator with a style-transfer-neural network, the disclosed methods, non-tran…
Who is the assignee on this patent?
Adobe Inc
What technology area does this patent fall under?
Primary CPC classification G06T15/02. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 18 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).