Iteratively applying neural networks to automatically identify pixels of salient objects portrayed in digital images

US11244195B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11244195-B2
Application numberUS-201815967928-A
CountryUS
Kind codeB2
Filing dateMay 1, 2018
Priority dateMay 1, 2018
Publication dateFeb 8, 2022
Grant dateFeb 8, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure relates to systems, method, and computer readable media that iteratively apply a neural network to a digital image at a reduced resolution to automatically identify pixels of salient objects portrayed within the digital image. For example, the disclosed systems can generate a reduced-resolution digital image from an input digital image and apply a neural network to identify a region corresponding to a salient object. The disclosed systems can then iteratively apply the neural network to additional reduced-resolution digital images (based on the identified region) to generate one or more reduced-resolution segmentation maps that roughly indicate pixels of the salient object. In addition, the systems described herein can perform post-processing based on the reduced-resolution segmentation map(s) and the input digital image to accurately determine pixels that correspond to the salient object.

First claim

Opening claim text (preview).

What is claimed is: 1. A system comprising: at least one processor; and a non-transitory computer readable medium storing instructions thereon, that, when executed by at least one processor, cause the system to: generate a reduced-resolution digital image from an input digital image having an image resolution; apply a neural network to the reduced-resolution digital image to identify a region of the input digital image corresponding to a salient object portrayed within the input digital image; based on the identified region and the input digital image, generate a digital image of the region by processing the input digital image to isolate the identified region portraying the salient object; generate a reduced-resolution digital image of the region from the digital image of the region; apply the neural network to the reduced-resolution digital image of the region portraying the salient object to generate a reduced-resolution segmentation map of the salient object; and based on the reduced-resolution segmentation map and the input digital image, generate a segmentation map for selecting the salient object in the digital image, wherein the segmentation map has a resolution corresponding to the image resolution of the input digital image. 2. The system of claim 1 , further comprising instructions that, when executed by the at least one processor, cause the system to apply the neural network to the reduced-resolution digital image to identify a region of the input digital image corresponding to the salient object by: utilizing the neural network to generate an initial segmentation map for the reduced-resolution digital image, wherein the initial segmentation map comprises a mapping of pixels within the reduced-resolution digital image that correspond to the salient object; and identifying the region of the input digital image corresponding to the salient object based on the mapping of pixels from the initial segmentation map. 3. The system of claim 1 , further comprising instructions that, when executed by the at least one processor, cause the system to: analyze the reduced-resolution segmentation map of the salient object to identify a sub-region of the input digital image on which additional refinement is needed within the identified region of the input digital image, wherein the sub-region corresponds to a portion of the salient object portrayed within the input digital image; based on the identified sub-region of the input digital image, generate a reduced-resolution digital image of the sub-region; and apply the neural network to the reduced-resolution digital image of the sub-region to generate an additional reduced-resolution segmentation map of the portion of the salient object corresponding to the sub-region of the input digital image. 4. The system of claim 3 , wherein the reduced-resolution segmentation map of the salient object comprises confidence values corresponding to pixels within the region of the input digital image and further comprising instructions that, when executed by the at least one processor, cause the system to identify the sub-region based on the confidence values. 5. The system of claim 4 further comprising instructions that, when executed by the at least one processor, cause the system to identify the sub-region by determining that one or more confidence values corresponding to one or more pixels in the sub-region do not satisfy a threshold confidence value. 6. The system of claim 1 , further comprising instructions that, when executed by the at least one processor, cause the system to: identify an additional region of the input digital image corresponding to an additional salient object portrayed within the input digital image; apply the neural network to an additional reduced-resolution digital image of the additional region to generate an additional reduced-resolution segmentation map of the additional salient object; and generate the segmentation map based on the additional reduced-resolution segmentation map. 7. The system of claim 1 , further comprising instructions that, when executed by the at least one processor, cause the system to generate the segmentation map of the salient object having the resolution corresponding to the image resolution of the input digital image by applying at least one of a dense conditional random field (CRF) filter, a guided filter, or a graph cut filter to the reduced-resolution segmentation map of the salient object in view of the input digital image to identify pixels of the input digital image corresponding to the salient object. 8. The system of claim 1 , wherein a resolution of the reduced-resolution digital image corresponds to a resolution of the reduced-resolution segmentation map of the salient object. 9. The system of claim 1 , wherein a resolution of the reduced-resolution digital image corresponds to a resolution of the reduced-resolution digital image of the region. 10. A non-transitory computer readable medium storing instructions thereon that, when executed by at least one processor, cause a computer system to: apply a neural network to a reduced-resolution digital image of an input digital image to identify a region of the input digital image portraying a salient object; generate a digital image of the region by processing the input digital image to isolate the identified region portraying the salient object; generate a second reduced-resolution digital image of the region from the digital image of the region; generate a reduced-resolution segmentation map of the salient object by applying the neural network to the second reduced-resolution digital image of the region; based on the reduced-resolution segmentation map and the input digital image, generate a segmentation map of the salient object having a map resolution corresponding to an image resolution of the input digital image; and select pixels of the input digital image portraying the salient object based on the segmentation map. 11. The non-transitory computer readable medium of claim 10 , further comprising instructions that, when executed by the at least one processor, cause the computer system to: generate the reduced-resolution digital image of the input digital image by down-sampling the input digital image to a resolution; and generate the second reduced-resolution digital image of the region of the input digital image by down-sampling the digital image of the region to a resolution corresponding to the resolution of the reduced-resolution digital image of the input digital image. 12. The non-transitory computer readable medium of claim 10 , further comprising instructions that, when executed by the at least one processor, cause the computer system to apply the neural network to the reduced-resolution digital image of the input digital image to identify the region of the input digital image by: utilizing the neural network to generate an initial segmentation map for the reduced-resolution digital image, wherein the initial segmentation map comprises a mapping of pixels within the reduced-resolution digital image that correspond to the salient object; and identifying the region of the input digital image corresponding to the salient object based on pixels from the initial segmentation map. 13. The non-transitory computer readable medium of claim 10 , further comprising instructions that, when executed by the at least one processor, cause the computer system to generate the reduced-resolution segmentation map of the salient object by iteratively applying the neural network to one or more additional reduced-resolution digital images within the region until satisfying a threshold condition.

Assignees

Inventors

Classifications

  • G06T3/4053Primary

    based on super-resolution, i.e. the output image resolution being higher than the sensor resolution · CPC title

  • Organisation of the process, e.g. bagging or boosting · CPC title

  • using neural networks · CPC title

  • using classification, e.g. of video objects · CPC title

  • Salient features, e.g. scale invariant feature transforms [SIFT] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11244195B2 cover?
The present disclosure relates to systems, method, and computer readable media that iteratively apply a neural network to a digital image at a reduced resolution to automatically identify pixels of salient objects portrayed within the digital image. For example, the disclosed systems can generate a reduced-resolution digital image from an input digital image and apply a neural network to identi…
Who is the assignee on this patent?
Adobe Inc
What technology area does this patent fall under?
Primary CPC classification G06T3/4053. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 08 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).