Learning copy space using regression and segmentation neural networks

US10970599B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10970599-B2
Application numberUS-201816191724-A
CountryUS
Kind codeB2
Filing dateNov 15, 2018
Priority dateNov 15, 2018
Publication dateApr 6, 2021
Grant dateApr 6, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques are disclosed for characterizing and defining the location of a copy space in an image. A methodology implementing the techniques according to an embodiment includes applying a regression convolutional neural network (CNN) to an image. The regression CNN is configured to predict properties of the copy space such as size and type (natural or manufactured). The prediction is conditioned on a determination of the presence of the copy space in the image. The method further includes applying a segmentation CNN to the image. The segmentation CNN is configured to generate one or more pixel-level masks to define the location of copy spaces in the image, whether natural or manufactured, or to define the location of a background region of the image. The segmentation CNN may include a first stage comprising convolutional layers and a second stage comprising pairs of boundary refinement layers and bilinear up-sampling layers.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for learning copy space of an image, the method comprising: applying, by a processor-based system, a regression convolutional neural network (CNN) to an image, the regression CNN to make a prediction of properties of a copy space, the properties including size and type, the prediction conditioned on a determination by the regression CNN of a copy space presence in the image; applying, by the processor-based system, a segmentation CNN to the image, the segmentation CNN to generate one or more masks associated with locations of one or more of a manufactured copy space in the image, a natural copy space in the image, and a background region of the image; searching a database of images for an image that includes a copy space associated with desired properties, the searching based on results from the regression CNN and the segmentation CNN; and identifying at least one image that includes the copy space associated with the desired properties. 2. The method of claim 1 , wherein the segmentation CNN operates independently from the regression CNN. 3. The method of claim 1 , wherein the size is represented as a confidence score ranging from small to large, and the type is represented as a confidence score ranging from natural to manufactured. 4. The method of claim 1 , wherein the segmentation CNN includes a first stage comprising convolutional layers and a second stage comprising pairs of boundary refinement layers and bilinear up-sampling layers. 5. The method of claim 1 , wherein the regression CNN and the segmentation CNN are trained on a first set of annotated images and a second set of annotated images, the first set of annotated images including copy spaces, the second set of annotated images not including copy spaces; and wherein at least one of the first and second sets of annotated images includes annotations indicating one or more of copy space presence, copy space size, copy space type, and pixel classifications to indicate one or more of background, natural, and manufactured classifications. 6. The method of claim 1 , wherein the image is a frame of a video. 7. A system for learning copy space of an image, the system comprising: one or more processors; a regression convolutional neural network (CNN) module at least one of controllable and executable by the one or more processors, and configured to operate on an image and make a prediction of properties of a copy space, the properties including size and type, the prediction conditioned on a determination by the regression CNN of a copy space presence in the image; and a segmentation CNN module at least one of controllable and executable by the one or more processors, and configured to operate on the image and generate one or more masks associated with locations of one or more of a manufactured copy space in the image, a natural copy space in the image, and a background region of the image; wherein the one or more processors are configured to search a database of images for an image that includes a copy space associated with desired properties, the one or more processors being configured to search based on results from the regression CNN and the segmentation CNN; and wherein the one or more processors are further configured to identify at least one image that includes the copy space associated with the desired properties. 8. The system of claim 7 , wherein the segmentation CNN operates independently from the regression CNN. 9. The system of claim 7 , wherein the size is represented as a confidence score ranging from small to large, and the type is represented as a confidence score ranging from natural to manufactured. 10. The system of claim 7 , wherein the segmentation CNN includes a first stage comprising convolutional layers and a second stage comprising pairs of boundary refinement layers and bilinear up-sampling layers. 11. The system of claim 7 , wherein the regression CNN and the segmentation CNN are trained on annotated images. 12. The system of claim 7 , wherein the image is a frame of a video. 13. The system of claim 7 , wherein: the regression CNN and the segmentation CNN are trained on a first set of annotated images and a second set of annotated images, the first set of annotated images including copy spaces, the second set of annotated images not including copy spaces; and at least one of the first and second sets of annotated images includes annotations indicating one or more of copy space presence, copy space size, copy space type, and pixel classifications to indicate one or more of background, natural, and manufactured classifications. 14. A non-transitory computer program product having instructions encoded thereon that when executed by one or more computer processors cause the one or more computer processors to perform a process comprising: applying a regression convolutional neural network (CNN) to an image, the regression CNN to make a prediction of properties of a copy space, the properties including size and type, the prediction conditioned on a determination by the regression CNN of a copy space presence in the image; and applying a segmentation CNN to the image, the segmentation CNN to generate one or more masks associated with locations of one or more of a manufactured copy space in the image, a natural copy space in the image, and a background region of the image; wherein the regression CNN and the segmentation CNN are trained on a first set of annotated images and a second set of annotated images, the first set of annotated images including copy spaces, the second set of annotated images not including copy spaces; and wherein at least one of the first and second sets of annotated images includes annotations indicating one or more of copy space presence, copy space size, copy space type, and pixel classifications to indicate one or more of background, natural, and manufactured classifications. 15. The non-transitory computer program product of claim 14 , wherein the segmentation CNN operates independently from the regression CNN. 16. The non-transitory computer program product of claim 14 , wherein the size is represented as a confidence score ranging from small to large, and the type is represented as a confidence score ranging from natural to manufactured. 17. The non-transitory computer program product of claim 14 , wherein the segmentation CNN includes a first stage comprising convolutional layers and a second stage comprising pairs of boundary refinement layers and bilinear up-sampling layers. 18. The non-transitory computer program product of claim 14 , the process further comprising: searching a database of images for an image that includes a copy space associated with desired properties, the searching based on results from the regression CNN and the segmentation CNN; and identifying at least one image that includes the copy space associated with the desired properties. 19. The non-transitory computer program product of claim 14 , wherein the image is a frame of a video.

Assignees

Inventors

Classifications

  • by performing operations on regions, e.g. growing, shrinking or watersheds · CPC title

  • using neural networks · CPC title

  • Determination of region of interest [ROI] or a volume of interest [VOI] · CPC title

  • G06T7/11Primary

    Region-based segmentation · CPC title

  • using classification, e.g. of video objects · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10970599B2 cover?
Techniques are disclosed for characterizing and defining the location of a copy space in an image. A methodology implementing the techniques according to an embodiment includes applying a regression convolutional neural network (CNN) to an image. The regression CNN is configured to predict properties of the copy space such as size and type (natural or manufactured). The prediction is conditione…
Who is the assignee on this patent?
Adobe Inc
What technology area does this patent fall under?
Primary CPC classification G06T7/11. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 06 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 10 related publications on this page (citations in our corpus or others sharing the same primary CPC).