Methods, apparatus, and articles of manufacture for interactive image segmentation

US12045986B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12045986-B2
Application numberUS-202017131525-A
CountryUS
Kind codeB2
Filing dateDec 22, 2020
Priority dateDec 22, 2020
Publication dateJul 23, 2024
Grant dateJul 23, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods, apparatus, systems, and articles of manufacture are disclosed for interactive image segmentation. An example apparatus includes an inception controller to execute an inception sublayer of a convolutional neural network (CNN) including two or more inception-atrous-collation (IAC) layers, the inception sublayer including two or more convolutions including respective kernels of varying sizes to generate multi-scale inception features, the inception sublayer to receive one or more context features indicative of user input; an atrous controller to execute an atrous sublayer of the CNN, the atrous sublayer including two or more atrous convolutions including respective kernels of varying sizes to generate multi-scale atrous features; and a collation controller to execute a collation sublayer of the CNN to collate the multi-scale inception features, the multi-scale atrous features, and eidetic memory features.

First claim

Opening claim text (preview).

What is claimed is: 1. An apparatus for interactive image segmentation, the apparatus comprising: an inception controller to execute an inception sublayer of a convolutional neural network (CNN) including two or more inception-atrous-collation (IAC) layers, the inception sublayer including two or more convolutions including respective kernels of varying sizes to generate multi-scale inception features, the inception sublayer to receive one or more context features indicative of user input; an atrous controller to execute an atrous sublayer of the CNN, the atrous sublayer including two or more atrous convolutions including respective kernels of varying sizes to generate multi-scale atrous features; and a collation controller to execute a collation sublayer of the CNN to collate the multi-scale inception features, the multi-scale atrous features, and eidetic memory features. 2. The apparatus of claim 1 , wherein the inception controller is to bypass the inception sublayer in response to determining that bypassing the inception sublayer would be advantageous to the CNN. 3. The apparatus of claim 2 , wherein the inception controller is to bypass the inception sublayer to preserve data in one or more input feature matrices to the inception sublayer. 4. The apparatus of claim 1 , wherein the atrous controller is to bypass the atrous sublayer in response to determining that bypassing the atrous sublayer would be advantageous to the CNN. 5. The apparatus of claim 4 , wherein the atrous controller is to bypass the atrous sublayer to preserve data in the inception features. 6. The apparatus of claim 1 , wherein the collation controller is to bypass a subsequent IAC layer in response to determining that bypassing the subsequent IAC layer would be advantageous to the CNN. 7. The apparatus of claim 1 , wherein the user input corresponds to one or more selections in an image to be processed, the one or more selections identifying one or more pixels within a threshold distance of one or more respective selection epicenters and a corresponding respective gradient specifying a likelihood that the one or more pixels are within the threshold distance. 8. A non-transitory computer-readable medium comprising instructions which, when executed, cause at least one processor to at least: implement an inception sublayer of a convolutional neural network (CNN) including two or more inception-atrous-collation (IAC) layers, the inception sublayer including two or more convolutions including respective kernels of varying sizes to generate multi-scale inception features, the inception sublayer to receive one or more context features indicative of user input; implement an atrous sublayer of the CNN, the atrous sublayer including two or more atrous convolutions including respective kernels of varying sizes to generate multi-scale atrous features; and implement a collation sublayer of the CNN to collate the multi-scale inception features, the multi-scale atrous features, and eidetic memory features. 9. The non-transitory computer-readable medium of claim 8 , wherein the instructions, when executed, cause the at least one processor to bypass the inception sublayer in response to determining that bypassing the inception sublayer would be advantageous to the CNN. 10. The non-transitory computer-readable medium of claim 9 , wherein the instructions, when executed, cause the at least one processor to bypass the inception sublayer to preserve data in one or more input feature matrices to the inception sublayer. 11. The non-transitory computer-readable medium of claim 8 , wherein the instructions, when executed, cause the at least one processor to bypass the atrous sublayer in response to determining that bypassing the atrous sublayer would be advantageous to the CNN. 12. The non-transitory computer-readable medium of claim 11 , wherein the instructions, when executed, cause the at least one processor to bypass the atrous sublayer to preserve data in the inception features. 13. The non-transitory computer-readable medium of claim 8 , wherein the instructions, when executed, cause the at least one processor to bypass a subsequent IAC layer in response to determining that bypassing the subsequent IAC layer would be advantageous to the CNN. 14. The non-transitory computer-readable medium of claim 8 , wherein the user input corresponds to one or more selections in an image to be processed, the one or more selections identifying one or more pixels within a threshold distance of one or more respective selection epicenters and a corresponding respective gradient specifying a likelihood that the one or more pixels are within the threshold distance. 15. An apparatus for interactive image segmentation, the apparatus comprising: memory; and at least one processor to execute machine readable instructions to cause the at least one processor to: implement an inception sublayer of a convolutional neural network (CNN) including two or more inception-atrous-collation (IAC) layers, the inception sublayer including two or more convolutions including respective kernels of varying sizes to generate multi-scale inception features, the inception sublayer to receive one or more context features indicative of user input; implement an atrous sublayer of the CNN, the atrous sublayer including two or more atrous convolutions including respective kernels of varying sizes to generate multi-scale atrous features; and implement a collation sublayer of the CNN to collate the multi-scale inception features, the multi-scale atrous features, and eidetic memory features. 16. The apparatus of claim 15 , wherein the at least one processor is to bypass the inception sublayer in response to determining that bypassing the inception sublayer would be advantageous to the CNN. 17. The apparatus of claim 16 , wherein the at least one processor is to bypass the inception sublayer to preserve data in one or more input feature matrices to the inception sublayer. 18. The apparatus of claim 15 , wherein the at least one processor is to bypass the atrous sublayer in response to determining that bypassing the atrous sublayer would be advantageous to the CNN. 19. The apparatus of claim 18 , wherein the at least one processor is to bypass the atrous sublayer to preserve data in the inception features. 20. The apparatus of claim 15 , wherein the at least one processor is to bypass a subsequent IAC layer in response to determining that bypassing the subsequent IAC layer would be advantageous to the CNN. 21. The apparatus of claim 15 , wherein the user input corresponds to one or more selections in an image to be processed, the one or more selections identifying one or more pixels within a threshold distance of one or more respective selection epicenters and a corresponding respective gradient specifying a likelihood that the one or more pixels are within the threshold distance. 22. A method for interactive image segmentation, the method comprising: executing an inception sublayer of a convolutional neural network (CNN) including two or more inception-atrous-collation (IAC) layers, the inception sublayer including two or more convolutions including respective kernels of varying sizes to generate multi-scale inception features, the inception sublayer to receive one or more context features indicative of user input; executing an atrous sublayer of the CNN, the atrous sublayer including two or more atrous convolutions including respective kernels of varying sizes to generate m

Assignees

Inventors

Classifications

  • Supervised learning · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN] · CPC title

  • using neural networks · CPC title

  • using classification, e.g. of video objects · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12045986B2 cover?
Methods, apparatus, systems, and articles of manufacture are disclosed for interactive image segmentation. An example apparatus includes an inception controller to execute an inception sublayer of a convolutional neural network (CNN) including two or more inception-atrous-collation (IAC) layers, the inception sublayer including two or more convolutions including respective kernels of varying si…
Who is the assignee on this patent?
Intel Corp
What technology area does this patent fall under?
Primary CPC classification G06T7/11. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 23 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).