Computer vision system and method

US10769744B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10769744-B2
Application numberUS-201816176801-A
CountryUS
Kind codeB2
Filing dateOct 31, 2018
Priority dateOct 31, 2018
Publication dateSep 8, 2020
Grant dateSep 8, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An image processing method for segmenting an image, the method comprising: receiving first image; producing a second image from said first image, wherein said second image is a lower resolution representation of said first image; processing said first image with a first processing stage to produce a first feature map; processing said second image with a second processing stage to produce a second feature map; and combining the first feature map with the second feature map to produce a semantic segmented image; wherein the first processing stage comprises a first neural network comprising at least one separable convolution module configured to perform separable convolution and said second processing stage comprises a second neural network comprising at least one separable convolution module configured to perform separable convolution; the number of layers in the first neural network being smaller than the number of layers in the second neural network.

First claim

Opening claim text (preview).

The invention claimed is: 1. An image processing method for segmenting an image, the method comprising: receiving first image; producing a second image from said first image, wherein said second image is a lower resolution representation of said first image; processing said first image with a first processing stage to produce a first feature map; processing said second image with a second processing stage to produce a second feature map; and combining the first feature map with the second feature map to produce a semantic segmented image; wherein the first processing stage comprises a first neural network comprising at least one separable convolution module configured to perform separable convolution and said second processing stage comprises a second neural network comprising at least one separable convolution module configured to perform separable convolution; a number of layers in the first neural network being smaller than a number of layers in the second neural network. 2. An image processing method according to claim 1 , wherein the first feature map and the second feature map are combined at just one stage. 3. An image processing method according to claim 2 , wherein the first feature map and the second feature map are combined by adding. 4. An image processing method according to claim 2 , wherein the first feature map and the second feature map are combined at a fusion stage, wherein said fusion stage comprises upsampling the second feature map and adding the first feature map to the second feature map wherein adding comprises adding corresponding values of the upsampled second feature map and the first feature map. 5. An image processing method according to claim 4 , wherein upsampled second feature map is subjected to depthwise convolution prior to adding to the first feature map, wherein the depthwise convolution is performed with a dilation factor that is greater than 1. 6. An image processing method according to claim 5 , wherein upsampled second feature map has been subjected to depthwise convolution with a dilation factor that is greater than 1 and the first feature map are subjected to two dimensional convolution prior to adding. 7. An image processing method according to claim 1 , wherein separable convolution modules in the second processing stage are depthwise convolution modules. 8. An image processing method according to claim 1 , wherein separable convolution modules in the second processing stage are bottleneck architecture modules. 9. An image processing method according to claim 1 , wherein separable convolution modules in the second processing stage are bottleneck residual architecture modules. 10. An image processing method according to claim 1 , wherein the second image is subjected to standard convolution prior to be processed by said separable convolution modules. 11. An image processing method, according to claim 9 , wherein there are a plurality of residual bottleneck architecture modules. 12. An image processing method according to claim 11 , wherein a final standard convolution module is provided after the residual bottleneck architecture modules. 13. An image processing method according to claim 12 , wherein a pyramid pooling module is provided between the final standard convolution module and the residual bottleneck architecture modules. 14. An image processing method according to claim 1 , wherein the first processing stage comprises a plurality of depthwise separable convolution modules. 15. A method of training a model, said model for segmenting an image, the model comprising: a first neural network having a plurality of layers comprising at least one separable convolution module configured to perform separable convolutions; a second neural network with a plurality of layers, comprising at least one separable convolution module configured to perform separable convolutions; a number of layers in the first neural network being smaller than a number of layers in the second neural network, the first neural network being configured to process an image at a first resolution and the second neural network being configured to process the same image at a lower resolution, a feature map of the first and second neural networks being combined by addition at a single stage; the training method comprising: providing training data, the training data comprising images and semantic segmented information concerning said images; training said model using said images as input and the semantic segmented information as output, wherein two stages are trained together. 16. A method according to claim 15 , further comprising increasing a number of filters during training, and reducing their number to discard filters of lower importance. 17. A method according to claim 15 , further comprising adapting the model during training to add a second output on said second processing stage, the method further comprising training using the images as input and determining loss by comparing with both the semantic segmented information at both the output and at the second output and updating weights during training by using the determined losses from both outputs. 18. An image processing system for segmenting an image, the system comprising: an interface and a processor, said interface having an image input and being adapted to receive a first image, said processor being adapted to; produce a second image from said first image, wherein said second image is a lower resolution representation of said first image; process said first image with a first processing stage to produce a first feature map; process said second image with a second processing stage to produce a second feature map; and combine the first feature map with the second feature map to produce a semantic segmented image; wherein the first processing stage comprises a first neural network comprising at least one separable convolution module configured to perform separable convolution and said second processing stage comprises a second neural network comprising at least one separable convolution module configured to perform separable convolution; a number of layers in the first neural network being smaller than a number of layers in the second neural network. 19. A detection system for a vehicle, said detection system, comprising the image processing system of claim 18 , being adapted to receive an image and determine objects from said image by segmenting said image. 20. A non-transitory carrier medium carrying computer readable instructions adapted to cause a computer to perform the method of claim 1 .

Assignees

Inventors

Classifications

  • using neural networks · CPC title

  • using classification, e.g. of video objects · CPC title

  • G06T7/11Primary

    Region-based segmentation · CPC title

  • G06T1/0014Primary

    Image feed-back for automatic industrial control, e.g. robot with camera (robots B25J19/023) · CPC title

  • Combinations of networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10769744B2 cover?
An image processing method for segmenting an image, the method comprising: receiving first image; producing a second image from said first image, wherein said second image is a lower resolution representation of said first image; processing said first image with a first processing stage to produce a first feature map; processing said second image with a second processing stage to…
Who is the assignee on this patent?
Toshiba Kk
What technology area does this patent fall under?
Primary CPC classification G06T7/11. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 08 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).