System and method for semantic segmentation using dense upsampling convolution (DUC)

US9953236B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-9953236-B1
Application numberUS-201715456219-A
CountryUS
Kind codeB1
Filing dateMar 10, 2017
Priority dateMar 10, 2017
Publication dateApr 24, 2018
Grant dateApr 24, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system and method for semantic segmentation using dense upsampling convolution (DUC) are disclosed. A particular embodiment includes: receiving an input image; producing a feature map from the input image; performing a convolution operation on the feature map and reshape the feature map to produce a label map; dividing the label map into equal subparts, which have the same height and width as the feature map; stacking the subparts of the label map to produce a whole label map; and applying a convolution operation directly between the feature map and the whole label map without inserting extra values in deconvolutional layers to produce a semantic label map.

First claim

Opening claim text (preview).

What is claimed is: 1. A system comprising: a data processor; and an image processing module, executable by the data processor, the image processing module being configured to perform semantic segmentation using a dense upsampling convolution (DUC) operation, the DUC operation being configured to: receive an input image; produce a feature map from the input image; perform a convolution operation on the feature map and reshape the feature map to produce a label map; divide the label map into equal subparts, which have the same height and width as the feature map; stack the subparts of the label map to produce a whole label map; and apply a convolution operation directly between the feature map and the whole label map without inserting extra values in deconvolutional layers to produce a semantic label map. 2. The system of claim 1 wherein the DUC operation is machine learnable. 3. The system of claim 1 wherein the DUC operation is configured to learn an array of upscaling filters to upscale the feature map into a final dense feature map of a desired size. 4. The system of claim 1 wherein the DUC operation is configured operate within a fully convolutional network (FCN). 5. The system of claim 1 wherein the DUC operation is performed at an original resolution, thereby enabling pixel-level decoding. 6. The system of claim 1 wherein the semantic label map is used by an autonomous control subsystem to control a vehicle without a driver. 7. A method comprising: receiving an input image; producing a feature map from the input image; performing a convolution operation on the feature map and reshaping the feature map to produce a label map; dividing the label map into equal subparts, which have the same height and width as the feature map; stacking the subparts of the label map to produce a whole label map; and applying a convolution operation directly between the feature map and the whole label map without inserting extra values in deconvolutional layers to produce a semantic label map. 8. The method of claim 7 wherein the method enables machine learning. 9. The method of claim 7 including learning an array of upscaling filters to upscale the feature map into a final dense feature map of a desired size. 10. The method of claim 7 wherein the method is configured operate within a fully convolutional network (FCN). 11. The method of claim 7 wherein the method is performed at an original resolution, thereby enabling pixel-level decoding. 12. The method of claim 7 wherein the semantic label map is used by an autonomous control subsystem to control a vehicle without a driver. 13. A non-transitory machine-useable storage medium embodying instructions which, when executed by a machine, cause the machine to: receive an input image; produce a feature map from the input image; perform a convolution operation on the feature map and reshape the feature map to produce a label map; divide the label map into equal subparts, which have the same height and width as the feature map; stack the subparts of the label map to produce a whole label map; and apply a convolution operation directly between the feature map and the whole label map without inserting extra values in deconvolutional layers to produce a semantic label map. 14. The non-transitory machine-useable storage medium of claim 13 wherein the instructions are further configured to enable machine learning. 15. The non-transitory machine-useable storage medium of claim 13 wherein the instructions are further configured to learn an array of upscaling filters to upscale the feature map into a final dense feature map of a desired size. 16. The non-transitory machine-useable storage medium of claim 13 wherein the instructions are further configured to operate within a fully convolutional network (FCN). 17. The non-transitory machine-useable storage medium of claim 13 wherein the instructions are further configured to perform at an original resolution, thereby enabling pixel-level decoding. 18. The non-transitory machine-useable storage medium of claim 13 wherein the semantic label map is used by an autonomous control subsystem to control a vehicle without a driver.

Assignees

Inventors

Classifications

  • Classification techniques · CPC title

  • using neural networks · CPC title

  • Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods · CPC title

  • Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN] · CPC title

  • Classification techniques · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9953236B1 cover?
A system and method for semantic segmentation using dense upsampling convolution (DUC) are disclosed. A particular embodiment includes: receiving an input image; producing a feature map from the input image; performing a convolution operation on the feature map and reshape the feature map to produce a label map; dividing the label map into equal subparts, which have the same height and width as…
Who is the assignee on this patent?
TuSimple
What technology area does this patent fall under?
Primary CPC classification G06V20/56. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 24 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).