Method and system for semantic label generation using sparse 3D data

US10983217B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10983217-B2
Application numberUS-201816206465-A
CountryUS
Kind codeB2
Filing dateNov 30, 2018
Priority dateNov 30, 2018
Publication dateApr 20, 2021
Grant dateApr 20, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods and apparatuses for generating a frame of semantically labeled 2D data are described. A frame of sparse 3D data is generated from a frame of sparse 3D data. Semantic labels are assigned to the frame of dense 3D data, based on a set of 3D bounding boxes determined for the frame of sparse 3D data. Semantic labels are assigned to a corresponding frame of 2D data based on a mapping between the frame of sparse 3D data and the frame of 2D data. The mapping is used to map a 3D data point in the frame of dense 3D data to a mapped 2D data point in the frame of 2D data. The semantic label assigned to the 3D data point is assigned to the mapped 2D data point. The frame of semantically labeled 2D data, including the assigned semantic labels, is outputted.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method for generating a frame of semantically labeled two dimensional (2D) data, the method comprising: receiving a frame of sparse three-dimensional (3D) data; generating a frame of dense 3D data from the frame of sparse 3D data by: encoding the frame of sparse 3D data into sparse 2D xyz arrays, each sparse 2D xyz array representing a respective coordinate value of 3D data points in the sparse 3D data; generating a sparse 2D depth array from the sparse 2D xyz arrays; performing depth completion to generate a dense 2D depth array from the sparse 2D depth array; generating dense 2D xyz arrays from the sparse 2D xyz arrays, using information from the depth completion; and generating the frame of dense 3D data by performing an inverse mapping on the dense 2D xyz arrays; assigning semantic labels to the frame of dense 3D data, the semantic labels being assigned based on a set of 3D bounding boxes determined for the frame of sparse 3D data, wherein each data point in the frame of dense 3D data falling within a given 3D bounding box is assigned the semantic label associated with the given 3D bounding box; assigning the semantic labels to a frame of 2D data that corresponds to the frame of sparse 3D data based on a mapping between the frame of sparse 3D data and the frame of 2D data, wherein the mapping is used to map a 3D data point in the frame of dense 3D data to a mapped 2D data point in the frame of 2D data, and wherein the semantic label assigned to the 3D data point is assigned to the mapped 2D data point; and outputting the frame of semantically labeled 2D data, including the assigned semantic labels. 2. The method of claim 1 , wherein the method is repeated for a plurality of frames of sparse 3D data, to output a set of semantically labeled 2D data that comprises a corresponding plurality of frames of semantically labeled 2D data. 3. The method of claim 1 , wherein the frame of sparse 3D data is received from one of a LIDAR sensor and a camera. 4. The method of claim 1 , wherein the frame of 2D data is received by performing a projection of the frame of dense 3D data. 5. The method of claim 1 , further comprising, after assigning the semantic labels to the frame of 2D data: filtering the frame of semantically labeled 2D data by applying a set of 2D bounding boxes, wherein any assigned label that does not agree with the 2D bounding boxes is discarded or relabeled. 6. A method for generating a frame of semantically labeled two dimensional (2D) data, the method comprising: receiving a frame of sparse three-dimensional (3D) data; generating a frame of dense 3D data from the frame of sparse 3D data by: encoding the frame of sparse 3D data into a sparse 2D z array representing a z coordinate value of 3D data points in the sparse 3D data, wherein the sparse 2D z array is considered to be a sparse 2D depth array; performing depth completion to generate a dense 2D depth array from the sparse 2D depth array; and generating the frame of dense 3D data by performing back projection on the dense 2D depth array; assigning semantic labels to the frame of dense 3D data, the semantic labels being assigned based on a set of 3D bounding boxes determined for the frame of sparse 3D data, wherein each data point in the frame of dense 3D data falling within a given 3D bounding box is assigned the semantic label associated with the given 3D bounding box; assigning the semantic labels to a frame of 2D data that corresponds to the frame of sparse 3D data based on a mapping between the frame of sparse 3D data and the frame of 2D data, wherein the mapping is used to map a 3D data point in the frame of dense 3D data to a mapped 2D data point in the frame of 2D data, and wherein the semantic label assigned to the 3D data point is assigned to the mapped 2D data point; and outputting the frame of semantically labeled 2D data, including the assigned semantic labels. 7. The method of claim 1 , wherein the set of 3D bounding boxes is generated for the frame of sparse 3D data using a 3D neural network. 8. The method of claim 6 , wherein the method is repeated for a plurality of frames of sparse 3D data, to output a set of semantically labeled 2D data that comprises a corresponding plurality of frames of semantically labeled 2D data. 9. The method of claim 6 , wherein the frame of sparse 3D data is received from one of a LIDAR sensor and a camera. 10. The method of claim 6 , wherein the frame of 2D data is received by performing a projection of the frame of dense 3D data. 11. The method of claim 6 , further comprising, after assigning the semantic labels to the frame of 2D data: filtering the frame of semantically labeled 2D data by applying a set of 2D bounding boxes, wherein any assigned label that does not agree with the 2D bounding boxes is discarded or relabeled. 12. An apparatus for generating a frame of semantically labeled two dimensional (2D) data, the apparatus comprising: a processing unit; and a memory coupled to the processing unit, the memory containing machine-executable instructions that, when executed by the processing unit, causes the apparatus to: receive a frame of sparse three-dimensional (3D) data; generate a frame of dense 3D data from the frame of sparse 3D data by: encoding the frame of sparse 3D data into sparse 2D xyz arrays, each sparse 2D xyz array representing a respective coordinate value of 3D data points in the sparse 3D data; generating a sparse 2D depth array from the sparse 2D xyz arrays; performing depth completion to generate a dense 2D depth array from the sparse 2D depth array; generating dense 2D xyz arrays from the sparse 2D xyz arrays, using information from the depth completion; and generating the frame of dense 3D data by performing an inverse mapping on the dense 2D xyz arrays; assign semantic labels to the frame of dense 3D data, the semantic labels being assigned based on a set of 3D bounding boxes determined for the frame of sparse 3D data, wherein each data point in the frame of dense 3D data falling within a given 3D bounding box is assigned the semantic label associated with the given 3D bounding box; assign the semantic labels to a frame of 2D data that corresponds to the frame of sparse 3D data based on a mapping between the frame of sparse 3D data and the frame of 2D data, wherein the mapping is used to map a 3D data point in the frame of dense 3D data to a mapped 2D data point in the frame of 2D data, and wherein the semantic label assigned to the 3D data point is assigned to the mapped 2D data point; and output the frame of semantically labeled 2D data, including the assigned semantic labels. 13. The apparatus of claim 12 , wherein the instructions further cause the apparatus to output a set of semantically labeled 2D data that comprises a plurality of frames of semantically labeled 2D data. 14. The apparatus of claim 12 , wherein the frame of sparse 3D data is received from one of a LIDAR sensor and a camera. 15. The apparatus of claim 12 , wherein the frame of 2D data is received by performing a projection of the dense 3D data. 16. The apparatus of claim 12 , wherein the instructions further cause the apparatus to, after assigning the semantic labels to the frame of 2D data: filter the frame of semantically labeled 2D data by applying a set of 2D bounding boxes, wherein any assigned label that does not agree with the 2D bounding boxes is discarded or relabeled. 17. The apparatus of claim 12 , wherein the set of 3D bounding boxes is generated for the frame of sparse 3D data using

Assignees

Inventors

Classifications

  • Combinations of networks · CPC title

  • Supervised learning · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • Weakly supervised learning, e.g. semi-supervised or self-supervised learning · CPC title

  • G06T19/00Primary

    Manipulating three-dimensional [3D] models or images for computer graphics · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10983217B2 cover?
Methods and apparatuses for generating a frame of semantically labeled 2D data are described. A frame of sparse 3D data is generated from a frame of sparse 3D data. Semantic labels are assigned to the frame of dense 3D data, based on a set of 3D bounding boxes determined for the frame of sparse 3D data. Semantic labels are assigned to a corresponding frame of 2D data based on a mapping between …
Who is the assignee on this patent?
Nezhadarya Ehsan, Nabatchian Amirhosein, Liu Bingbing, and 1 more
What technology area does this patent fall under?
Primary CPC classification G06T19/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 20 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).