Objection recognition in a 3D scene

US9846946B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9846946-B2
Application numberUS-201514948260-A
CountryUS
Kind codeB2
Filing dateNov 21, 2015
Priority dateDec 2, 2014
Publication dateDec 19, 2017
Grant dateDec 19, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method comprising: obtaining a three-dimensional (3D) point cloud about at least one object of interest; detecting ground and/or building objects from 3D point cloud data using an unsupervised segmentation method; removing the ground and/or building objects from the 3D point cloud data; and detecting one or more vertical objects from the remaining 3D point cloud data using a supervised segmentation method.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method, comprising: obtaining a three-dimensional (3D) point cloud about at least one object of interest; detecting at least one of ground and building objects from 3D point cloud data using an unsupervised segmentation method; removing the at least one of ground and building objects from the 3D point cloud data; detecting one or more vertical objects from remaining 3D point cloud data using a supervised segmentation method; dividing the 3D cloud point data into rectangular tiles in a horizontal plane; and determining an estimation of a ground plane within the rectangular tiles, wherein determining the estimation of the ground plane within the rectangular tiles comprises: dividing the rectangular tiles into a plurality of grid cells; searching a minimal-z-value point within each grid cell; searching points in the each grid cell having a z-value within a first predetermined threshold from the minimal-z-value of a grid cell; collecting the points having the z-value within the first predetermined threshold from the minimal-z-value of the grid cell from each grid cell; estimating the ground plane of a tile on the basis of the collected points; and determining points locating within a second predetermined threshold from the estimated ground plane to comprise ground points of the tile. 2. The method according to claim 1 , further comprising: projecting 3D points to range image pixels in a horizontal plane; defining values of the range image pixels as a function of number of 3D points projected to the range image pixels and a maximal z-value among the 3D points projected to the range image pixels; defining a geodesic elongation value for objects detected in the range image pixels; and distinguishing buildings from other objects on the basis of the geodesic elongation value. 3. The method according to claim 2 , the method further comprising: binarizing the values of the range image pixels; applying morphological operations for merging neighboring points in the binarized values of the range image pixels; and extracting contours for finding boundaries of the objects. 4. The method according to claim 1 further comprising applying a voxel based segmentation to the remaining 3D point cloud data, wherein the voxel based segmentation comprises: performing voxilisation of the 3D point cloud data; and merging of voxels into super-voxels and carrying out supervised classification based on discriminative features extracted from the super-voxels. 5. The method according to claim 4 further comprising: merging the 3D points into voxels comprising a plurality of 3D points such that for a selected 3D point, all neighboring 3D points within a third predefined threshold from the selected 3D point are merged into a voxel without exceeding a maximum number of 3D points in the voxel. 6. The method according to claim 4 , further comprising: merging any number of the voxels into the super-voxels such that a criteria is fulfilled, the criteria comprising: a minimal geometrical distance between two voxels is smaller than a fourth predefined threshold; and an angle between normal vectors of the two voxels is smaller than a fifth predefined threshold. 7. The method according to claim 4 further comprising: extracting features from the super-voxels for classifying the super-voxels into objects. 8. The method according to claim 7 , wherein the features include one or more of: geometrical shape; height of a voxel above ground; horizontal distance of the voxel to a center line of a street; surface normal of the voxel; voxel planarity; density of 3D points in the voxel; and intensity of the voxel. 9. An apparatus, comprising at least one processor, memory including computer program code, the memory and the computer program code configured to, with the at least one processor, cause the apparatus to at least: obtain a three-dimensional (3D) point cloud about at least one object of interest; detect at least one of ground and building objects from 3D point cloud data using an unsupervised segmentation method; remove the at least one of ground and building objects from the 3D point cloud data; and detect one or more vertical objects from remaining 3D point cloud data using a supervised segmentation method; divide the 3D cloud point data into rectangular tiles in a horizontal plane; and determine an estimation of a ground plane within the rectangular tiles, wherein to determine the estimation of the ground plane within the rectangular tiles, the apparatus is further caused to: divide the rectangular tiles into a plurality of grid cells; search a minimal-z-value point within each grid cell; search points in the each grid cell having a z-value within a first predetermined threshold from the minimal-z-value of a grid cell; collect the points having the z-value within the first predetermined threshold from the minimal-z-value of the grid cell from the each grid cell; estimate the ground plane of a tile on the basis of the collected points; and determine points locating within a second predetermined threshold from the estimated ground plane to comprise ground points of the tile. 10. The apparatus according claim 9 , wherein the apparatus is further caused to: project 3D points to range image pixels in a horizontal plane; define values of the range image pixels as a function of number of 3D points projected to the range image pixels and a maximal z-value among the 3D points projected to the range image pixels; define a geodesic elongation value for objects detected in the range image pixels; and distinguish buildings from other objects on the basis of the geodesic elongation value. 11. The apparatus according to claim 10 , wherein the apparatus is further caused to: binarize the values of the range image pixels; apply morphological operations for merging neighboring points in the binarized values of the range image pixels; and extract contours for finding boundaries of the objects. 12. The apparatus according to claim 9 , wherein the apparatus is further caused to apply a voxel based segmentation to the remaining 3D point cloud data, and wherein to apply the voxel based segmentation, the apparatus is further caused to: perform voxilisation of the 3D point cloud data; merge voxels into super-voxels; and carry out supervised classification based on discriminative features extracted from the super-voxels. 13. The apparatus according to claim 12 , wherein the apparatus is further caused to merge the 3D points into voxels comprising a plurality of 3D points such that for a selected 3D point, all neighboring 3D points within a third predefined threshold from the selected 3D point are merged into a voxel without exceeding a maximum number of 3D points in the voxel. 14. The apparatus according to claim 12 , wherein the apparatus is further caused to merging any number of the voxels into the super-voxels such that following criteria is fulfilled: a minimal geometrical distance between two voxels is smaller than a fourth predefined threshold; and an angle between normal vectors of the two voxels is smaller than a fifth predefined threshold. 15. The apparatus according to claim 12 , wherein the apparatus is the apparatus is further caused to extract features from the super-voxels for classifying the super-voxels into objects. 16. The apparatus according to claim 15 , wherein the features include one or more of: geometrical shape; height of a voxel above ground; horizontal distance of the voxel to a center line of a street;

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9846946B2 cover?
A method comprising: obtaining a three-dimensional (3D) point cloud about at least one object of interest; detecting ground and/or building objects from 3D point cloud data using an unsupervised segmentation method; removing the ground and/or building objects from the 3D point cloud data; and detecting one or more vertical objects from the remaining 3D point cloud data using a supervised segmen…
Who is the assignee on this patent?
Nokia Technologies Oy
What technology area does this patent fall under?
Primary CPC classification G06T7/0091. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 19 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).