Comprehensive and efficient method to incorporate map features for object detection with LiDAR

US11520347B2 · US · B2

Patent metadata
Publication number: US-11520347-B2
Application number: US-201916255789-A
Country: US
Kind code: B2
Filing date: Jan 23, 2019
Priority date: Jan 23, 2019
Publication date: Dec 6, 2022
Grant date: Dec 6, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

According to various embodiments, systems and methods described in the disclosure combine mapped features with point cloud features to improve object detection precision of an autonomous driving vehicle (ADV). The map features and the point cloud features can be extracted from a perception area of the ADV within a particular angle of view at each driving cycle based on a position of the ADV. The map features and the point cloud features can be concatenated and provided to a neural network for object detections.
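The cropping and concatenation steps summarized in the abstract can be pictured with a minimal sketch. Everything below is an illustrative assumption and not the patent's implementation: the function names (`in_view`, `crop_to_view`, `concat_features`), the sector-shaped perception area, and the parameters `half_angle` and `max_range` are hypothetical.

```python
import math


def in_view(point, adv_xy, heading, half_angle, max_range):
    """Return True if a point lies inside a sector centered on the ADV heading.

    Assumption: the perception area is modeled as a circular sector of
    radius max_range and total opening angle 2 * half_angle (radians).
    """
    dx = point[0] - adv_xy[0]
    dy = point[1] - adv_xy[1]
    dist = math.hypot(dx, dy)
    if dist > max_range:
        return False
    if dist == 0.0:
        return True  # a point at the ADV position is trivially in view
    bearing = math.atan2(dy, dx)
    # Wrap the angular difference into [-pi, pi) before comparing.
    diff = (bearing - heading + math.pi) % (2 * math.pi) - math.pi
    return abs(diff) <= half_angle


def crop_to_view(points, adv_xy, heading, half_angle, max_range):
    """Keep only the points that fall inside the perception sector."""
    return [p for p in points if in_view(p, adv_xy, heading, half_angle, max_range)]


def concat_features(point_cloud_features, map_features):
    """Concatenate the two feature sequences into a single feature list."""
    return list(point_cloud_features) + list(map_features)
```

In this sketch the same sector would be applied to both the map portion and the point cloud portion, so that the two feature sets describe the same area before they are concatenated.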

First claim

Opening claim text (preview).

What is claimed is:

1. A computer-implemented method of operating an autonomous driving vehicle (ADV), the method comprising: extracting a plurality of map features from a map associated with a road in which the ADV is driving, wherein the plurality of map features are extracted from a portion of the map, the portion of the map corresponding to a perception area of the ADV within a particular angle of view at each driving cycle, wherein the particular angle of view corresponds to a heading of the ADV; extracting a plurality of point cloud features from a portion of a point cloud of LIDAR data, the portion of the point cloud corresponding to the perception area of the ADV within the particular angle of view; concatenating the plurality of point cloud features and the plurality of map features into a feature list; providing the feature list as input to one or more neural networks, which detect one or more objects in a driving environment based on the input; and generating a trajectory during each driving cycle of the ADV in view of the detected objects to drive the ADV through the detected objects.

2. The method of claim 1, wherein the plurality of map features are extracted using a convolution neural network, and include one or more lanes, one or more lane boundaries, one or more traffic signs, and one or more road curbs.

3. The method of claim 1, wherein the extracting the plurality of map features from the map includes: forming a plurality of layers, each layer corresponding to one of the plurality of map features extracted from the map; converting the plurality of layers into a red, green, and blue (RGB) representation; and extracting the plurality of map features from the RGB representation.

4. The method of claim 1, wherein the extracted map features are pre-calculated and cached to speed up inference of the one or more neural networks.

5. The method of claim 1, wherein the plurality of point cloud features are extracted using a fully connected network, which is to partition a space within the perception area into a plurality of equally spaced voxels, to encode each non-empty voxel with a plurality of point-wise features, and to combine the point-wise features with a locally aggregated feature.

6. The method of claim 5, wherein the plurality of point-wise features for each non-empty voxel represent statistical quantities derived from all LiDAR points within that voxel, and include a distance from the center of the voxel to an origin of the point cloud, a maximum height of LiDAR points within the voxel, and a mean height of LiDAR points within the voxel.

7. The method of claim 1, wherein the one or more neural networks include a convolution neural network and a region proposal network, wherein the convolution neural network generates a feature map based on the plurality of map features and the plurality of point cloud features, and wherein the region proposal network maps the feature map to one or more desired learning targets to generate object detections.

8. A non-transitory machine-readable medium having instructions stored therein, which when executed by a processor, causing the processor to perform operations of operating an autonomous driving vehicle (ADV), the operations comprising: extracting a plurality of map features from a map associated with a road in which the ADV is driving, wherein the plurality of map features are extracted from a portion of the map, the portion of the map corresponding to a perception area of the ADV within a particular angle of view at each driving cycle, wherein the particular angle of view corresponds to a heading of the ADV; extracting a plurality of point cloud features from a portion of a point cloud of LIDAR data, the portion of the point cloud corresponding to the perception area of the ADV within the particular angle of view; concatenating the plurality of point cloud features and the plurality of map features into a feature list; providing the feature list as input to one or more neural networks, which detect one or more objects in a driving environment based on the input; and generating a trajectory during each driving cycle of the ADV in view of the detected objects to drive the ADV through the detected objects.

9. The machine-readable medium of claim 8, wherein the plurality of map features are extracted using a convolution neural network, and include one or more lanes, one or more lane boundaries, one or more traffic signs, and one or more road curbs.

10. The machine-readable medium of claim 8, wherein the extracting the plurality of map features from the map comprises: forming a plurality of layers, each layer corresponding to one of the plurality of map features extracted from the map; converting the plurality of layers into a red, green, and blue (RGB) representation; and extracting the plurality of map features from the RGB representation using one or more convolution layers of a convolution neural network.

11. The machine-readable medium of claim 8, wherein the extracted map features are pre-calculated and cached to speed up inference of the one or more neural networks.

12. The machine-readable medium of claim 8, wherein the plurality of point cloud features are extracted using a fully connected network, which is to partition a space within the perception area into a plurality of equally spaced voxels, to encode each non-empty voxel with a plurality of point-wise features, and to combine the point-wise features with a locally aggregated feature.

13. The machine-readable medium of claim 12, wherein the plurality of point-wise features for each non-empty voxel represent statistical quantities derived from all LiDAR points within that voxel, and include a distance from the center of the voxel to an origin of the point cloud, a maximum height of LiDAR points within the voxel, and a mean height of LiDAR points within the voxel.

14. The machine-readable medium of claim 8, wherein the one or more neural networks include a convolution neural network and a region proposal network, wherein the convolution neural network generates a feature map based on the plurality of map features and the plurality of point cloud features, and wherein the region proposal network maps the feature map to one or more desired learning targets to generate object detections.

15. A data processing system, comprising: a processor; and a memory coupled to the processor to store instructions, which when executed by a processor, causing the processor to perform operations of operating an autonomous driving vehicle (ADV), the operations comprising: extracting a plurality of map features from a map associated with a road in which the ADV is driving, wherein the plurality of map features are extracted from a portion of the map, the portion of the map corresponding to a perception area of the ADV within a particular angle of view at each driving cycle, wherein the particular angle of view corresponds to a heading of the ADV, extracting a plurality of point cloud features from a portion of a point cloud of LIDAR data, the portion of the point cloud corresponding to the perception area of the ADV within the particular angle of view, concatenating the plurality of point cloud features and the plurality of map features into a feature list, providing the feature list as input to one or more neural networks, which detect one or more objects in a driving environment based on the input, and generating a trajectory during each driving cycle of the ADV in view of the detected objects to drive the ADV through the detected objects.

16. The system of claim 15
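The per-voxel statistics named in the claims (distance from the voxel center to the point cloud origin, maximum height, and mean height) can be illustrated with a short sketch. It rests on stated assumptions and is not the claimed implementation: the grid is laid out on the x-y plane only, the point cloud origin is taken as (0, 0), and the names `voxelize`, `voxel_features`, and `voxel_size` are hypothetical. The fully connected network and the locally aggregated feature mentioned in the claims are not modeled here.

```python
import math
from collections import defaultdict


def voxelize(points, voxel_size):
    """Group (x, y, z) points into equally spaced cells on the x-y plane.

    Only cells that receive at least one point are stored, so every
    entry in the result is a non-empty voxel.
    """
    voxels = defaultdict(list)
    for x, y, z in points:
        key = (math.floor(x / voxel_size), math.floor(y / voxel_size))
        voxels[key].append((x, y, z))
    return dict(voxels)


def voxel_features(voxels, voxel_size):
    """Compute (center distance to origin, max height, mean height) per voxel.

    Assumption: the point cloud origin is at (0, 0) in the x-y plane.
    """
    features = {}
    for (ix, iy), pts in voxels.items():
        cx = (ix + 0.5) * voxel_size  # voxel center, x
        cy = (iy + 0.5) * voxel_size  # voxel center, y
        heights = [p[2] for p in pts]
        features[(ix, iy)] = (
            math.hypot(cx, cy),
            max(heights),
            sum(heights) / len(heights),
        )
    return features
```

A hypothetical call such as `voxel_features(voxelize(points, 1.0), 1.0)` returns one three-value tuple per non-empty cell. A real pipeline would feed such tuples into a learned encoder rather than use them directly.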

Assignees

Inventors

Classifications

  • Combinations of networks · CPC title

  • G01S17/931 · Primary

    of land vehicles · CPC title

  • for mapping or imaging · CPC title

  • Map- or contour-matching · CPC title

  • Combination of radar systems with lidar systems · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11520347B2 cover?
According to various embodiments, systems and methods described in the disclosure combine mapped features with point cloud features to improve object detection precision of an autonomous driving vehicle (ADV). The map features and the point cloud features can be extracted from a perception area of the ADV within a particular angle view at each driving cycle based on a position of the ADV. The m…
Who is the assignee on this patent?
Baidu USA LLC
What technology area does this patent fall under?
Primary CPC classification G01S17/931. Mapped technology areas include Physics.
When was this patent published?
Publication date Dec 6, 2022 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).