Object detection using recurrent neural network and concatenated feature map

US10452946B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10452946-B2
Application numberUS-201816226142-A
CountryUS
Kind codeB2
Filing dateDec 19, 2018
Priority dateJan 24, 2017
Publication dateOct 22, 2019
Grant dateOct 22, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

According to one embodiment, a system includes a sensor component and a detection component. The sensor component is configured to obtain a first stream of sensor data and a second stream of sensor data, wherein each of the first stream and second stream comprise a plurality of sensor frames. The detection component is configured to generate a concatenated feature map based on a sensor frame of a first type and a sensor frame of a second type. The detection component is configured to detect one or more objects based on the concatenated feature map. One or more of generating and detecting comprises generating or detecting using a neural network with a recurrent connection that feeds information about features or objects from previous frames.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method comprising: extracting object features from a depth map to generate a first feature map by processing the depth map using a neural network, wherein the depth map is based on range sensor data; extracting object features from an image to generate a second feature map by processing the image using the neural network; concatenating the first feature map and the second feature map to generate a concatenated feature map; and detecting one or more objects based on the concatenated map by processing the concatenated map using the neural network. 2. The method of claim 1 , wherein generating the first feature map comprises generating based on high-level features in the depth map, and wherein generating the second feature map comprises generating based on high-level features in the image. 3. The method of claim 2 , wherein generating the concatenated feature map comprises fusing features from the first feature map and the second feature map. 4. The method of claim 1 , wherein the neural network feeds forwards one or more of: a feature in a previous feature map; an object detected in a previous frame or time period; or a feature in a previous depth map. 5. The method of claim 1 , further comprising: generating a subsequent first feature map based on a subsequent depth map that is based on subsequent range data; generating a subsequent second feature map based on a subsequent image; and feeding forward one or more of a feature of the first feature map or the second feature map using recurrent connections in the neural network. 6. The method of claim 1 , wherein the neural network comprises an input layer, one or more hidden layers, and a classification layer, wherein a recurrent connection of the neural network feeds an output of the classification layer from the previous frames into one or more of the input layer or a hidden layer of the one or more hidden layers during generating the feature map or detecting the one or more objects. 7. The method of claim 1 , wherein the concatenated map comprises a fusion of extracted object features from the depth map and extracted object features from the image. 8. The method of claim 1 , wherein the range sensor data comprises one or more of light detection and ranging (LIDAR) sensor data or depth camera data. 9. A system comprising: a sensor of a vehicle; a camera of the vehicle; and a processor that is programmable to execute instructions stored in non-transitory computer readable storage media, the instructions comprising: receiving sensor data from the sensor and an image from the camera; generating a depth map based on the sensor data; extracting object features from the depth map to generate a first feature map by processing the depth map using a neural network; extracting object features from an image to generate a second feature map by processing the image using the neural network; concatenating the first feature map and the second feature map to generate a concatenated feature map; and detecting one or more objects based on the concatenated map by processing the concatenated map using the neural network. 10. The system of claim 9 , wherein generating the first feature map comprises generated base on high-level features in the depth map, and wherein generating the second feature map comprises generating based on high-level features in the image. 11. The system of claim 10 , wherein generating the concatenated feature map comprises fusing features from the first feature map and the second feature map. 12. The system of claim 9 , wherein the neural network feeds forwards one or more of: a feature in a previous feature map; an object detected in a previous frame or time period; or a feature in a previous depth map. 13. The system of claim 9 , wherein the instructions further comprise: generating a subsequent first feature map based on a subsequent depth map that is based on subsequent range data; generating a subsequent second feature map based on a subsequent image; and feeding forward one or more of a feature of the first feature map or the second feature map using recurrent connections in the neural network. 14. The system of claim 9 , wherein the neural network comprises an input layer, one or more hidden layers, and a classification layer, wherein a recurrent connection of the neural network feeds an output of the classification layer from the previous frames into one or more of the input layer or a hidden layer of the one or more hidden layers during generating the feature map or detecting the one or more objects. 15. Non-transitory computer readable storage media storing instructions that, when executed by one or more processors, cause the one or more processors to: extract object features from a depth map to generate a first feature map by processing the depth map using a neural network, wherein the depth map is based on range sensor data; extract object features from an image to generate a second feature map by processing the image using the neural network; concatenating the first feature map and the second feature map to generate a concatenated feature map; and detect one or more objects based on the concatenated map by processing the concatenated map using the neural network. 16. The non-transitory computer readable storage media of claim 15 , wherein the instructions cause the one or more processors to generate the first feature map by generating based on high-level features in the depth map and generate the second feature map by generating based on high-level features in the image. 17. The non-transitory computer readable storage media of claim 16 , wherein the instructions cause the one or more processors to generate the concatenated feature map by fusing features from the first feature map and the second feature map. 18. The non-transitory computer readable storage media of claim 15 , wherein the neural network feeds forwards one or more of: a feature in a previous feature map; an object detected in a previous frame or time period; or a feature in a previous depth map. 19. The non-transitory computer readable storage media of claim 15 , wherein the instructions further cause the one or more processors to: generate a subsequent first feature map based on a subsequent depth map that is based on subsequent range data; generate a subsequent second feature map based on a subsequent image; and feed forward one or more of a feature of the first feature map or the second feature map using recurrent connections in the neural network. 20. The non-transitory computer readable storage media of claim 15 , wherein the neural network comprises an input layer, one or more hidden layers, and a classification layer, wherein a recurrent connection of the neural network feeds an output of the classification layer from the previous frames into one or more of the input layer or a hidden layer of the one or more hidden layers during generating the feature map or detecting the one or more objects.

Assignees

Inventors

Classifications

  • G01S7/417Primary

    involving the use of neural networks · CPC title

  • using neural networks · CPC title

  • using classification, e.g. of video objects · CPC title

  • Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components · CPC title

  • Determining position or orientation of objects or cameras (camera calibration G06T7/80) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10452946B2 cover?
According to one embodiment, a system includes a sensor component and a detection component. The sensor component is configured to obtain a first stream of sensor data and a second stream of sensor data, wherein each of the first stream and second stream comprise a plurality of sensor frames. The detection component is configured to generate a concatenated feature map based on a sensor frame of…
Who is the assignee on this patent?
Ford Global Tech Llc
What technology area does this patent fall under?
Primary CPC classification G01S7/417. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Oct 22 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).