Systems and methods for visual classification with region proposals

US10460180B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10460180-B2
Application numberUS-201715492760-A
CountryUS
Kind codeB2
Filing dateApr 20, 2017
Priority dateApr 20, 2017
Publication dateOct 29, 2019
Grant dateOct 29, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and method are provided for controlling an autonomous vehicle. A camera configured to capture an image, and a controller can execute an autonomous driving system (ADS) that classify that image. The ADS comprises a classification system for classifying objects in an environment within a driveable area of the autonomous vehicle. The classification system comprises a processor configured to execute a region proposal generator module and an image classification module. The region proposal generator module generates a set of bounding box region proposals for the image. The bounding box region proposals are selected areas of the image that include objects to be classified. The image classification module classifies, via a neural network executed by the processor, the objects from the image that are within one of the bounding box region proposals.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for classifying objects in an environment, the method comprising: (a) processing, ranging information from depth sensors to identify a set of bounding box region proposals for an image from a camera, wherein the ranging information comprises point cloud data provided from lidar devices, wherein the bounding box region proposals are selected areas of the image that include objects to be classified, wherein each of the bounding box region proposals comprises: a set of two-dimensional bounding box coordinates that correspond to a region of a rectified image that includes one or more objects to be classified, wherein the set of bounding box region proposals collectively specify which objects are in the rectified image and where those objects are in the rectified image, wherein the processing comprises: (a1) generating segmented objects based on the ranging information, wherein the segmented objects define three-dimensional locations and dimensions of objects in vicinity of a vehicle; (a2) determining a subset of segmented objects to be classified that: meet a size constraint, are within a certain height range above the ground and are within a driveable area of the vehicle, and wherein other segmented objects that are not to be classified are those that the vehicle cannot hit and are disregarded when generating the object state information; (a3) generating object state information that indicates the three-dimensional locations of the subset of segmented objects to be classified; and (a4) translating the three-dimensional locations of objects as specified by the object state information into the set of bounding box region proposal; and (b) classifying, via a neural network executed by the hardware based processor, only the objects from the image that are within one of the bounding box region proposals and generating an object classification result for each object from the rectified image that is within one of the bounding box region proposals; (c) processing image data received from cameras to generate the rectified images; and (d) generating a regressed bounding box for each object that is classified, wherein each regressed bounding box for each object is the bounding box that the neural network has determined to be a best bounding box encompassing that object, wherein steps (a1) through (a4), (b), (c) and (d) are performed iteratively such that the object classification result and the regressed bounding box for each object being classified are fed back on each iteration to refine the bounding box region proposal for each object during each subsequent iteration so that the bounding box region proposal for each object more closely describes actual geometry of that object. 2. The method according to claim 1 , wherein processing the object state information to generate the set of bounding box region proposals: for each object: projecting the three-dimensional location of that object relative to the vehicle into one of the rectified images. 3. The method according to claim 1 , wherein the ranging information further comprises at least one of: radar data from radar devices; stereo vision data from cameras that provides relative depth information; and structured-light ranging data from a stereo vision system. 4. A classification system for classifying objects in an environment, the classification system comprising: a hardware-based processor: and memory comprising processor-executable instructions encoded on a non-transient processor-readable media, wherein the hardware-based processor is configurable to execute the processor-executable instructions to: generate segmented objects based on ranging information, wherein the ranging information comprises point cloud data provided from lidar devices, wherein the segmented objects define three-dimensional locations and dimensions of objects in vicinity of a vehicle; determine a subset of segmented objects to be classified that meet a size constraint based, are within a certain height range above the ground and are within a driveable area of the vehicle, wherein other segmented objects that are not to be classified are those that the vehicle cannot hit and are disregarded when generating the object state information; and generate object state information that indicates the three-dimensional locations of the subset of segmented objects with respect to the vehicle; translate the three-dimensional locations of objects as specified by the object state information into a set of bounding box region proposals for an image from a camera, wherein the bounding box region proposals are selected areas of the image that include objects to be classified, wherein each of the bounding box region proposals comprises: a set of two-dimensional bounding box coordinates that correspond to a region of a rectified image that includes one or more objects to be classified, and wherein the set of bounding box region proposals collectively specify which objects are in the rectified image and where those objects are in the rectified image; classify, via a neural network executed by the hardware based processor, only the objects from the rectified image that are within one of the bounding box region proposals and to disregard other objects from the rectified image such that only portions of the rectified image that are specified by one of the bounding box region proposals are analyzed to classify objects within the rectified image; and generate an object classification result for each object from the rectified image that is within one of the bounding box region proposals; and generate a regressed bounding box for each object that is classified, wherein each regressed bounding box for each object is the bounding box that the neural network has determined to be a best bounding box encompassing that object, wherein the object classification result and the regressed bounding box for each object being classified are fed back iteratively to refine the bounding box region proposal for each object during each subsequent iteration so that the bounding box region proposal for each object more closely describes actual geometry of that object. 5. An autonomous vehicle, comprising: a camera configured to capture an image; a controller comprising: an autonomous driving system (ADS), comprising: a classification system for classifying objects in an environment within a driveable area of the autonomous vehicle, the classification system comprising: memory comprising processor-executable instructions encoded on a non-transient processor-readable media: and a hardware-based processor to execute the processor-executable instructions to: generate segmented objects based on ranging information, wherein the ranging information comprises point cloud data provided from lidar devices, wherein the segmented objects define three-dimensional locations and dimensions of objects in vicinity of the autonomous vehicle; determine a subset of segmented objects to be classified that meet a size constraint based, are within a certain height range above the ground and are within a driveable area of the vehicle, wherein other segmented objects that are not to be classified are those that the vehicle cannot hit and are disregarded when generating the object state information; generate object state information that indicates the three-dimensional locations of the subset of segmented objects with respect to the vehicle; translate the three-dimensional locations of objects as specified by the object state information into a set of bounding box region proposals for the image, wherein the bounding box region proposals are selected areas of the image that include objects to be classified, wherein each of the bounding box region proposals comprises: a set of two-dimensional bounding box coordinates that

Assignees

Inventors

Classifications

  • using neural networks · CPC title

  • using classification, e.g. of video objects · CPC title

  • based on distances to training or reference patterns · CPC title

  • G06V20/58Primary

    Recognition of moving objects or obstacles, e.g. vehicles or pedestrians; Recognition of traffic objects, e.g. traffic signs, traffic lights or roads · CPC title

  • Detecting or recognising potential candidate objects based on visual cues, e.g. shapes · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10460180B2 cover?
Systems and method are provided for controlling an autonomous vehicle. A camera configured to capture an image, and a controller can execute an autonomous driving system (ADS) that classify that image. The ADS comprises a classification system for classifying objects in an environment within a driveable area of the autonomous vehicle. The classification system comprises a processor configured t…
Who is the assignee on this patent?
Gm Global Tech Operations Llc
What technology area does this patent fall under?
Primary CPC classification G06V20/58. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Oct 29 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 9 related publications on this page (citations in our corpus or others sharing the same primary CPC).