Multi-modal, multi-technique vehicle signal detection

US11288527B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11288527-B2
Application numberUS-202016804667-A
CountryUS
Kind codeB2
Filing dateFeb 28, 2020
Priority dateFeb 27, 2020
Publication dateMar 29, 2022
Grant dateMar 29, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A vehicle includes one or more cameras that capture a plurality of two-dimensional images of a three-dimensional object. A light detector and/or a semantic classifier search within those images for lights of the three-dimensional object. A vehicle signal detection module fuses information from the light detector and/or the semantic classifier to produce a semantic meaning for the lights. The vehicle can be controlled based on the semantic meaning. Further, the vehicle can include a depth sensor and an object projector. The object projector can determine regions of interest within the two-dimensional images, based on the depth sensor. The light detector and/or the semantic classifier can use these regions of interest to efficiently perform the search for the lights.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: determining, by an object tracker, a polygon of a three-dimensional object; generating, by a first vehicle signal detector, geometric information and semantic information for a first light of the three-dimensional object within a first two-dimensional image of the three-dimensional object captured by a camera on a vehicle, wherein geometric information for the first light comprises location of the first light on the three-dimensional object, and semantic information comprises a classification of the first light; generating, by a second vehicle signal detector, geometric information and semantic information for a second light within a second two-dimensional image of the three-dimensional object, wherein the first light is not captured in the second two-dimensional image, geometric information for the second light comprises location of the second light, and semantic information comprises a classification of the second light; collecting, over time, a first vector of geometric information and semantic information generated by the first vehicle signal detector; collecting, over time, a second vector of geometric information and semantic information generated by the second vehicle signal detector; fusing at least the first vector and a second vector into a matrix; determining, that the first light and the second light are associated with the polygon based on the geometric information generated by the first and second vehicle signal detectors; and determining a semantic meaning of the first light and the second light based on the matrix, and the determination that the first light and the second light are both associated with the polygon. 2. The method of claim 1 , further comprising, controlling the vehicle, based on the semantic meaning of the first light and the second light, wherein the controlling is accelerating, braking, or steering the vehicle. 3. The method of claim 1 , wherein generating, by the first vehicle signal detector, comprises: determining a first location of the first light on the three-dimensional object and a first color of the first light in the first two-dimensional image, wherein the semantic information for the first light is based on the first location and the first color. 4. The method of claim 1 , wherein: the first two-dimensional image and the second two-dimensional image are different images from different cameras. 5. The method of claim 1 , further comprising: determining that the first light and the second light are at substantially the same height of the polygon. 6. The method of claim 1 , wherein determining the semantic meaning of the first light and the second light comprises extracting frequency information from the matrix. 7. The method of claim 1 , wherein: geometric information for the first light comprises a first color of the first light; and geometric information for the second light comprises a second color of the second light. 8. One or more non-transitory, computer-readable media encoded with instructions that, when executed by one or more processing units, perform a method comprising: determining, by an object tracker, a polygon of a three-dimensional object; determining, by a first vehicle signal detector encoded by the instructions, geometric information and semantic information for a first light of the three-dimensional object within a first two-dimensional image of the three-dimensional object captured by a camera on a vehicle, wherein geometric information for the first light comprises location of the first light on the three-dimensional object, and semantic information comprises a semantic label of the first light generated by a first classifier of the first vehicle signal detector; determining, by a second vehicle signal detector encoded by the instructions, geometric information and semantic information for a second light within a second two-dimensional image, wherein the first light is not present in the second two-dimensional image geometric information for the second light comprises location of the second light, and semantic information comprises a semantic label of the second light generated by a second classifier of the second vehicle signal detector; accumulating, overtime, a first vector of geometric information and semantic information determined by the first vehicle signal detector; accumulating, over time, a second vector of geometric information and semantic information determined by the second vehicle signal detector; forming a matrix with at least the first vector and the second vector; and determining a semantic meaning of the first light and the second light based on the matrix, and a determination, from the geometric information of the first light, the geometric information of the second light, and the polygon, that the first light and the second light are both associated with the polygon. 9. The one or more non-transitory, computer-readable media of claim 8 , the method further comprising: controlling the vehicle, based on the semantic meaning of the first light and the second light, wherein the controlling is accelerating, braking, or steering the vehicle. 10. The one or more non-transitory, computer-readable media of claim 8 , the method further comprising: determining a first location of the first light on the three-dimensional object and a first color of the first light in the first two-dimensional image, wherein the semantic information for the first light is based on the first location and the first color. 11. The one or more non-transitory, computer-readable media of claim 8 , wherein: determining the semantic meaning comprises applying a filter on the matrix. 12. The one or more non-transitory, computer-readable media of claim 8 , wherein: determining the semantic meaning comprises applying logic rules on the matrix. 13. The one or more non-transitory, computer-readable media of claim 8 , wherein: determining the semantic meaning comprises applying a supervised or unsupervised learning technique on the matrix. 14. The one or more non-transitory, computer-readable media of claim 8 , wherein: geometric information for the first light comprises a first color of the first light; and geometric information for the second light comprises a second color of the second light. 15. A vehicle, comprising: one or more memories including instructions; one or more processors to execute the instructions; a body including a camera; and a first vehicle signal detector encoded in the instructions to: receiving, from an object tracker, a polygon of a three-dimensional object; determine geometric information and semantic information for a first light of the three-dimensional object within a first two-dimensional image of the three-dimensional object captured by the camera, wherein geometric information for the first light comprises location of the first light, and semantic information comprises a semantic classification of the first light; and generate a first vector using geometric information and semantic information collected from the first vehicle signal detector over time; a second vehicle signal detector encoded in the instructions to: determine geometric information and semantic information for a second light within a second two-dimensional image, wherein the first light is not captured in the second two-dimensional image, geometric information for the second light comprises location of the second light, and semantic information comprises a semantic classification of the second light; and generate a second vector using geometric information and semantic informati

Assignees

Inventors

Classifications

  • of extracted features · CPC title

  • the classifiers operating on different input data, e.g. multi-modal recognition · CPC title

  • of results relating to different input data, e.g. multimodal recognition · CPC title

  • of extracted features · CPC title

  • Fusion techniques · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11288527B2 cover?
A vehicle includes one or more cameras that capture a plurality of two-dimensional images of a three-dimensional object. A light detector and/or a semantic classifier search within those images for lights of the three-dimensional object. A vehicle signal detection module fuses information from the light detector and/or the semantic classifier to produce a semantic meaning for the lights. The ve…
Who is the assignee on this patent?
Gm Cruise Holdings Llc
What technology area does this patent fall under?
Primary CPC classification G06V20/584. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 29 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).