Coarse-to-fine attention networks for light signal detection and recognition

US12087064B2 · US · B2

Patent metadata
Field: Value
Publication number: US-12087064-B2
Application number: US-202318230775-A
Country: US
Kind code: B2
Filing date: Aug 7, 2023
Priority date: Sep 4, 2020
Publication date: Sep 10, 2024
Grant date: Sep 10, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A vehicle light signal detection and recognition method, system, and computer program product include bounding, using a coarse attention module, one or more regions of an image of an automobile including at least one of a brake light and a signal light generated by automobile signals which include illuminated sections to generate one or more bounded region, removing, using a fine attention module, noise from the one or more bounded regions to generate one or more noise-free bounded regions, and identifying the at least one of the brake light and the signal light from the one or more noise-free bounded regions.
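The three stages named in the abstract (bound, remove noise, identify) can be illustrated with a toy data-flow sketch. This is not the patented method: the attention modules in the patent are learned networks, whereas here plain brightness thresholds stand in for them. The function names `coarse_bound`, `fine_denoise`, and `is_lit`, and both threshold values, are hypothetical and chosen only for illustration.

```python
def coarse_bound(image, threshold):
    """Coarse stage stand-in: box around every pixel at or above `threshold`.

    Returns (top, left, bottom, right), inclusive, or None if nothing is bright.
    """
    hits = [(r, c)
            for r, row in enumerate(image)
            for c, value in enumerate(row)
            if value >= threshold]
    if not hits:
        return None
    rows = [r for r, _ in hits]
    cols = [c for _, c in hits]
    return min(rows), min(cols), max(rows), max(cols)


def fine_denoise(image, box, threshold):
    """Fine stage stand-in: crop the box and zero pixels below a stricter threshold."""
    top, left, bottom, right = box
    return [[value if value >= threshold else 0
             for value in row[left:right + 1]]
            for row in image[top:bottom + 1]]


def is_lit(region):
    """Identification stand-in: report whether any pixel survived denoising."""
    return any(value > 0 for row in region for value in row)


# A 4 x 4 grayscale patch: a dim glow (120, 130) next to a bright lamp (250, 240).
image = [
    [0,   0,   0, 0],
    [0, 120, 250, 0],
    [0, 130, 240, 0],
    [0,   0,   0, 0],
]
box = coarse_bound(image, threshold=100)          # loose box around everything bright
region = fine_denoise(image, box, threshold=200)  # keep only the strongly lit pixels
lit = is_lit(region)
```

The point of the sketch is only the ordering: a cheap, permissive first pass narrows the search area, and a stricter second pass cleans up what the first pass let through.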

First claim

Opening claim text (preview).

What is claimed is:

1. A computer-implemented vehicle light signal detection and recognition method, the method comprising: bounding as a bounded region, using a coarse attention module with an input of a smaller feature map, self-luminous objects in one or more regions of an image of an automobile including at least one of a brake light and a signal light generated by automobile signals; obtaining a first refined feature map, using the coarse attention module, by calculating an average of all previous features of a same category of the bounded region as a channel expected attention score (C-E score) of each category which is multiplied by the smaller feature map; generating, using a fine attention module with an input of a bigger feature map, a second refined feature map to localize precise discriminative regions for the bounded region by using the bigger feature map, wherein the smaller feature map and the bigger feature map are generated by a Region of Interest (ROI) pooling with different Kernels in an original feature map; and performing classification and localization by converting the first refined feature map and the second refined feature map to a value to calculate coordinates in the bounded region and categories of objects in the bounding regions.

2. The method of claim 1, embodied in a cloud-computing environment.

3. A computer-implemented vehicle light signal detection and recognition method, the method comprising: bounding as a bounded region, using a coarse attention module with an input of a smaller feature map, self-luminous objects in one or more regions of an image of an automobile including at least one of a brake light and a signal light generated by automobile signals, wherein the coarse attention module includes an attention score branch and an expected score learning branch, wherein, in the attention score learning branch, the coarse attention module converts the smaller feature map into an original feature vector through global average pooling (GAP) which is used by the coarse attention module to calculate a coarse attention score (C-A score), and wherein, in the expected score learning branch, the coarse attention module calculates an average of all previous features of a same category of the bounded region as a channel expected attention score (C-E score) of each category, which is multiplied by the smaller feature map to obtain a first refined feature map; generating, using a fine attention module with an input of a bigger feature map, a second refined feature map to localize precise discriminative regions for the bounded region by using the C-A score, wherein the smaller feature map and the bigger feature map are generated by a Region of Interest (ROI) pooling with different Kernels in an original feature map; and performing classification and localization by converting the first refined feature map and the second refined feature map to a value to calculate coordinates in the bounded region and categories of objects in the bounding regions.

4. The method of claim 3, embodied in a cloud-computing environment.

5. A computer program product, the computer program product comprising a computer-readable storage medium having program instructions embodied therewith, the program instructions executable by a computer to cause the computer to perform: bounding as a bounded region, using a coarse attention module with an input of a smaller feature map, self-luminous objects in one or more regions of an image of an automobile including at least one of a brake light and a signal light generated by automobile signals; obtaining a first refined feature map, using the coarse attention module, by calculating an average of all previous features of a same category of the bounded region as a channel expected attention score (C-E score) of each category which is multiplied by the smaller feature map; generating, using a fine attention module with an input of a bigger feature map, a second refined feature map to localize precise discriminative regions for the bounded region by using the bigger feature map, wherein the smaller feature map and the bigger feature map are generated by a Region of Interest (ROI) pooling with different Kernels in an original feature map; and performing classification and localization by converting the first refined feature map and the second refined feature map to a value to calculate coordinates in the bounded region and categories of objects in the bounding regions.
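The coarse-attention steps recited in the claims (ROI pooling at two scales, global average pooling, and a per-category average used as the C-E score) can be sketched in a few lines of plain Python. This is one possible reading under stated assumptions, not the patented implementation: it assumes that "different Kernels" means different pooling kernel sizes, that max pooling is used, that the C-E score is a simple mean of the pooled feature vectors seen so far for a category, and that the score scales each channel of the smaller feature map. The names `max_pool`, `global_average_pool`, `ExpectedScoreTracker`, and `refine` are hypothetical.

```python
def max_pool(channel, kernel):
    """Non-overlapping max pooling of one H x W channel with a square kernel."""
    height, width = len(channel), len(channel[0])
    return [
        [max(channel[r + i][c + j] for i in range(kernel) for j in range(kernel))
         for c in range(0, width - kernel + 1, kernel)]
        for r in range(0, height - kernel + 1, kernel)
    ]


def global_average_pool(feature_map):
    """Collapse a C x H x W feature map (nested lists) into a length-C vector."""
    return [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature_map
    ]


class ExpectedScoreTracker:
    """Per-category mean of pooled feature vectors (assumed form of the C-E score)."""

    def __init__(self):
        self._sums = {}
        self._counts = {}

    def update(self, category, vector):
        sums = self._sums.setdefault(category, [0.0] * len(vector))
        self._sums[category] = [s + v for s, v in zip(sums, vector)]
        self._counts[category] = self._counts.get(category, 0) + 1

    def score(self, category):
        # Raises KeyError for a category that has never been observed.
        count = self._counts[category]
        return [s / count for s in self._sums[category]]


def refine(feature_map, channel_scores):
    """Scale every channel of the feature map by its per-channel score."""
    return [[[value * score for value in row] for row in channel]
            for channel, score in zip(feature_map, channel_scores)]


# Two scales from one 4 x 4 region: a larger kernel yields the smaller map.
region = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
bigger_map = max_pool(region, kernel=2)   # 2 x 2
smaller_map = max_pool(region, kernel=4)  # 1 x 1

# Two earlier "brake" samples, each a 2-channel, 1 x 2 feature map.
tracker = ExpectedScoreTracker()
tracker.update("brake", global_average_pool([[[1.0, 3.0]], [[0.0, 2.0]]]))
tracker.update("brake", global_average_pool([[[3.0, 5.0]], [[2.0, 4.0]]]))
ce_score = tracker.score("brake")
first_refined = refine([[[1.0, 1.0]], [[1.0, 1.0]]], ce_score)
```

Whether the current sample counts among "all previous features", and how the C-A score feeds the fine attention module, are not settled by the claim text shown here, so the sketch leaves both out.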

Assignees

IBM

Inventors

Classifications

  • References adjustable by an adaptive method, e.g. learning · CPC title

  • using neural networks · CPC title

  • relating to illumination properties, e.g. using a reflectance or lighting model · CPC title

  • Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN] · CPC title

  • Determination of region of interest [ROI] or a volume of interest [VOI] · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12087064B2 cover?
A vehicle light signal detection and recognition method, system, and computer program product include bounding, using a coarse attention module, one or more regions of an image of an automobile including at least one of a brake light and a signal light generated by automobile signals which include illuminated sections to generate one or more bounded region, removing, using a fine attention modu…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06V20/584. Mapped technology areas include Physics.
When was this patent published?
Publication date Sep 10, 2024 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).