Method for processing information, information processing apparatus, and non-transitory computer-readable recording medium

US10796184B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10796184-B2
Application numberUS-201916394062-A
CountryUS
Kind codeB2
Filing dateApr 25, 2019
Priority dateNov 9, 2016
Publication dateOct 6, 2020
Grant dateOct 6, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Inputting an image to a neural network, performing convolution on a current frame included in the image to calculate a current feature map, which is a feature map at a present time, combining a past feature map, which is obtained by performing convolution on a past frame included in the image, and the current feature map, estimating an object candidate area using the combined past feature map and current feature map, estimating positional information and identification information regarding the one or more objects included in the current frame using the combined past feature map and current feature map and the estimated object candidate area, and outputting the positional information and the identification information regarding the one or more objects included in the current frame of the image estimated in the estimating as object detection results are included.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for processing information achieved by a computer using a neural network, the method comprising: inputting an image including one or more objects to the neural network; causing a convolutional layer included in the neural network to perform convolution on a current frame included in the image to calculate a current feature map; which is a feature map at a present time; causing a combiner for combining two or more feature maps into one feature map to combine a past feature map, which is a feature map obtained by causing the convolutional layer to perform convolution on a past frame included in the image and preceding the current frame, and the current feature map; causing a region proposal network included in the neural network to estimate an object candidate area using the combined past feature map and current feature map, the region proposal network being used to estimate the object candidate area; causing a region of interest pooling layer included in the neural network to estimate positional information and identification information regarding the one or more objects included in the current frame using the combined past feature map and current feature map and the estimated object candidate area, the region of interest pooling layer being used to perform class estimation; and outputting the positional information and the identification information regarding the one or more objects included in the current frame of the image estimated in the causing as object detection results. 2. The method according to claim 1 , wherein the neural network includes three or more convolutional layers, wherein one of the three or more convolutional layers is caused to perform convolution on the current frame included in the image to calculate the current feature map, and wherein the corresponding ones of the three or more convolutional layers other than the foregoing convolution layer are caused to perform convolution on the past frame included in the image to calculate the past feature map. 3. The method according to claim 1 , wherein the neural network includes a convolutional layer, wherein the convolutional layer is caused to perform convolution on the past frame included in the image to calculate the past feature map and store the past feature map in a memory, and wherein, when the past feature map and the current feature map are combined with each other, the past feature map stored in the memory and the current feature map obtained by causing the convolutional layer to perform convolution on the current frame included in the image are combined with each other. 4. The method according to claim 1 , wherein the convolutional layer is a network model lighter than a certain network model. 5. The method according to claim 4 , wherein the lighter network model is a network model whose processing speed at which the computer performs the causing using the neural network is higher than 5 fps. 6. The method according to claim 4 , wherein the lighter network model is SqueezeNet including a plurality of fire modules, each of which includes a squeeze layer, which is a 1×1 filter, and an expand layer, in which a 1×1 filter and a 3×3 filter are arranged in parallel with each other. 7. A non-transitory computer-readable recording medium storing a program for causing a computer to perform operations comprising: inputting an image including one or more objects to a neural network; causing a convolutional layer included in the neural network to perform convolution on a current frame included in the image to calculate a current feature map, which is a feature map at a present time; causing a combiner for combining two or more feature maps into one feature map to combine a past feature map, which is a feature map obtained by causing the convolutional layer to perform convolution on a past frame included in the image and preceding the current frame, and the current feature map; causing a region proposal network included in the neural network to estimate an object candidate area using the combined past feature map and current feature map, the region proposal network being used to estimate the object candidate area; causing a region of interest pooling layer included in the neural network to estimate positional information and identification information regarding the one or more objects included in the current frame using the combined past feature map and current feature map and the estimated object candidate area, the region of interest pooling layer being used to perform class estimation; and outputting the positional information and the identification information regarding the one or more objects included in the current frame of the image estimated in the causing as object detection results. 8. An information processing apparatus achieved by a computer using a neural network, the information processing apparatus comprising: an inputter that inputs an image including one or more objects to the neural network; a processor that causes a convolutional layer included in the neural network to perform convolution on a current frame included in the image to calculate a current feature map, which is a feature map at a present time, that causes a combiner for combining two or more feature maps into one feature map to combine a past feature map, which is a feature map obtained by causing the convolutional layer to perform convolution on a past frame included in the image and preceding the current frame, and the current feature map, that causes a region proposal network included in the neural network to estimate an object candidate area using the combined past feature map and current feature map, the region proposal network being used to estimate the object candidate area, and that causes a region of interest pooling layer included in the neural network to estimate positional information and identification information regarding the one or more objects included in the current frame using the combined past feature map and current feature map and the estimated object candidate area, the region of interest pooling layer being used to perform class estimation; and an outputter that outputs the positional information and the identification information regarding the one or more objects included in the current frame of the image estimated by the processor as object detection results.

Assignees

Inventors

Classifications

  • exterior to a vehicle by using sensors mounted on the vehicle · CPC title

  • G06V10/82Primary

    using neural networks · CPC title

  • Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN] · CPC title

  • using classification, e.g. of video objects · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10796184B2 cover?
Inputting an image to a neural network, performing convolution on a current frame included in the image to calculate a current feature map, which is a feature map at a present time, combining a past feature map, which is obtained by performing convolution on a past frame included in the image, and the current feature map, estimating an object candidate area using the combined past feature map a…
Who is the assignee on this patent?
Panasonic Ip Man Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06V10/82. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Oct 06 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).