Subcategory-aware convolutional neural networks for object detection

US9965719B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9965719-B2
Application numberUS-201615342823-A
CountryUS
Kind codeB2
Filing dateNov 3, 2016
Priority dateNov 4, 2015
Publication dateMay 8, 2018
Grant dateMay 8, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computer-implemented method for detecting objects by using subcategory-aware convolutional neural networks (CNNs) is presented. The method includes generating object region proposals from an image by a region proposal network (RPN) which utilizes subcategory information, and classifying and refining the object region proposals by an object detection network (ODN) that simultaneously performs object category classification, subcategory classification, and bounding box regression. The image is an image pyramid used as input to the RPN and the ODN. The RPN and the ODN each include a feature extrapolating layer to detect object categories with scale variations among the objects.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for detecting objects by using subcategory-aware convolutional neural networks (CNNs), the method comprising: generating object region proposals from an image by a region proposal network (RPN) which utilizes subcategory information; and classifying and refining the object region proposals by an object detection network (ODN) that simultaneously performs object category classification, subcategory classification, and bounding box regression, wherein the RPN and the ODN each include a feature extrapolating layer to detect object categories with scale variations among the objects. 2. The method of claim 1 , wherein the image is an image pyramid used as input to the RPN and the ODN. 3. The method of claim 1 , wherein the RPN includes a subcategory convolution layer such that each filter in the convolution layer corresponds to an object subcategory. 4. The method of claim 3 , wherein the subcategory convolution layer outputs heats maps indicating a presence of subcategories at a specific location and a specific scale. 5. The method of claim 4 , wherein the heat maps are used to create a region of interest (RoI) generating layer for generating object candidates by thresholding the heat maps. 6. The method of claim 5 , wherein the RPN terminates at two layers, one layer that outputs softmax probability estimates over object subcategories and another layer that refines RoI location with a bounding box regressor. 7. A system for detecting objects by using subcategory-aware convolutional neural networks (CNNs), the system comprising: a memory; and a hardware computer processor operatively coupled to the memory, the processor being configured for: generating object region proposals from an image by a region proposal network (RPN) which utilizes subcategory information; and classifying and refining the object region proposals by an object detection network (ODN) that simultaneously performs object category classification, subcategory classification, and bounding box regression, wherein the RPN and the ODN each include a feature extrapolating layer to detect object categories with scale variations among the objects. 8. The system of claim 7 , wherein the image is an image pyramid used as input to the RPN and the ODN. 9. The system of claim 7 , wherein the RPN includes a subcategory convolution layer such that each filter in the convolution layer corresponds to an object subcategory. 10. The system of claim 9 , wherein the subcategory convolution layer outputs heats maps indicating a presence of subcategories at a specific location and a specific scale. 11. The system of claim 10 , wherein the heat maps are used to create a region of interest (RoI) generating layer for generating object candidates by thresholding the heat maps. 12. The system of claim 11 , wherein the RPN terminates at two layers, one layer that outputs softmax probability estimates over object subcategories and another layer that refines RoI location with a bounding box regressor. 13. A non-transitory computer-readable storage medium comprising a computer-readable program for detecting objects by using subcategory-aware convolutional neural networks (CNNs), wherein the computer-readable program when executed on a computer, using a hardware computer processor coupled to the non-transitory computer-readable storage medium, causes the computer to perform the steps of: generating object region proposals from an image by a region proposal network (RPN) which utilizes subcategory information; and classifying and refining the object region proposals by an object detection network (ODN) that simultaneously performs object category classification, subcategory classification, and bounding box regression, wherein the RPN and the ODN each include a feature extrapolating layer to detect object categories with scale variations among the objects. 14. The non-transitory computer-readable storage medium of claim 13 , wherein the image is an image pyramid used as input to the RPN and the ODN. 15. The non-transitory computer-readable storage medium of claim 13 , wherein the RPN includes a subcategory convolution layer such that each filter in the convolution layer corresponds to an object subcategory. 16. The non-transitory computer-readable storage medium of claim 15 , wherein the subcategory convolution layer outputs heats maps indicating a presence of subcategories at a specific location and a specific scale. 17. The non-transitory computer-readable storage medium of claim 16 , wherein the heat maps are used to create a region of interest (RoI) generating layer for generating object candidates by thresholding the heat maps.

Assignees

Inventors

Classifications

  • Obstacle · CPC title

  • using feature-based methods · CPC title

  • Artificial neural networks [ANN] · CPC title

  • Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN] · CPC title

  • using classification, e.g. of video objects · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9965719B2 cover?
A computer-implemented method for detecting objects by using subcategory-aware convolutional neural networks (CNNs) is presented. The method includes generating object region proposals from an image by a region proposal network (RPN) which utilizes subcategory information, and classifying and refining the object region proposals by an object detection network (ODN) that simultaneously performs …
Who is the assignee on this patent?
Nec Lab America Inc, Nec Corp
What technology area does this patent fall under?
Primary CPC classification G06V10/82. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 08 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).