Tagging similar images using neural network

US9740966B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-9740966-B1
Application numberUS-201615016607-A
CountryUS
Kind codeB1
Filing dateFeb 5, 2016
Priority dateFeb 5, 2016
Publication dateAug 22, 2017
Grant dateAug 22, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An approach is provided in which a knowledge manager selects an extraction layer from a convolutional neural network that was trained on an initial set of images. The knowledge manager processes subsequent images obtained from crawling a computer network that includes extracting image feature sets of the subsequent images from the selected extraction layer and generating tags from metadata associated with the subsequent images. In turn, the knowledge manager receives a new image, extracts a new image feature set from the selected extraction layer, and assigns one or more of the tags to the new image based upon evaluating the new image feature set to the image features sets of the subsequent images.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method implemented by an information handling system that includes a memory and a processor, the method comprising: training a classifier on a first set of images, wherein the classifier comprises a convolutional neural network that includes a plurality of layers; processing a second set of images using the trained classifier, wherein the processing comprises: extracting a plurality of feature sets from a selected layer included in the plurality of layers; and generating a plurality of tags based upon metadata corresponding to the second set of images; selecting a subset of the plurality of tags based on performing a nearest neighbors search on a new image feature set corresponding to a new image, wherein the new image feature set is extracted from the selected layer during processing of the new image; and assigning at least one of the subset of the plurality of tags to the new image based upon voting on the subset of the plurality of tags. 2. The method of claim 1 wherein the selected layer is not a last layer in the plurality of layers. 3. The method of claim 2 wherein the plurality of layers includes a set of convolutional layers and a subsequent set of fully connected non-convolutional layers, and wherein the selected layer is a first layer in the subsequent set of fully-connected non-convolutional layers. 4. The method of claim 1 wherein the first set of images includes a first set of object types that are different from a second set of object types included in the second set of images. 5. The method of claim 1 further comprising searching the Internet to obtain the second set of images. 6. An information handling system comprising: one or more processors; a memory coupled to at least one of the processors; and a set of computer program instructions stored in the memory and executed by at least one of the processors in order to perform actions of: training a classifier on a first set of images, wherein the classifier comprises a convolutional neural network that includes a plurality of layers; processing a second set of images using the trained classifier, wherein the processing comprises: extracting a plurality of feature sets from a selected layer included in the plurality of layers; and generating a plurality of tags based upon metadata corresponding to the second set of images; selecting a subset of the plurality of tags based on performing a nearest neighbors search on a new image feature set corresponding to a new image, wherein the new image feature set is extracted from the selected layer during processing of the new image; and assigning at least one of the subset of the plurality of tags to the new image based upon voting on the subset of the plurality of tags. 7. The information handling system of claim 6 wherein the selected layer is not a last layer in the plurality of layers. 8. The information handling system of claim 7 wherein the plurality of layers includes a set of convolutional layers and a subsequent set of fully connected non-convolutional layers, and wherein the selected layer is a first layer in the subsequent set of fully-connected non-convolutional layers. 9. The information handling system of claim 6 wherein the first set of images includes a first set of object types that are different from a second set of object types included in the second set of images. 10. The information handling system of claim 6 wherein at least one of the one or more processors perform additional actions comprising: searching the Internet to obtain the second set of images. 11. A computer program product stored in a computer readable storage medium, comprising computer program code that, when executed by an information handling system, causes the information handling system to perform actions comprising: training a classifier on a first set of images, wherein the classifier comprises a convolutional neural network that includes a plurality of layers; processing a second set of images using the trained classifier, wherein the processing comprises: extracting a plurality of feature sets from a selected layer included in the plurality of layers; and generating a plurality of tags based upon metadata corresponding to the second set of images; selecting a subset of the plurality of tags based on performing a nearest neighbors search on a new image feature set corresponding to a new image, wherein the new image feature set is extracted from the selected layer during processing of the new image; and assigning at least one of the subset of the plurality of tags to the new image based upon voting on the subset of the plurality of tags. 12. The computer program product of claim 11 wherein the selected layer is not a last layer in the plurality of layers. 13. The computer program product of claim 12 wherein the plurality of layers includes a set of convolutional layers and a subsequent set of fully connected non-convolutional layers, and wherein the selected layer is a first layer in the subsequent set of fully-connected non-convolutional layers. 14. The computer program product of claim 11 wherein the first set of images includes a first set of object types that are different from a second set of object types included in the second set of images.

Assignees

Inventors

Classifications

  • Classification techniques · CPC title

  • G06V10/82Primary

    using neural networks · CPC title

  • Distances to closest patterns, e.g. nearest neighbour classification · CPC title

  • using statistics or function optimisation, e.g. modelling of probability density functions · CPC title

  • Combinations of networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9740966B1 cover?
An approach is provided in which a knowledge manager selects an extraction layer from a convolutional neural network that was trained on an initial set of images. The knowledge manager processes subsequent images obtained from crawling a computer network that includes extracting image feature sets of the subsequent images from the selected extraction layer and generating tags from metadata asso…
Who is the assignee on this patent?
IBM, IBM
What technology area does this patent fall under?
Primary CPC classification G06V10/82. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 22 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).