Label-embedding view of attribute-based recognition

US10331976B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10331976-B2
Application numberUS-201313923639-A
CountryUS
Kind codeB2
Filing dateJun 21, 2013
Priority dateJun 21, 2013
Publication dateJun 25, 2019
Grant dateJun 25, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In image classification, each class of a set of classes is embedded in an attribute space where each dimension of the attribute space corresponds to a class attribute. The embedding generates a class attribute vector for each class of the set of classes. A set of parameters of a prediction function operating in the attribute space respective to a set of training images annotated with classes of the set of classes is optimized such that the prediction function with the optimized set of parameters optimally predicts the annotated classes for the set of training images. The prediction function with the optimized set of parameters is applied to an input image to generate at least one class label for the input image. The image classification does not include applying a class attribute classifier to the input image.

First claim

Opening claim text (preview).

The invention claimed is: 1. A non-transitory storage medium storing instructions readable and executable by an electronic data processing device to perform an image cataloging method including the operations of: representing classes of a set of classes Y={y j , j=1, . . . , C} by class attribute vectors φ(y j ) where φ is an embedding function that embeds a class in an attribute space of dimensionality E where each dimension a i , i=1, . . . , E of the attribute space corresponds to a class attribute of a set of E class attributes; representing training images x n labeled by respective training image class labels y n as θ(x n ) where θ is an embedding function that embeds an image in an image feature space of dimensionality D image features, the training images not being labeled with class attributes of the attribute space of dimensionality E; optimizing, respective to the training images, a set of parameters w of a prediction function f ⁡ ( x ; w ) = arg ⁢ ⁢ max y ∈ Y ⁢ ⁢ { θ ⁡ ( x ) ′ ⁢ W ⁢ ⁢ φ ⁡ ( y ) } where x denotes an image and y denotes a class of the set of classes Y and where W is a D×E matrix representing the set of parameters w of the prediction function ƒ(x;w) wherein the training images respective to which the set of parameters w of the prediction function ƒ(x;w) is optimized do not include training images that are labeled with class attributes of the set of E class attributes; applying the prediction function ƒ(x;w) with the optimized set of parameters W to an input image to generate at least one class label for the input image; and cataloging the input image using the at least one class label for the input image, the cataloging including storing the input image or a pointer to the input image in an images database structured by image class. 2. The non-transitory storage medium as set forth in claim 1 wherein the applying does not include applying image attribute classifiers to the input image. 3. The non-transitory storage medium as set forth in claim 1 wherein the class attributes of the attribute space include image attributes. 4. The non-transitory storage medium as set forth in claim 3 wherein the set of classes Y comprises a hierarchy of classes and the class attributes of the attribute space further include class hierarchy attributes derived from the hierarchy of classes. 5. The non-transitory storage medium as set forth in claim 1 wherein the set of classes Y comprises a hierarchy of classes and the class attributes of the attribute space include class hierarchy attributes derived from the hierarchy of classes. 6. The non-transitory storage medium as set forth in claim 1 wherein the applying further includes: generating class attribute scores for the input image for class attributes of the attribute space based on θ(x in )′W where x in denotes the input image. 7. The non-transitory storage medium as set forth in claim 1 wherein the optimizing includes optimizing at least one of a ranking objective, a multiclass objective, and a regression objective. 8. The non-transitory storage medium of claim 1 further comprising: receiving assignments of class attributes of the set of class attributes to classes of the set of classes; wherein the representing of the classes by class attribute vectors φ(y j ) uses the received assignments of class attributes to the classes. 9. An apparatus for performing image retrieval, the apparatus comprising: an electronic data processing device programmed to perform a method including the operations of: receiving assignments of class attributes of a set of class attributes to classes of a set of classes; embedding classes of the set of classes Y={y j , j=1, . . . , C} in an attribute space of dimensionality E using the received assignments of class attributes to the classes, where each dimension a i , i=1, . . . , E of the attribute space corresponds to a class attribute of the set of E class attributes; embedding training images x n labeled by respective training image class labels y n in an image feature space of dimensionality D image features wherein none of the training images are labeled with class attributes of the attribute space; optimizing a set of parameters w of a prediction function y = f ⁡ ( x ; w ) = arg ⁢ ⁢ max y ∈ Y ⁢ { θ ⁡ ( x ) ′ ⁢ W ⁢ ⁢ φ ⁡ ( y ) }  operating in the image feature space an

Assignees

Inventors

Classifications

  • Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods · CPC title

  • Aspects of pattern recognition specially adapted for signal processing · CPC title

  • Classification techniques · CPC title

  • Multiple classes · CPC title

  • based on discrimination criteria, e.g. discriminant analysis · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10331976B2 cover?
In image classification, each class of a set of classes is embedded in an attribute space where each dimension of the attribute space corresponds to a class attribute. The embedding generates a class attribute vector for each class of the set of classes. A set of parameters of a prediction function operating in the attribute space respective to a set of training images annotated with classes of…
Who is the assignee on this patent?
Xerox Corp
What technology area does this patent fall under?
Primary CPC classification G06V10/7715. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 25 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).