Conditional adaptation network for image classification

US10289909B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10289909-B2
Application numberUS-201715450620-A
CountryUS
Kind codeB2
Filing dateMar 6, 2017
Priority dateMar 6, 2017
Publication dateMay 14, 2019
Grant dateMay 14, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method and apparatus for classifying an image. In one example, the method may include receiving one or more images associated with a source domain and one or more images associated with a target domain, identifying one or more source domain features based on the one or more images associated with the source domain, identifying one or more target domain features based on the one or more images associated with the target domain, training a conditional maximum mean discrepancy (CMMD) engine based on a difference between the one or more source domain features and the one or more target domain features, applying the CMMD engine to the one or more images associated with the target domain to generate one or more labels for each unlabeled target image of the one or more images associated with the target domain and classifying each one of the one or more images in the target domain using the one or more labels.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for classifying an image, comprising: receiving, by a processor, one or more images associated with a source domain and one or more images associated with a target domain; identifying, by the processor, a plurality of source layers, wherein each one of the plurality of source layers comprises one or more source domain features based on the one or more images associated with the source domain via a plurality of deep neural networks; identifying, by the processor, a plurality of domain layers, wherein each one of the plurality of domain layers comprises one or more target domain features based on the one or more images associated with the target domain via the plurality of deep neural networks; training, by the processor, a plurality of conditional maximum mean discrepancy (CMMD) engines based on a difference between the one or more source domain features and the one or more target domain features, wherein each one of the plurality of CMMD engines receives the one or more source domain features from a respective source layer of the plurality of source layers and the one or more target domain features from a respective domain layer of the plurality of domain layers, wherein a number of the plurality of CMMD engines is equal to a number of the plurality of deep neural networks; applying, by the processor, the plurality of CMMD engines to the one or more images associated with the target domain to generate one or more labels for each unlabeled target image of the one or more images associated with the target domain; and classifying, by the processor, each one of the one or more images in the target domain using the one or more labels. 2. The method of claim 1 , wherein the plurality of CMMD engines is optimized by a gradient descent algorithm. 3. The method of claim 1 , wherein the image comprises text. 4. A non-transitory computer-readable medium storing a plurality of instructions, which when executed by a processor, causes the processor to perform operations for classifying an image comprising: receiving one or more images associated with a source domain and one or more images associated with a target domain; identifying a plurality of source layers, wherein each one of the plurality of source layers comprises one or more source domain features based on the one or more images associated with the source domain via a plurality of deep neural networks; identifying a plurality of domain layers, wherein each one of the plurality of domain layers comprises one or more target domain features based on the one or more images associated with the target domain via the plurality of deep neural networks; training a plurality of conditional maximum mean discrepancy (CMMD) engines based on a difference between the one or more source domain features and the one or more target domain features, wherein each one of the plurality of CMMD engines receives the one or more source domain features from a respective source layer of the plurality of source layers and the one or more target domain features from a respective domain layer of the plurality of domain layers, wherein a number of the plurality of CMMD engines is equal to a number of the plurality of deep neural networks; applying the plurality of CMMD engines to the one or more images associated with the target domain to generate one or more labels for each unlabeled target image of the one or more images associated with the target domain; and classifying each one of the one or more images in the target domain using the one or more labels. 5. The non-transitory computer-readable medium of claim 4 , wherein the plurality of CMMD engines is optimized by a gradient descent algorithm. 6. The non-transitory computer-readable medium of claim 4 , wherein the image comprises text. 7. A method for classifying an image, comprising: receiving, by a processor of an image classification system, one or more images associated with a source domain where the image classification system was previously deployed and trained and one or more images associated with a target domain where the image classification system is re-deployed; obtaining, by the processor, one or more source domain features from the one or more images associated with the source domain based on the image classification system that was previously deployed and trained, wherein the one or more source domain features are identified in a plurality of source layers by a plurality of deep neural networks; identifying, by the processor, a plurality of domain layers, wherein each one of the plurality of domain layers comprises one or more target domain features based on the one or more images associated with the target domain via the plurality of deep neural networks; training, by the processor, a plurality of conditional maximum mean discrepancy (CMMD) engines based on a shift between the one or more source domain features and the one or more target domain features, wherein each one of the plurality of CMMD engines receives the one or more source domain features from a respective source layer of the plurality of source layers and the one or more target domain features from a respective domain layer of the plurality of domain layers, wherein a number of the plurality of CMMD engines is equal to a number of the plurality of deep neural networks; applying, by the processor, the plurality of CMMD engines to the one or more images associated with the target domain to generate one or more labels for each unlabeled target image of the one or more images associated with the target domain; and classifying, by the processor, each one of the one or more images in the target domain using the one or more labels.

Assignees

Inventors

Classifications

  • using neural networks · CPC title

  • Validation; Performance evaluation · CPC title

  • using classification, e.g. of video objects · CPC title

  • G06V10/454Primary

    Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN] · CPC title

  • Scenes; Scene-specific elements (control of digital cameras H04N23/60) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10289909B2 cover?
A method and apparatus for classifying an image. In one example, the method may include receiving one or more images associated with a source domain and one or more images associated with a target domain, identifying one or more source domain features based on the one or more images associated with the source domain, identifying one or more target domain features based on the one or more images…
Who is the assignee on this patent?
Xerox Corp
What technology area does this patent fall under?
Primary CPC classification G06V10/454. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 14 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).