System for domain adaptation with a domain-specific class means classifier

US2016078359A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2016078359-A1
Application numberUS-201414504837-A
CountryUS
Kind codeA1
Filing dateOct 2, 2014
Priority dateSep 12, 2014
Publication dateMar 17, 2016
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A classification system includes memory which stores, for each of a set of classes, a classifier model for assigning a class probability to a test sample from a target domain. The classifier model has been learned with training samples from the target domain and from at least one source domain. Each classifier model models the respective class as a mixture of components, the component mixture including a component for each source domain and a component for the target domain. Each component is a function of a distance between the test sample and a domain-specific class representation which is derived from the training samples of the respective domain that are labeled with the class, each of the components in the mixture being weighted by a respective mixture weight. Instructions, implemented by a processor, are provided for labeling the test sample based on the class probabilities assigned by the classifier models.

First claim

Opening claim text (preview).

What is claimed is: 1 . A classification system comprising: memory which stores: for each of a set of classes, a classifier model for assigning a class probability to a test sample from a target domain, the classifier model having been learned with training samples from the target domain and training samples from at least one source domain different from the target domain, each classifier model modeling the respective class as a mixture of components, the mixture of components including a component for each of the at least one source domain and a component for the target domain, each component being a function of a distance between the test sample and a domain-specific class representation which is derived from the training samples of the respective domain that are labeled with the class, each of the components in the mixture being weighted by a respective mixture weight; and instructions for labeling the test sample based on the class probabilities assigned by the classifier models; and a processor in communication with the memory which executes the instructions. 2 . The system of claim 1 , wherein each component is an exponentially decreasing function of the distance between the test sample and the domain-specific class representation. 3 . The system of claim 1 , wherein each domain-specific class representation is an average of the training samples of the respective domain that are labeled with the class. 4 . The system of claim 1 , wherein the distance between the test sample and each domain-specific class representation is computed in an embedding space into which the test sample and each domain-specific class representation is embedded with the same metric. 5 . The system of claim 1 , where the mixture components are Gaussian functions and the inverse of their covariance is shared and approximated by a low-rank matrix. 6 . The system of claim 1 , wherein the classifier models are learned by maximizing a sum of the log of the training sample class posteriors. 7 . The system of claim 1 , wherein the classifier model is of the form: p  ( c | x i ) = 1 z i  w c  ∑ d = 1 D  w d  ( exp  ( - 1 2  d W  ( x i , μ d c ) ) ) ( 6 ) or is a function thereof, where p(c|x i ) represents the posterior probability of the class c for the test sample x i ; w c represents the class specific mixture weight, that can be constant; w d represents the mixture weight for a respective mixture component exp(−½d W (x i ,μ d c ), where d W (x i ,μ d c ) represents the distance between sample x i and the domain-specific class representation μ d c for a domain d selected from the target domain and the at least one source domain, and W represents an optional metric for embedding sample x i and each of the domain-specific class representations μ d c in a common embedding space; and Z i is an optional normalizing factor. 8 . The system of claim 1 , wherein the training samples and test sample are multidimensional representations. 9 . The system of claim 7 , wherein the multidimensional representations are derived from images, videos, sounds, text or other multimedia documents. 10 . The system of claim 1 , wherein each of the source domain training samples is labeled with a label for one of the classes and fewer than all of the target domain training samples are labeled with a label for any of the classes. 11 . The system of claim 9 , wherein at least some of the target domain training samples are labeled with a label for at least one of the classes. 12 . The system of claim 10 , wherein the learning includes for each of a plurality of iterations, performing at least one of: adding to an active training set, which is derived from the source domain samples and labeled target domain samples, a most confident unlabeled target domain sample for each class, and removing from the active training set a least confident source domain sample from each class; and retraining a metric based on the active training set which is used to embed the test sample and a domain-specific class representation into an embedding space in which the distance is computed. 13 . The system of claim 1 , wherein the mixture weight for the target domain is higher than for each of the at least one source domains. 14 . The system o

Assignees

Inventors

Classifications

  • Validation; Performance evaluation · CPC title

  • Proximity, similarity or dissimilarity measures · CPC title

  • using classification, e.g. of video objects · CPC title

  • Probabilistic graphical models, e.g. probabilistic networks · CPC title

  • G06N20/00Primary

    Machine learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016078359A1 cover?
A classification system includes memory which stores, for each of a set of classes, a classifier model for assigning a class probability to a test sample from a target domain. The classifier model has been learned with training samples from the target domain and from at least one source domain. Each classifier model models the respective class as a mixture of components, the component mixture i…
Who is the assignee on this patent?
Xerox Corp
What technology area does this patent fall under?
Primary CPC classification G06N20/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Mar 17 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).