Image classifier learning device, image classifier learning method, and program

US11989929B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11989929-B2
Application numberUS-201917277248-A
CountryUS
Kind codeB2
Filing dateSep 6, 2019
Priority dateSep 20, 2018
Publication dateMay 21, 2024
Grant dateMay 21, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An object is to make it possible to train an image recognizer by efficiently using training data that does not include label information. A determination unit 180 causes repeated execution of the followings. A feature representation model for extracting feature vectors of pixels is trained such that an objective function is minimized, the objective function being expressed as a function that includes a value that is based on a difference between a distance between feature vectors of pixels labeled with a positive example label and a distance between a feature vector of a pixel labeled with the positive example label and a feature vector of an unlabeled pixel, and a value that is based on a difference between a distance between a feature vector of a pixel labeled with the positive example label and a feature vector of an unlabeled pixel and a distance between a feature vector of a pixel labeled with the positive example label and a feature vector of a pixel labeled with a negative example label, and based on a distribution of feature vectors corresponding to the positive example label, a predetermined number of labels are given based on the likelihood that each unlabeled pixel is a positive example.

First claim

Opening claim text (preview).

The invention claimed is: 1. An image recognizer training device comprising a processor configured to execute operations, comprising: training a feature representation model based on a set of images that include an image including pixels each labeled with a positive example label or a negative example label and an image including unlabeled pixels, the feature representation model being a model for extracting feature vectors of the pixels and trained such that an objective function is minimized, the objective function being expressed as a function that includes a value that is based on a difference between a distance between feature vectors of pixels labeled with the positive example label and a distance between a feature vector of a pixel labeled with the positive example label and a feature vector of an unlabeled pixel, and a value that is based on a difference between a distance between a feature vector of a pixel labeled with the positive example label and a feature vector of an unlabeled pixel and a distance between a feature vector of a pixel labeled with the positive example label and a feature vector of a pixel labeled with the negative example label; extracting feature vectors of a plurality of pixels included in the images based on the feature representation model trained; estimating a distribution of feature vectors that are extracted with respect to pixels labeled with the positive example label; calculating, with respect to each unlabeled pixel, a likelihood that the pixel is a positive example based on the distribution estimated and give the positive example label to pixels, a number of which is a first sample number determined in advance, in descending order of the likelihood and give the negative example label to pixels, the number of which is a second sample number determined in advance, in ascending order of the likelihood; and determining whether or not there is an unlabeled pixel among pixels of the images, and if there is an unlabeled pixel, causes repeated execution of training, extraction of feature vectors, estimation, and labeling, wherein for each unlabeled pixel, the likelihood of the unlabeled pixel being a positive example is labeled based on a normal distribution and the feature vector of the unlabeled pixel. 2. The image recognizer training device according to claim 1 , wherein the objective function is expressed as a function that further includes a value that is based on a difference between a distance between feature vectors of pixels labeled with the same label and a distance between feature vectors of pixels labeled with different labels. 3. The image recognizer training device according to claim 1 , wherein the feature representation model includes a convolutional neural network. 4. The image recognizer training device according to claim 1 , wherein the objective function is associated with Triplet Loss. 5. An image recognizer training method comprising: training a feature representation model based on a set of images that include an image including pixels each labeled with a positive example label or a negative example label and an image including unlabeled pixels, the feature representation model being a model for extracting feature vectors of the pixels and trained such that an objective function is minimized, the objective function being expressed as a function that includes a value that is based on a difference between a distance between feature vectors of pixels labeled with the positive example label and a distance between a feature vector of a pixel labeled with the positive example label and a feature vector of an unlabeled pixel, and a value that is based on a difference between a distance between a feature vector of a pixel labeled with the positive example label and a feature vector of an unlabeled pixel and a distance between a feature vector of a pixel labeled with the positive example label and a feature vector of a pixel labeled with the negative example label; extracting feature vectors of a plurality of pixels included in the images based on the feature representation model trained; estimating a distribution of feature vectors that arc extracted with respect to pixels labeled with the positive example label; calculating with respect to each unlabeled pixel, a likelihood that the pixel is a positive example based on the distribution estimated and giving the positive example label to pixels, a number of which is a first sample number determined in advance, in descending order of the likelihood and giving the negative example label to pixels, the number of which is a second sample number determined in advance, in ascending order of the likelihood; and determining whether or not there is an unlabeled pixel among pixels of the images, and if there is an unlabeled pixel, causing repeated execution of training, extraction of feature vectors, estimation, and labeling, wherein for each unlabeled pixel, the likelihood of the unlabeled pixel being positive example is labeled based on a normal distribution and the feature vector of the unlabeled pixel. 6. The image recognizer training method according to claim 5 , wherein the objective function is expressed as a function that further includes a value that is based on a difference between a distance between feature vectors of pixels labeled with the same label and a distance between feature vectors of pixels labeled with different labels. 7. The image recognizer training method according to claim 5 , wherein the feature representation model includes a convolutional neural network. 8. The image recognizer training method according to claim 5 , wherein the objective function is associated with Triplet Loss. 9. A computer-readable non-transitory recording medium storing computer-executable instructions that when executed by a processor cause a computer system to: training a feature representation model based on a set of images that include an image including pixels each labeled with a positive example label or a negative example label and an image including unlabeled pixels, the feature representation model being a model for extracting feature vectors of the pixels and trained such that an objective function is minimized, the objective function being expressed as a function that includes a value that is based on a difference between a distance between feature vectors of pixels labeled with the positive example label and a distance between a feature vector of a pixel labeled with the positive example label and a feature vector of an unlabeled pixel, and a value that is based on a difference between a distance between a feature vector of a pixel labeled with the positive example label and a feature vector of an unlabeled pixel and a distance between a feature vector of a pixel labeled with the positive example label and a feature vector of a pixel labeled with the negative example label; extracting extract feature vectors of a plurality of pixels included in the images based on the feature representation model trained by the representation learner; estimating a distribution of feature vectors that are extracted with respect to pixels labeled with the positive example label; determining, with respect to each unlabeled pixel, a likelihood that the pixel is a positive example based on the estimated distribution and give the positive example label to pixels, a number of which is a first sample number determined in advance, in descending order of the likelihood and give the negative example label to pixels, the number of which is a second sample number determined in advance, in ascending order of the likelihood; and determining whether or not there is an unlabeled pixel among pixels of the images, and if there is an unlabeled pixel, causes repeated execution of

Assignees

Inventors

Classifications

  • Incorporation of unlabelled data, e.g. multiple instance learning [MIL] · CPC title

  • characterised by the incorporation of unlabelled data, e.g. multiple instance learning [MIL], semi-supervised techniques using expectation-maximisation [EM] or naïve labelling · CPC title

  • Matching criteria, e.g. proximity measures · CPC title

  • using neural networks · CPC title

  • Image analysis · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11989929B2 cover?
An object is to make it possible to train an image recognizer by efficiently using training data that does not include label information. A determination unit 180 causes repeated execution of the followings. A feature representation model for extracting feature vectors of pixels is trained such that an objective function is minimized, the objective function being expressed as a function that …
Who is the assignee on this patent?
Nippon Telegraph & Telephone
What technology area does this patent fall under?
Primary CPC classification G06V10/7753. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 21 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).