Learning device, search device, learning method, search method, learning program, and search program

US11790635B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11790635-B2
Application numberUS-201917619239-A
CountryUS
Kind codeB2
Filing dateJun 17, 2019
Priority dateJun 17, 2019
Publication dateOct 17, 2023
Grant dateOct 17, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A retrieval apparatus includes a first retrieval unit, a second retrieval unit, and an integration unit that calculates an integrated similarity by integrating a first similarity calculated by the first retrieval unit and a second similarity calculated by the second retrieval unit. For similarities between a basic image, an image similar to the basic image, and an image dissimilar to the basic image of the reference images, at least the feature extraction of the first retrieval unit is learned such that a margin based on a second similarity between the basic image and the similar image and a second similarity between the basic image and the dissimilar image increases as the second similarity between the basic image and the dissimilar image increases relative to the second similarity between the basic image and the similar image.

First claim

Opening claim text (preview).

The invention claimed is: 1. A retrieval apparatus comprising circuitry configured to execute operations comprising: receiving a query image to be retrieve; calculating a first similarity as a similarity between feature vectors, the first similarity being calculated between a feature vector extracted from the query image through learned feature extraction for outputting the feature vector and a feature vector extracted from reference images labeled by the feature extraction; calculating a second similarity as a similarity determined using information on features, based on information on a feature different from the feature vector of the query image and information on features of the reference images; and calculating an integrated similarity by integrating the first similarity and the second similarity, wherein for similarities between a basic image, an image similar to the basic image, and a dissimilar image that is dissimilar to the basic image of the reference images, at least the feature extraction is learned such that a margin based on a second similarity between the basic image and the similar image and a second similarity between the basic image and the dissimilar image increases as the second similarity between the basic image and the dissimilar image increases relative to the second similarity between the basic image and the similar image. 2. A learning apparatus comprising circuitry configured to execute operations comprising: calculating a second similarity for a combination of reference images by using information on features different from feature vectors of labeled reference images, the second similarity being calculated as a similarity determined using the information on the features, the reference images including a basic image as a reference of the labeling, a similar image that is the reference image similar to the basic image, and a dissimilar image that is the reference image dissimilar to the basic image; and updating a parameter of a neural network such that a margin increases as the second similarity between the basic image and the dissimilar image increases relative to a second similarity between the basic image and the similar image, the parameter being updated by using a loss function including a first similarity between a feature vector of the basic image and a feature vector of the similar image, a first similarity between the feature vector of the basic image and a feature vector of the dissimilar image, and the margin based on the second similarity between the basic image and the similar image and the second similarity between the basic image and the dissimilar image, the feature vectors being outputted from the neural network for receiving a predetermined image and outputting the feature vectors. 3. The learning apparatus according to claim 2 , wherein for a set of the basic image, the similar image, and the dissimilar image, the circuitry further configured to execute a method comprising: updating the parameter of the neural network according to the loss function by using only the set of the three images if the second similarity between the basic image and the dissimilar image is at least a threshold value relative to the second similarity between the basic image and the similar image and the margin is at least a threshold value. 4. A computer-implemented retrieval method for retrieving, comprising: receiving a query image to be retrieved; calculating a first similarity as a similarity between feature vectors, the first similarity being calculated between a feature vector extracted from the query image through learned feature extraction for outputting the feature vector and a feature vector extracted from reference images labeled by the feature extraction; calculating a second similarity as a similarity determined using information on features, based on information on a feature different from the feature vector of the query image and information on the features of the reference images; and calculating an integrated similarity by integrating the calculated first similarity and the calculated second similarity, wherein for similarities between a basic image, an image similar to the basic image, and a dissimilar image that is dissimilar to the basic image of the reference images, at least the feature extraction is learned such that a margin based on a second similarity between the basic image and the similar image and a second similarity between the basic image and the dissimilar image increases as the second similarity between the basic image and the dissimilar image increases relative to the second similarity between the basic image and the similar image. 5. The retrieval apparatus according to claim 1 , the circuitry further configured to execute a method comprising: determining an identity of the query image based on the first similarity between the feature vector extracted from a region in the query image and another feature vector extracted from another region that corresponds to the region in a reference image, wherein the feature vector indicates a local feature amount of the query image. 6. The retrieval apparatus according to claim 1 , wherein the first similarity is based on a cosine similarity between the feature vector extracted from the query image through learned feature extraction and the feature vector extracted from a reference image of the reference images labeled by a feature extraction. 7. The retrieval apparatus according to claim 1 , wherein the second similarity is based on a similarity between a first local feature amount associated with a first feature associated with the feature vector extracted from the query image through learned feature extraction and a second local feature amount associated with a second feature associated with the feature vector extracted from the reference images labeled by a feature extraction. 8. The learning apparatus according to claim 2 , wherein the first similarity is based on a cosine similarity between a first feature vector associated with the basic image and a second feature vector associated with a reference image. 9. The learning apparatus according to claim 2 , wherein the second similarity is based on a similarity between a first local feature amount associated with a first feature associated with the feature vector for the basic image and a second local feature amount associated with a second feature for a reference image. 10. The computer-implemented method according to claim 4 , the method further comprising: determining an identity of the query image based on the first similarity between the feature vector extracted from a region in the query image and another feature vector extracted from another region that corresponds to the region in a reference image, wherein the feature vector indicates a local feature amount of the query image. 11. The computer-implemented method according to claim 4 , wherein the first similarity is based on a cosine similarity between the feature vector extracted from the query image through learned feature extraction and the feature vector extracted from a reference image labeled by a feature extraction. 12. The computer-implemented method according to claim 4 , wherein the second similarity is based on a similarity between a first local feature amount associated with a first feature associated with the feature vector extracted from the query image through learned feature extraction and a second local feature amount associated with a second feature associated with the feature vector extracted from the reference images labeled by the feature extraction.

Assignees

Inventors

Classifications

  • G06V10/761Primary

    Proximity, similarity or dissimilarity measures · CPC title

  • G06F16/532Primary

    Query formulation, e.g. graphical querying · CPC title

  • having vectorial format · CPC title

  • Extraction of image or video features · CPC title

  • Active pattern-learning, e.g. online learning of image or video features · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11790635B2 cover?
A retrieval apparatus includes a first retrieval unit, a second retrieval unit, and an integration unit that calculates an integrated similarity by integrating a first similarity calculated by the first retrieval unit and a second similarity calculated by the second retrieval unit. For similarities between a basic image, an image similar to the basic image, and an image dissimilar to the basic …
Who is the assignee on this patent?
Nippon Telegraph & Telephone
What technology area does this patent fall under?
Primary CPC classification G06V10/761. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Oct 17 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 10 related publications on this page (citations in our corpus or others sharing the same primary CPC).