Method for extracting low-rank descriptors from images and videos for querying, classification, and object detection

US9639761B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9639761-B2
Application numberUS-201414202327-A
CountryUS
Kind codeB2
Filing dateMar 10, 2014
Priority dateMar 10, 2014
Publication dateMay 2, 2017
Grant dateMay 2, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method extracts a low-rank descriptor of a video acquired of a scene by first extracting a set of descriptors for each image in the video. The sets of descriptors for the video are aggregated to form a descriptor matrix. Iteratively, a low-rank descriptor matrix is determined from the descriptor matrix, as well as a selection matrix that associates each column in the descriptor matrix to a corresponding column in the low-rank descriptor matrix. The low-rank descriptor matrix is output upon convergence.

First claim

Opening claim text (preview).

We claim: 1. A method for querying a database of videos, comprising: extracting a set of descriptors for each image in a video of a scene; aggregating the sets of descriptors for the video to form a descriptor matrix; determining a product of a low-rank descriptor matrix and a selection matrix using an orthogonal non-negative matrix factorization of the descriptor matrix, such that each column of the selection matrix has only one nonzero entry and associates a column in the descriptor matrix to a corresponding column in the low-rank descriptor matrix; wherein the factorization comprises: determining a low-rank factor by a non-negative least squares minimization; and determining a selection matrix factor by minimizing a proximal point least squares problem and retaining a largest entry in every column of the selection matrix and setting all other entries to zero; and querying the database of videos including at least one video using the low-rank descriptor matrix to produce a search result, wherein the steps are performed in a processor. 2. The method of claim 1 , further comprising: extracting features from each image in the video; and aggregating the features to form the set of descriptors for each image in the video. 3. The method of claim 2 , wherein the features are extracted using a scale-invariant feature transform. 4. The method of claim 1 , wherein the set of descriptors for each image in the video are stacked to form a matrix X of size m×N, where m is a length of a feature vector and N is a total number of descriptors extracted from the video. 5. The method of claim 1 , wherein the rank of the low-rank descriptor matrix is less than a length of a feature vector. 6. The method of claim 1 , further comprising: determining a low-rank class descriptor of each video in the database, wherein each video in the database is associated with a class; determining a correlation coefficient between the low-rank descriptor matrix and each low-rank class descriptor; and assigning the class of the video in the database with a largest correlation coefficient to the video of the scene. 7. The method of claim 6 , wherein each video is partitioned into a group of pictures, and the determining and assigning steps are applied to the group of pictures. 8. The method of claim 1 , further comprising: determining a low-rank class descriptor of each video in the database; determining a correlation coefficient between the low-rank descriptor matrix and each low-rank class descriptor; and retrieving the videos in the database with a correlation coefficient larger than a predetermined threshold. 9. The method of claim 1 , wherein the scene includes an object, and further comprising: subtracting background pixels from each image in the video to obtain a foreground video; determining a low-rank object descriptor of the foreground video; determining a low-rank object class descriptor of each video in the database, wherein each video in the database is associated with an object class; and assigning the object class of the video in the database with a largest correlation coefficient to the foreground video. 10. The method of claim 1 , wherein the scene includes an object, and further comprising: subtracting background pixels from each image in the video to obtain a foreground video of the scene; determining a low-rank object descriptor of the foreground video of the scene; subtracting background pixels from each image of each video in the database to obtain foreground videos in the database; determining a low-rank object class descriptor of each foreground video in the database, wherein each video in the database is associated with an object class; and assigning the object class of the video in the database with a largest correlation coefficient to the foreground video of the scene.

Assignees

Inventors

Classifications

  • Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods · CPC title

  • Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames · CPC title

  • based on approximation criteria, e.g. principal component analysis · CPC title

  • Clustering; Classification · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9639761B2 cover?
A method extracts a low-rank descriptor of a video acquired of a scene by first extracting a set of descriptors for each image in the video. The sets of descriptors for the video are aggregated to form a descriptor matrix. Iteratively, a low-rank descriptor matrix is determined from the descriptor matrix, as well as a selection matrix that associates each column in the descriptor matrix to a co…
Who is the assignee on this patent?
Mitsubishi Electric Res Laboratories Inc
What technology area does this patent fall under?
Primary CPC classification G06V10/7715. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 02 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).