Systems and methods for identifying a target object in an image

US2018330198A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2018330198-A1
Application numberUS-201715594611-A
CountryUS
Kind codeA1
Filing dateMay 14, 2017
Priority dateMay 14, 2017
Publication dateNov 15, 2018
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

There is provided a method of identifying objects in an image, comprising: extracting query descriptors from the image, comparing each query descriptor with training descriptors for identifying matching training descriptors, each training descriptor is associated with a reference object identifier and with relative location data (distance and direction from a center point of a reference object indicated by the reference object identifier), computing object-regions of the digital image by clustering the query descriptors having common center points defined by the matching training descriptors, each object-region approximately bounding one target object and associated with a center point and a scale relative to a reference object size, wherein the object-regions are computed independently of the identifier of the reference object associated with the object-regions, wherein members of each cluster point toward a common center point, and classifying the target object of each object-region according to the reference object identifier of the cluster.

First claim

Opening claim text (preview).

1 . A computed implemented method of identifying a plurality of target objects in a digital image, the method comprising: receiving a digital image including a plurality of target objects; extracting a plurality of query descriptors from respective a plurality of locations in the digital image; comparing each one of said plurality of query descriptors with a plurality of training descriptors for identifying a plurality of matching training descriptors, each one of the plurality of training descriptors is associated with one of a plurality of reference object identifiers and with relative location data comprising an estimated distance and an estimated direction from a center point of a reference object indicated by the respective associated reference object identifier from the plurality of reference object identifiers; computing a plurality of object-regions of the digital image by clustering the query descriptors having common center points defined by the matching training descriptors, each object-region approximately bounding one target object of the plurality of target objects of the digital image, each object-region is associated with another common center point of said common center points and with a scale relative to a reference object size, wherein each of the plurality of object-regions is computed independently of the respective reference object identifier associated with said each of the plurality of object-regions and is computed by: aggregating the relative location data of the matching training descriptors to generate a Kernel Density Estimate (KDE) for a plurality of posterior probability maps of the center point and scale of each respective reference object of the plurality of reference object identifiers; aggregating the posterior probability maps into a plurality of probability map clusters; extracting each of the plurality of object-regions with inter-scale normalization and non-maximal suppression according to location of the common center point and the scale of each respective cluster of the plurality of probability map clusters, wherein each of the plurality of object-regions is defined according to the center point and the scale of the reference object of the plurality of reference objects associated with the respective cluster of the plurality of probability map clusters; and classifying the bound target object of each object-region of the plurality of object-regions according to the reference object identifier of a respective cluster according to a statistically significant correlation requirement between a common center point of the respective cluster and the center point of the reference object associated with the reference object identifier of the respective cluster. 2 . (canceled) 3 . The method of claim 21 , wherein the plurality of probability map clusters are each represented as an x-y-s-3D-heatmap having a center point at a location with coordinates defined by an x axis (x) and a y-axis(y) and defined by a scale (s). 4 . The method of claim 1 , wherein each one of the plurality of training descriptors is associated with an estimated scale of one level of a plurality of levels of a Gaussian pyramid, and wherein the scale of the object-region is computed based on the estimated scale of the one level of the plurality of levels of the Gaussian pyramid of the identified matching training descriptors, wherein the object-regions are computed by clustering according to the scale defined by the matching training descriptors. 5 . The method of claim 1 , further comprising: selecting, for each bound target object of each object-region, a group of candidate reference object identifiers based on a cluster of query descriptors associated with the respective object-region; computing a probability of each member of the group of candidate reference object identifiers being the respective target object; and classifying the bound target object of each object-region of the plurality of object-regions according to the member of the group with the highest computed probability. 6 . The method of claim 5 , wherein the probability of each member of the group of candidate reference object identifiers is computed based on one or more of the following components: a data fidelity value that penalizes distance between the query descriptor and the plurality of matching training descriptors, a penalty for deviation in expected spatial location between the common center point and the center point of the reference object associated with the plurality of matching training descriptors, and discrepancy in scale between the target object and the reference object associated with the plurality of matching training descriptors. 7 . The method of claim 1 , wherein the plurality of training descriptors are extracted from a training set comprising a single training sample image of each of the plurality of reference objects associated with each of the plurality of reference object identifiers. 8 . The method of claim 7 , wherein the plurality of image descriptions are extracted from each single training sample image of each of the plurality of reference objects in a sampling pattern that is denser relative to the pattern of the plurality of locations in the digital image. 9 . The method of claim 1 , wherein the plurality of training descriptors are indexed with a sub-linear search data structure, and the comparing is performed by searching for the matching training descriptor of the extracted query descriptors within the sub-linear search data structure. 10 . The method of claim 1 , wherein the extracted query descriptors and each of the plurality of training descriptors is based on scale invariant feature transform (SIFT). 11 . The method of claim 1 , wherein the comparing is performed by finding a set of Euclidean nearest neighbors of the respective extracted query descriptors, wherein each member of the set of Euclidean nearest neighbors is one of the plurality of matching training descriptors. 12 . The method of claim 11 , wherein the set of Euclidean nearest neighbors are identified for a first subset of the extracted query descriptors, wherein a second subset of extracted query descriptors are unmatched, wherein for each member of the second subset of extracted query descriptors that are unmatched, a matching training descriptor is computed such that the difference between the center point of the relative location data of the identified matching training descriptors and the center point of the relative location data of the computed training descriptor matched to the unmatched second subset of query descriptors is equal to the difference between the location relative location of the query descriptor matched to the identified matching training descriptor and the location relative location of the unmatched second subset of query descriptors for which the matching training descriptor is computed. 13 . The method of claim 12 , wherein each member of the second subset of extracted query descriptors that are unmatched is paired to a closest single already matched query descriptor of the first subset of extracted query descriptors according to a distance requirement. 14 . The method of claim 12 , further comprising iterating the computing of the matching training descriptor for each unmatched query descriptor of the second subset of query descriptors, wherein the probability of extending the matching training descriptor from nth closest members of the first subset of query descriptors is mathematically represented as (1−p)pn−1, where p denotes the probability of independently ignoring each previously matched query descriptor.

Assignees

Inventors

Classifications

  • using probabilistic graphical models from image or video features, e.g. Markov models or Bayesian networks · CPC title

  • using neural networks · CPC title

  • using classification, e.g. of video objects · CPC title

  • Classification techniques · CPC title

  • G06N20/00Primary

    Machine learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2018330198A1 cover?
There is provided a method of identifying objects in an image, comprising: extracting query descriptors from the image, comparing each query descriptor with training descriptors for identifying matching training descriptors, each training descriptor is associated with a reference object identifier and with relative location data (distance and direction from a center point of a reference object …
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06N20/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Nov 15 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).