Score fusion and training data recycling for video classification

US9147129B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9147129-B2
Application numberUS-201213622328-A
CountryUS
Kind codeB2
Filing dateSep 18, 2012
Priority dateNov 18, 2011
Publication dateSep 29, 2015
Grant dateSep 29, 2015

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Multiple classifiers can be applied independently to evaluate images or video. Where there are heavily imbalanced class distributions, a local expert forest model for meta-level score fusion for event detection can be used. Performance variations of classifiers in different regions of a score space can be adapted. Multiple pairs of experts based on different partitions, or “trees,” can form a “forest,” balancing local adaptivity and over-fitting. Among ensemble learning methods, stacking with a meta-level classifier can be used to fuse an output of multiple base-level classifiers to generate a final score. A knowledge-transfer framework can reutilize the base-training data for learning the meta-level classifier. By recycling the knowledge obtained during a base-classifier-training stage, efficient use can be made of all available information, such as can be used to achieve better fusion and better overall performance.

First claim

Opening claim text (preview).

The claimed invention is: 1. A system, comprising: a processor circuit, including: a first data input configured to receive probability estimates from two or more separate feature classifiers over a collection of training items, those items having associated ground truth category information; and a processor-readable medium, including instructions that, when performed by the processor, configure the system to: select a fusion model to adapt local statistics of the two or more separate feature classifiers over the collection of training items; generate K partitions on each separate feature classifier to form K 2 pairs of associations; determine a maximum likelihood estimate of a pair of the K 2 pairs being the correct classifier including modelling the likelihood using a localized expert forest and using a linear model for the localized expert forest; and fuse the maximum likelihood estimates from the separate feature classifiers according to the selected fusion model to generate an output probability estimate for new items that do not have associated ground truth information. 2. The system of claim 1 , wherein the processor-readable medium includes instructions that, when performed by the processor, configure the system to fuse the probability estimates from the separate feature classifiers using weights assigned to each classifier. 3. The system of claim 2 , wherein the processor-readable medium includes instructions that, when performed by the processor, configure the system to use an objective function to fuse the probability estimates from the separate feature classifiers, the objective function comprising a minimum mean squared error fusion function with a non-negative constraint. 4. The system of claim 2 , wherein the processor-readable medium includes instructions that, when performed by the processor, configure the system to use an objective function to fuse the probability estimates from the separate feature classifiers, the objective function comprising a linear support vector machine with a non-negative constraint. 5. The system of claim 1 , wherein the first data input is configured to receive probability estimates from the two or more separate feature classifiers over a collection of training items, wherein the collection of training items comprises video clips or still images that can be categorized by an activity depicted in the video clips or still images. 6. The system of claim 1 , wherein the processor-readable medium includes instructions that, when performed by the processor, configure the system to compare the generated output probability estimate to a threshold to identify a category for new items that do not have associated ground truth information. 7. The system of claim 1 , further comprising instructions that, when performed by the processor, configure the system to select K associations of the K 2 associations with the highest determined maximum likelihood scores. 8. The system of claim 7 , further comprising instructions that, when performed by the processor, configure the system to determine a cluster center for each of the K selected associations. 9. The system of claim 8 , further comprising instructions that, when performed by the processor, configure the system to perform linear discriminant analysis to determine a one dimensional projection vector that separates pairs of cluster centers. 10. The system of claim 9 , wherein instructions for determining a one dimensional projection vector that separates a pair of cluster centers, include instructions that, when performed by the processor, configure the system to project the determined cluster centers onto a one dimensional axis corresponding to the one dimensional projection vector. 11. The system of claim 10 , wherein instructions for determining a one dimensional projection vector that separates a pair of cluster centers include instructions that, when performed by the processor, configure the system to partition the one dimensional axis into partitions based on a specified threshold. 12. A method, comprising: receiving probability estimates from two or more separate feature classifiers over a collection of training items, those items having associated ground truth category information; selecting a fusion model to adapt to the local statistics of the separate feature classifiers over the training data; generating K partitions on each separate feature classifier to form K 2 pairs of associations; determining a maximum likelihood estimate of a pair of the K 2 pairs being the correct classifier including modelling the likelihood using a localized expert forest and using a linear model for the localized expert forest; and fusing the maximum likelihood estimates from the separate feature classifiers according to the model to generate an output probability estimate for new items without associated ground truth information. 13. The method of claim 12 wherein the fusion model comprises weights assigned to each classifier. 14. The method of claim 13 wherein an objective function for fusing the local statistics comprises a minimum mean squared error fusion with non-negative constraint. 15. The method of claim 13 wherein an objective function for fusing the local statistics comprises a linear support vector machine with non-negative constraint. 16. The method of claim 12 , wherein the items are video clips and the categories denote activities depicted by the video clip. 17. The method of claim 12 , wherein fusing the probability estimates to generate an output probability estimate includes using the output probability estimate to identify a category for new items that do not have associated ground truth information.

Assignees

Inventors

Classifications

  • the supervisor being a human, e.g. interactive learning with a human teacher · CPC title

  • Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

  • of classification results, e.g. where the classifiers operate on the same input data · CPC title

  • Generating training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

  • G06N20/00Primary

    Machine learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9147129B2 cover?
Multiple classifiers can be applied independently to evaluate images or video. Where there are heavily imbalanced class distributions, a local expert forest model for meta-level score fusion for event detection can be used. Performance variations of classifiers in different regions of a score space can be adapted. Multiple pairs of experts based on different partitions, or “trees,” can form a “…
Who is the assignee on this patent?
Honeywell Int Inc
What technology area does this patent fall under?
Primary CPC classification G06N20/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 29 2015 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).