Video frame annotation

US9627004B1 · US · B1

Patent metadata
Field: Value
Publication number: US-9627004-B1
Application number: US-201514883461-A
Country: US
Kind code: B1
Filing date: Oct 14, 2015
Priority date: Oct 14, 2015
Publication date: Apr 18, 2017
Grant date: Apr 18, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system and methodology provide for annotating videos with entities and associated probabilities of existence of the entities within video frames. A computer-implemented method selects an entity from a plurality of entities identifying characteristics of a video item, where the video item has associated metadata. The computer-implemented method receives probabilities of existence of the entity in video frames of the video item, and selects a video frame determined to comprise the entity responsive to determining the video frame having a probability of existence of the entity greater than zero. The computer-implemented method determines a scaling factor for the probability of existence of the entity using the metadata of the video item, and determines an adjusted probability of existence of the entity by using the scaling factor to adjust the probability of existence of the entity. The computer-implemented method labels the video frame with the adjusted probability of existence.
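The flow described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the names `FrameLabel`, `ScalingFn`, and `annotate_frames` are hypothetical, the scaling factor is assumed to be applied by multiplication, and clamping the result to 1.0 is an added assumption that the abstract does not state.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence


@dataclass
class FrameLabel:
    """Hypothetical record: one frame labeled with one entity's adjusted probability."""
    frame_index: int
    entity: str
    adjusted_probability: float


# Hypothetical signature: maps (per-frame probabilities, video metadata) to a scaling factor.
ScalingFn = Callable[[Sequence[float], Dict[str, float]], float]


def annotate_frames(
    entity: str,
    frame_probs: Sequence[float],
    metadata: Dict[str, float],
    scaling_fn: ScalingFn,
) -> List[FrameLabel]:
    """Label every frame whose probability of existence is greater than zero."""
    # The scaling factor is derived from the video item's metadata.
    scale = scaling_fn(frame_probs, metadata)
    labels: List[FrameLabel] = []
    for index, prob in enumerate(frame_probs):
        if prob <= 0.0:
            continue  # only frames with probability > 0 are selected
        # Assumption: adjust by multiplication and clamp so the result stays a probability.
        adjusted = min(1.0, prob * scale)
        labels.append(FrameLabel(index, entity, adjusted))
    return labels


# Usage with made-up numbers and a made-up "boost" metadata field.
example_labels = annotate_frames(
    "dog", [0.0, 0.2, 0.5], {"boost": 1.5}, lambda probs, md: md["boost"]
)
for label in example_labels:
    print(label)
```

Passing the scaling rule in as a function keeps the sketch neutral about how the factor is computed, since the abstract only says that it is determined from the metadata.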

First claim

Opening claim text (preview).

The invention claimed is:

1. A computer-implemented method for annotating videos with entities and associated probabilities of existence of the entities within video frames, the method comprising: selecting an entity from a plurality of entities identifying characteristics of a video item, the video item having associated metadata; receiving probabilities of existence of the entity in video frames of the video item; selecting a video frame determined to comprise the entity responsive to determining the video frame having a probability of existence of the entity greater than zero; determining a scaling factor for the probability of existence of the entity using the metadata of the video item; determining an adjusted probability of existence of the entity by using the scaling factor to adjust the probability of existence of the entity; and labeling the video frame with the adjusted probability of existence.

2. The method of claim 1, wherein the metadata comprises a centrality of the entity indicating an importance of the entity, and the step of determining a scaling factor comprises: identifying a maximum probability of existence of the entity in the video; and calculating a ratio of the centrality of the entity to the maximum probability of existence of the entity in the video as the scaling factor.

3. The method of claim 2, further comprising: multiplying the probability of existence by the scaling factor to determine an interim probability of existence; comparing the interim probability of existence to the probability of existence, and; determining the adjusted probability of existence as a greater between the interim probability of existence and the probability of existence.

4. The method of claim 1, wherein the scaling factor is based on a linear fusion model, further comprising measuring a weight vector for each metadata, the weight vector representing an influence of the metadata on the probability of existence of the entity.

5. The method of claim 4, further comprising determining a classifier comprising a set of weight vectors, the set of weight vectors including a first weight vector for retention statistics for the video item, a weight vector for video-level features of the entity for the video item, and a weight vector for frame-level features of the entity for the video item.

6. The method of claim 1, wherein the scaling factor is based on a classifier determined by a machine learning model, the method further comprising providing training data comprising a set of video items, metadata associated with the set of video items, and probabilities of existence of items associated with the set of video items to the machine learning model.

7. The method of claim 1, further comprising: receiving a search query from a user; and providing the video item including the video frame to the user in response to determining that the search query matches the entity.

8. A system comprising: a processor for executing computer program instructions; and a non-transitory computer-readable storage medium comprising computer program instructions executable by the processor, the computer program instructions comprising: selecting an entity from a plurality of entities identifying characteristics of a video item, the video item having associated metadata; receiving probabilities of existence of the entity in video frames of the video item; selecting a video frame determined to comprise the entity responsive to determining the video frame having a probability of existence of the entity greater than zero; determining a scaling factor for the probability of existence of the entity using the metadata of the video item; determining an adjusted probability of existence of the entity by using the scaling factor to adjust the probability of existence of the entity; and labeling the video frame with the adjusted probability of existence.

9. The system of claim 8, wherein the metadata comprises a centrality of the entity indicating an importance of the entity, and the step of determining a scaling factor comprises: identifying a maximum probability of existence of the entity in the video; and calculating a ratio of the centrality of the entity to the maximum probability of existence of the entity in the video as the scaling factor.

10. The system of claim 9, wherein the computer program instructions further comprises: multiplying the probability of existence by the scaling factor to determine an interim probability of existence; comparing the interim probability of existence to the probability of existence, and; determining the adjusted probability of existence as a greater between the interim probability of existence and the probability of existence.

11. The system of claim 8, wherein the scaling factor is based on a linear fusion model, and wherein the computer program instructions further comprises measuring a weight vector for each metadata, the weight vector representing an influence of the metadata on the probability of existence of the entity.

12. The system of claim 11, wherein the computer program instructions further comprises determining a classifier comprising a set of weight vectors, the set of weight vectors including a first weight vector for retention statistics for the video item, a weight vector for video-level features of the entity for the video item, and a weight vector for frame-level features of the entity for the video item.

13. The system of claim 8, wherein the scaling factor is based on a classifier determined by a machine learning model, and wherein the computer program instructions further comprises providing training data comprising a set of video items, metadata associated with the set of video items, and probabilities of existence of items associated with the set of video items to the machine learning model.

14. The system of claim 8, wherein the computer program instructions further comprises: receiving a search query from a user; and providing the video item including the video frame to the user in response to determining that the search query matches the entity.

15. A non-transitory computer-readable storage medium comprising computer program instructions executable by a processor, the computer program instructions comprising: selecting an entity from a plurality of entities identifying characteristics of a video item, the video item having associated metadata; receiving probabilities of existence of the entity in video frames of the video item; selecting a video frame determined to comprise the entity responsive to determining the video frame having a probability of existence of the entity greater than zero; determining a scaling factor for the probability of existence of the entity using the metadata of the video item; determining an adjusted probability of existence of the entity by using the scaling factor to adjust the probability of existence of the entity; and labeling the video frame with the adjusted probability of existence.

16. The non-transitory computer-readable storage medium of claim 15, wherein the metadata comprises a centrality of the entity indicating an importance of the entity, and the step of determining a scaling factor comprises: identifying a maximum probability of existence of the entity in the video; and calculating a ratio of the centrality of the entity to the maximum probability of existence of the entity in the video as the scaling factor.

17. The non-transitory computer-readable storage medium of claim 16, wherein the computer program instructions further comprises: multi
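The centrality-based scaling recited in claims 2 and 3 can be illustrated with a short sketch. The function names, the use of `None` for unselected frames, and the sample numbers are all assumptions made for illustration. The claims specify only the ratio, the multiplication, and taking the greater of the two values.

```python
from typing import List, Optional, Sequence


def centrality_scaling_factor(frame_probs: Sequence[float], centrality: float) -> float:
    """Ratio of the entity's centrality to its maximum per-frame probability (claim 2)."""
    max_prob = max(frame_probs, default=0.0)
    if max_prob <= 0.0:
        # Assumption: with no positive probability there is nothing to scale.
        raise ValueError("entity has no frame with a positive probability")
    return centrality / max_prob


def adjusted_probability(prob: float, scale: float) -> float:
    """Multiply by the scaling factor, then keep the greater value (claim 3)."""
    interim = prob * scale
    return max(interim, prob)


def adjust_video(frame_probs: Sequence[float], centrality: float) -> List[Optional[float]]:
    """Adjusted probability per frame; None marks frames that were not selected."""
    scale = centrality_scaling_factor(frame_probs, centrality)
    return [adjusted_probability(p, scale) if p > 0.0 else None for p in frame_probs]


# Made-up example: three frames and a centrality of 0.8 for the entity.
print(adjust_video([0.0, 0.2, 0.4], centrality=0.8))
# A centrality below the maximum probability gives a factor under 1,
# so taking the greater value leaves the original probabilities unchanged.
print(adjust_video([0.0, 0.2, 0.4], centrality=0.2))
```

Under this reading, the frame with the highest raw probability is lifted to the entity's centrality whenever the centrality exceeds that maximum, and no frame's probability is ever lowered.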

Assignees

Google Inc

Inventors

Classifications

  • Incorporation of unlabelled data, e.g. multiple instance learning [MIL]

  • using classification, e.g. of video objects

  • G06V20/46 (Primary)

    Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames

  • characterised by the incorporation of unlabelled data, e.g. multiple instance learning [MIL], semi-supervised techniques using expectation-maximisation [EM] or naïve labelling

  • based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9627004B1 cover?
A system and methodology provide for annotating videos with entities and associated probabilities of existence of the entities within video frames. A computer-implemented method selects an entity from a plurality of entities identifying characteristics of a video item, where the video item has associated metadata. The computer-implemented method receives probabilities of existence of the entity…
Who is the assignee on this patent?
Google Inc
What technology area does this patent fall under?
Primary CPC classification G06V20/46. Mapped technology areas include Physics.
When was this patent published?
Publication date Apr 18, 2017 (kind code B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).