Topical based media content summarization system and method

US2016299968A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2016299968-A1
Application numberUS-201514682654-A
CountryUS
Kind codeA1
Filing dateApr 9, 2015
Priority dateApr 9, 2015
Publication dateOct 13, 2016
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed herein is an automated approach for summarizing media content using descriptive information associated with the media content. For example and without limitation, the descriptive information may comprise a title associated with the media content. One or more segments of the media content may be identified to form a media content summary based on each segment's respective similarity to the descriptive information, which respective similarity may be determined using a media content and auxiliary data feature spaces. A shared dictionary of canonical patterns generated using the media content and auxiliary data feature spaces may be used in determining a media content segment's similarity to the descriptive information.

First claim

Opening claim text (preview).

1 . A method comprising: obtaining, using at least one computing device, a plurality of items of auxiliary data using descriptive information associated with a media content item, the media content item comprising a plurality of units; generating, using the at least one computing device, a media content item feature space and an auxiliary data feature space; identifying, using the at least one computing device, a plurality of segments of the media content item, each segment comprising at least one unit of the media content item's plurality of units; scoring, using the at least one computing device, each segment of the plurality of segments of the media content items using the media content item feature space and the auxiliary data feature space, each segment's score representing a measure of similarity of the segment to the descriptive information; identifying, using the at least one computing device, at least one segment of the plurality of segments of the media content item as more similar to the descriptive information relative to others of the plurality of segments using the scoring of the plurality of segments; and generating, using the at least one computing device, a media content item summary comprising the at least one segment of the plurality identified as being more similar to the descriptive information. 2 . The method of claim 1 , the descriptive information comprising a title of the media content item. 3 . The method of claim 1 , the plurality of canonical patterns appear in both the media content item and the auxiliary data. 4 . The method of claim 1 , scoring further comprising: determining, for each segment, a unit-level score for each of the at least one unit of the segment using the media content item feature space and the auxiliary data feature space, each unit level score representing a measure of similarity of a respective one of the at least one unit to the descriptive information; and scoring each segment using the unit-level score determined for each of the at least one unit of the segment. 5 . The method of claim 4 , each segment's score comprising an average unit-level score determined using each unit-level score determined for each of the at least one unit of the segment. 6 . The method of claim 4 , the media content item is a video content item, each unit of the media content item is a frame and each segment comprises at least one frame. 7 . The method of claim 1 , generating a media content item feature space and an auxiliary data feature space further comprising: generating, by the at least one computing device, the media content item feature space by generating, for each of the plurality of units of the media content item, a plurality of feature descriptor values, each feature descriptor value corresponding to a feature of a set of features; and generating, by the at least one computing device, the auxiliary data feature space by generating a plurality of feature descriptor values for each item of auxiliary data, each feature descriptor value corresponding to a feature of the set of features used in generating the plurality of feature descriptor values for each unit of the media content item. 8 . The method of claim 1 , the scoring further comprising: determining, using the at least one computing device, the media content item feature space and the auxiliary data feature space, a shared dictionary comprising a plurality of canonical patterns shared by the media content item and the auxiliary data; and scoring, using the at least one computing device, the plurality of segments of the media content item, for each segment of the plurality of segments, the scoring comprising determining a measure of similarity of the segment to the descriptive information using the shared dictionary. 9 . The method of claim 8 , the determining further comprising: determining a first set of coefficients for use with the shared dictionary in approximating the plurality of feature descriptor values of each unit of the plurality of units of the media content item; determining a second set of coefficients for use with the shared dictionary in approximating the plurality of feature descriptor values of each item of the auxiliary data; and determining third and fourth sets of coefficients, the third set of coefficients for use with the plurality of feature descriptor values of each unit of the plurality of units and the fourth set of coefficients for use with the plurality of feature descriptor values of each item of the plurality of items of auxiliary data in approximating the shared dictionary. 10 . The method of claim 9 , the scoring further comprising: scoring each unit of the plurality of units of media content item, the scoring comprising determining a unit-level score for each unit representing the unit's measure of similarity to the media content item's descriptive information using a plurality of coefficients from the first and third sets of coefficient. 11 . The method of claim 8 , the determining further comprising: learning the shared dictionary's canonical patterns such that each unit of the plurality of units of the media content item and each item of the plurality of items of the auxiliary data is independently approximated by a combination of the plurality of canonical patterns of the shared dictionary and such that each canonical pattern of the plurality of canonical patterns of the shared dictionary is jointly approximated by a combination of the plurality of units of the media content item and the plurality of items of auxiliary data. 12 . The method of claim 11 , each unit of the media content item and each item of the plurality of items of the auxiliary data is independently approximated by a convex combination of the plurality of canonical patterns of the shared dictionary and each canonical pattern of the plurality of canonical patterns of the shared dictionary is jointly approximated by a convex combination of the plurality of units of the media content item and the plurality of items of auxiliary data. 13 . A system comprising: at least one computing device, each computing device comprising one or more processors and a storage medium for tangibly storing thereon program logic for execution by the processor, the stored program logic comprising: obtaining logic executed by the one or more processors for obtaining a plurality of items of auxiliary data using descriptive information associated with a media content item, the media content item comprising a plurality of units; generating logic executed by the one or more processors for generating a media content item feature space and an auxiliary data feature space; identifying logic executed by the one or more processors for identifying a plurality of segments of the media content item, each segment comprising at least one unit of the media content item's plurality of units; scoring logic executed by the one or more processors for scoring each segment of the plurality of segments of the media content items using the media content item feature space and the auxiliary data feature space, each segment's score representing a measure of similarity of the segment to the descriptive information; identifying logic executed by the one or more processors for identifying at least one segment of the plurality of segments of the media content item as more similar to the descriptive information relative to others of the plurality of segments using the scoring of the plurality of segments; and generating logic executed by the one or more processors for generating a media content item summary comprising the at least one segment of the plurality identified as being mor

Assignees

Inventors

Classifications

  • Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually · CPC title

  • G06F16/739Primary

    in form of a video summary, e.g. the video summary being a video sequence, a composite still image or having synthesized frames · CPC title

  • Summarisation for human users · CPC title

  • Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016299968A1 cover?
Disclosed herein is an automated approach for summarizing media content using descriptive information associated with the media content. For example and without limitation, the descriptive information may comprise a title associated with the media content. One or more segments of the media content may be identified to form a media content summary based on each segment's respective similarity to…
Who is the assignee on this patent?
Yahoo Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/739. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Oct 13 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).