Multimedia document summarization

US10762283B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10762283-B2
Application numberUS-201514947964-A
CountryUS
Kind codeB2
Filing dateNov 20, 2015
Priority dateNov 20, 2015
Publication dateSep 1, 2020
Grant dateSep 1, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Multimedia document summarization techniques are described. That is, given a document that includes text and a set of images, various implementations generate a summary by extracting relevant text segments in the document and relevant segments of images with constraints on the amount of text and number/size of images in the summary.

First claim

Opening claim text (preview).

What is claimed is: 1. In a digital medium environment including one or more computing devices configured to perform document summarization, a method comprising: receiving a multimedia document to be processed to generate a summary; receiving a set of parameters associated with a budget for processing the multimedia document, the set of parameters associated with the budget placing a constraint on a size of the summary and defining how many of a first content type and a second content type the summary is permitted to contain, wherein the first content type and the second content type are two different content types; responsive to a determination of the constraint on the size of the summary specified by the budget having not been met, generating the summary for the multimedia document by: determining that individual elements of the first content type and the second content type in the multimedia document are available within the budget to be added to the summary; executing an objective function for each available element that can be added to the summary within the budget, the objective function providing a measure of a contribution of each available element to coverage of information content relative to content of the multimedia document, diversity of information content relative to content of the multimedia document, and cohesion of information content relative to content of the summary; selecting an available element which maximizes a ratio of increase in the objective function to a cost of the element, wherein the cost of the element is a measure of the size of the element; and adding the selected element to the summary. 2. The method as described in claim 1 , wherein receiving the set of parameters comprises receiving user-specified parameters. 3. The method as described in claim 1 , wherein receiving the set of parameters comprises receiving parameters associated with the first content type and the second content type. 4. The method as described in claim 1 , wherein the first content type comprises text and the second content type comprises images, and wherein receiving the set of parameters comprises receiving one or more of: number of sentences the summary is to have or number of images the summary is to have. 5. The method as described in claim 1 , wherein executing the objective function comprises iteratively executing the objective function to provide the summary. 6. The method as described in claim 1 , further comprising after receiving the multimedia document, assigning gain values and cost to each available element of the first content type and the second content type in the multimedia document, wherein the gain values are a function of the coverage that elements have regarding the information content of the document and diversity of information that the elements have with respect to the summary, and a higher gain value of an element results in a higher ratio of increase in the objective function to a cost of the element. 7. The method as described in claim 6 , further comprising: updating gains of elements not in the summary; updating a residual budget for corresponding element categories; and iterating through, until the budget is exhausted: said executing an objective function for each available element; said selecting an available element which maximizes a ratio of increase; said adding the selected element to the summary; said updating gains of elements not in the summary; and said updating a residual budget. 8. In a digital medium environment including one or more computing devices to perform document summarization, a method comprising: receiving a multimedia document to be processed to generate a summary; ascertaining whether a budget is available for processing the multimedia document, the budget placing a constraint on the size of the summary and serving as a constraint on an amount of text content and image content that the summary is permitted to contain; ascertaining whether a particular element of text content or image content in the multimedia document exists within the budget available such that the constraint on the amount of text content and image content that the summary is permitted to contain has not been met; responsive to the particular element existing within the budget available, computing a value of an objective function for each available element in the multimedia document within the budget, the objective function providing a measure of a contribution of the particular element to coverage of information content relative to content of the multimedia document, diversity of information content relative to content of the multimedia document, and cohesion of information content relative to content of the summary, and being configured to compute a value for at least one text element and at least one image element in the multimedia document; responsive to said computing, selecting an element in the multimedia document whose computed value maximizes a ratio of increase in the objective function to a cost associated with the selected element in the multimedia document wherein the cost of the element is a measure of the size of the element; and adding the selected element to the summary. 9. The method as described in claim 8 , further comprising: after adding the selected element to the summary, updating a gain for each element in the multimedia document; updating the budget; ascertaining whether any budget is available for processing the multimedia document; responsive to an element existing within the budget available iterating through said computing the value, said selecting an element, said adding the selected element, said updating a gain, said updating the budget and said ascertaining whether any budget is available until no element in the multimedia document is left unused having at least a cost lower than the available budget. 10. The method as described in claim 9 , wherein the objective function includes a term that measures coverage of image parts of the document. 11. The method as described in claim 9 , wherein the objective function includes a term that measures coverage of text parts of the document and a term that measures coverage of image parts of the document. 12. The method as described in claim 9 , wherein the objective function includes a term that provides a diversity reward which measures the amount of diverse information provided in the summary for both text parts and image parts. 13. The method as described in claim 9 , wherein the objective function includes: a term that measures coverage of text parts of the document; a term that measures coverage of image parts of the document; and a term that provides a diversity reward which measures the amount of diverse information provided in the summary for both text parts and image parts. 14. In a digital medium environment having one or more computing devices to perform document summarization, a system comprising: a processor; a computer-readable storage media storing computer-readable instructions which, when executed by the processor, perform operations comprising: receiving a multimedia document to be processed to generate a summary; ascertaining whether a budget is available for processing the multimedia document, the budget placing a constraint on the size of the summary and serving as a constraint on an amount of text content and image content that the summary is permitted to contain; processing the multimedia document by: ascertaining whether a particular element of text content or image content in the multimedia document exists within the budget available such that the constraint on the amount of text

Assignees

Inventors

Classifications

  • Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound · CPC title

  • Recurrent networks, e.g. Hopfield networks · CPC title

  • Combinations of networks · CPC title

  • Auto-encoder networks; Encoder-decoder networks · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10762283B2 cover?
Multimedia document summarization techniques are described. That is, given a document that includes text and a set of images, various implementations generate a summary by extracting relevant text segments in the document and relevant segments of images with constraints on the amount of text and number/size of images in the summary.
Who is the assignee on this patent?
Adobe Inc
What technology area does this patent fall under?
Primary CPC classification G06F40/30. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 01 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).