Retrieving visual media

US9229958B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9229958-B2
Application numberUS-201114239387-A
CountryUS
Kind codeB2
Filing dateSep 27, 2011
Priority dateSep 27, 2011
Publication dateJan 5, 2016
Grant dateJan 5, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Examples of the present disclosure may include methods, systems, and computer readable media with executable instructions. An example method for retrieving visual media can include receiving a text query associated with a target content. A first group of visual media is identified based on correspondence of metadata of the visual media with the text query, and keyframes from the first group of identified visual media are selected. The method further includes detecting instances of a content type in the selected keyframes, and grouping similar instances of the content type into clusters. The target content is associated with a cluster having a greatest quantity of similar instances.

First claim

Opening claim text (preview).

What is claimed: 1. A method for retrieving visual media, comprising: receiving, using a processor, a text query associated with a target content; identifying, using the processor, a first group of visual media based on correspondence of metadata of the visual media with the text query; selecting, using the processor, keyframes from the first group of identified visual media; detecting, using the processor, instances of a content type in the selected keyframes; grouping, using the processor, similar instances of the content type into clusters; associating, using the processor, the target content with a cluster having a greatest quantity of similar instances; indexing, using the processor, locations within the visual media based on content type in the selected keyframes; and repurposing, using the processor, images directly from the indexed locations within the visual media into a new customized visual media product. 2. The method of claim 1 , wherein: receiving, using the processor, a text query associated with a target content includes receiving a name, the target content being one or more images of a person with the name; and detecting, using the processor, instances of a content type in the selected keyframes includes detecting face images. 3. The method of claim 1 , further comprising: selecting, using the processor, instances of a content type from each cluster having a predetermined threshold quantity of similar instances; and displaying, using the processor, the instances of a content type for each of the clusters having the threshold quantity of similar instances in an order corresponding to cluster size, wherein cluster size corresponds to quantity of similar instances of the content type in the cluster. 4. The method of claim 3 , wherein selecting and displaying the instances of a content type for each of the clusters includes respectively selecting and displaying images of faces appearing in one or more video clips. 5. The method of claim 4 , wherein displaying images of faces appearing in one or more video clips includes displaying a largest face image and displaying a face image most different from a selected face image. 6. The method of claim 3 , further comprising: receiving, using the processor, a user selection of at least one of the displayed instances; identifying, using the processor, a second group of visual media based on correspondence of metadata of the visual media with the text query and the selected instances of a content type; selecting, using the processor, second keyframes from the second group of identified visual media; detecting, using the processor, second instances of a content type in the selected second keyframes; grouping, using the processor, similar second instances of the content type into clusters; and determining, using the processor, matches between the second instances of the content type and the selected instances of a content type. 7. The method of claim 6 , further comprising: determining, using the processor, a ranking score for visual media having at least one determined match based on cumulative time during which the selected instances of a content type appear; and displaying, using the processor, a listing of the visual media based on ranking score. 8. The method of claim 7 , further comprising creating, using the processor, an index of the visual media based on ranking score and a quantity of occurrences that the visual media is viewed, the index including a location of the visual media and one or more locations within the visual media at which a particular instance of a content type appears. 9. The method of claim 7 , further comprising: analyzing, using the processor, metadata of most-viewed visual media and a name associated with each respective most-viewed visual media prior to receiving the text query; generating, using the processor, a re-ranked result corresponding to the name; caching, using the processor, the re-ranked result; and updating, using the processor, the cached re-ranked result responsive to new text queries. 10. A non-transitory computer-readable medium having computer-executable instructions stored thereon, the computer-executable instructions comprising instructions that, if executed by one or more processors, cause the one or more processors to: retrieve a group of visual media based on correspondence of metadata of the visual media with a text query; select keyframes from the group of retrieved visual media; apply face clustering to the keyframes from the group of retrieved visual media; generate query face images based on the face clusters; re-rank the group of retrieved visual media on a display based on a received input corresponding to a particular one of the query face images; index locations within the group of visual media based on the face clusters; and repurpose images directly from the indexed locations within the group of visual media into a new customized visual media product. 11. The non-transitory computer-readable medium of claim 10 , further comprising instructions that, if executed by one or more processors, cause the one or more processors to indicate portions of a selected one of the group of retrieved visual media corresponding to a selected query face image. 12. A computing system, comprising: a display; a non-transitory computer-readable medium having computer-executable instructions stored thereon; and a processor coupled to the display and the non-transitory computer-readable medium, wherein the processor executes the instructions to: retrieve a group of visual media based on correspondence of metadata of the visual media with a text query; select keyframes from the group of retrieved visual media; apply face clustering to the keyframes from the group of retrieved visual media; generate query face images based on face clusters; re-rank the group of retrieved visual media on the display based on a received input corresponding to a particular one of the query face images; index locations within the group of visual media based on the face clustering; and repurpose images directly from the indexed locations within the group of visual media into a new customized visual media product. 13. The computing system of claim 12 , wherein the processor executes the instructions to display portions of a particular retrieved visual media based on a received input corresponding to a particular one of the query face images.

Assignees

Inventors

Classifications

  • Physics · mapped topic

  • Physics · mapped topic

  • Physics · mapped topic

  • using original textual content or text extracted from visual content or transcript of audio data · CPC title

  • Classification, e.g. identification · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9229958B2 cover?
Examples of the present disclosure may include methods, systems, and computer readable media with executable instructions. An example method for retrieving visual media can include receiving a text query associated with a target content. A first group of visual media is identified based on correspondence of metadata of the visual media with the text query, and keyframes from the first group of …
Who is the assignee on this patent?
Zhang Tong, Liu ke-yan, Sun xin-yun, and 1 more
What technology area does this patent fall under?
Primary CPC classification G06F17/30268. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 05 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).