Signature generation for multimedia deep-content-classification by a large-scale matching system and method thereof

US8990125B2 · US · B2

Patent metadata
Publication number: US-8990125-B2
Application number: US-201213682132-A
Country: US
Kind code: B2
Filing date: Nov 20, 2012
Priority date: Oct 26, 2005
Publication date: Mar 24, 2015
Grant date: Mar 24, 2015

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Content-based clustering, recognition, classification and search of high volumes of multimedia data in real-time. The embodiments disclosed herein are dedicated to real-time fast generation of signatures to high-volume of multimedia content-segments, based on relevant audio and visual signals, and to scalable matching of signatures of high-volume database of content-segments' signatures. The embodiments disclosed herein can be implemented in any applications which involve large-scale content-based clustering, recognition and classification of multimedia data, such as, content-tracking, video filtering, multimedia taxonomy generation, video fingerprinting, speech-to-text, audio classification, object recognition, video search and any other application requiring content-based signatures generation and matching for large content volumes such as, web and other large-scale databases.
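The matching and classification that the abstract describes can be pictured with a small sketch. It is illustrative only and is not the patented method: it assumes signatures are equal-length lists of 0/1 bits, scores a match by the fraction of agreeing bits, and opens a new class when no stored sample scores high enough (the claims on this page also mention creating a new class for an unmatched signal). The names `overlap`, `classify`, `min_score` and the `class_N` naming scheme are hypothetical.

```python
def overlap(sig_a, sig_b):
    """Fraction of positions where two equal-length binary signatures agree.

    Assumption: the page does not state a similarity measure; bit agreement
    is used here only as a simple stand-in.
    """
    if len(sig_a) != len(sig_b):
        raise ValueError("signatures must have the same length")
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)


def classify(signature, classes, min_score=0.75):
    """Assign a signature to the best-matching class, or open a new one.

    classes maps a class name to a list of sample signatures.
    min_score is a hypothetical cut-off for calling something a match.
    """
    best_name, best_score = None, 0.0
    for name, samples in classes.items():
        for sample in samples:
            score = overlap(signature, sample)
            if score > best_score:
                best_name, best_score = name, score

    if best_name is None or best_score < min_score:
        # No sufficiently close sample: create a new class for this signal.
        index = len(classes)
        while f"class_{index}" in classes:
            index += 1
        best_name = f"class_{index}"
        classes[best_name] = []

    # Keep the signature as a further sample of its class.
    classes[best_name].append(signature)
    return best_name


classes = {"cats": [[1, 1, 0, 0, 1, 0, 1, 0]]}
near = classify([1, 1, 0, 0, 1, 0, 1, 1], classes)  # 7 of 8 bits agree
far = classify([0, 0, 1, 1, 0, 1, 0, 1], classes)   # at most 1 of 8 bits agree
print(near, far)
```

A linear scan over every stored sample is used here for clarity; a large-scale system would need an index over the signatures, which this sketch does not attempt.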

First claim

Opening claim text (preview).

What we claim is:

1. An apparatus for generating a signature for an input signal, comprising: a processor; and a memory coupled to the processor and configured to store at least instructions causing for execution of a plurality of computational cores, wherein the instructions includes a plurality of subsets of instructions, each subset of instructions causes the execution of one computational core from the plurality of computational cores and is coded to have properties that have at least some statistical independency from other subset of instructions, thereby the properties are independent of each other computational core, wherein each subset of instructions causes its respective computational core to generate responsive of the input signal a first signature element and a second signature element, the first signature element is a robust signature; wherein a plurality of first signature elements include a first signature of the input signal and a plurality of second signature elements include a second signature of the input signal, the first signature is determined respective of a first threshold value and the second signature is determined respective of a second threshold, wherein the first threshold is higher than the second threshold.

2. The apparatus of claim 1, wherein the first signature is robust to at least one of: noise and distortion.

3. The apparatus of claim 1, wherein the properties include in part a first threshold and a second threshold, each of the plurality of computational cores generates responsive to the input signal and the first threshold the first signature element and responsive to the input signal and the second threshold the second signature element.

4. The apparatus of claim 3, wherein each of the first threshold and the second threshold is a constant threshold value of a Heaviside step function, wherein each of the subset of instructions are coded with the Heaviside step function to be computed by each respective computational core when generating the first signature element and the second signature element.

5. The apparatus of claim 3, wherein the properties are defined according to at least one random parameter.

6. The apparatus of claim 1, wherein the input signal is at least multimedia signal.

7. The apparatus of claim 6, wherein the multimedia signal is at least one of: an image, graphics, a video stream, a video clip, an audio stream, an audio clip, a video frame, text, a photograph, images of signals, and portions thereof.

8. A method for large-scale classification of multimedia signals, comprising: generating by a signature generator a set of robust signatures representing samples for at least one class of multimedia signals; for each multimedia signal to be classified performing: generating at least a signature by the signature generator; determining if the multimedia signal matches the at least one of class of multimedia signals based on the signature generated for the multimedia signal and the set of robust signatures representing samples for the at least one of class of multimedia signals; and classifying the multimedia signal to a class of the multimedia signal matching the at least one of class of multimedia signals.

9. The method of claim 8, further comprising: creating a new class of multimedia signals to include an unmatched multimedia signal if is a match does not exist.

10. The method of claim 8, wherein each robust signature of the set of robust signatures is robust to noise and distortion.

11. The method of claim 8, wherein robust signatures of the set of robust signatures and the signature of the multimedia signal are generated by a signature generator.

12. The method of claim 11, wherein the signature generator includes: a processor; and a memory coupled to the processor and configured to store at least instructions causing for execution of a plurality of computational cores, wherein the instructions includes a plurality of subsets of instructions, each subset of instructions causes the execution of one computational core from the plurality of computational cores and is coded to have properties that have at least some statistical independency from other subset of instructions, thereby the properties are independent of each other computational core, wherein each subset of instructions causes its respective computational core to generate responsive of a multimedia signal a first signature element and a second signature element, the first signature element is a robust signature; wherein a plurality of first signature elements include a first signature of the input signal and a plurality of second signature elements include a second signature of the input signal, the first signature is determined respective of a first threshold value and the second signature is determined respective of a second threshold, wherein the first threshold is higher than the second threshold.

13. The method of claim 8, wherein the multimedia signal is at least one of: an image, graphics, a video stream, a video clip, an audio stream, an audio clip, a video frame, text, a photograph, images of signals, and portions thereof.
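As a reading aid for claims 1 and 3 to 5, the two-threshold signature scheme can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the implementation disclosed in the patent: each "computational core" is modelled as an independently drawn random weight vector, its response is a plain dot product with the input, and the same two constant thresholds are shared by all cores. The names `make_cores`, `generate_signatures` and `heaviside`, and the convention that the step function returns 1 at exactly the threshold, are hypothetical choices.

```python
import random


def heaviside(value, threshold):
    """Step function with a constant threshold.

    Assumption: returns 1 when value >= threshold; the claims do not say
    how the boundary case is treated.
    """
    return 1 if value >= threshold else 0


def make_cores(num_cores, signal_len, seed=0):
    """Build hypothetical cores as independently drawn random weight vectors.

    Random parameters stand in for the 'statistical independency' of the
    cores' properties; the real properties are not described on this page.
    """
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(signal_len)]
            for _ in range(num_cores)]


def generate_signatures(signal, cores, first_threshold, second_threshold):
    """Return (first_signature, second_signature), one bit per core each."""
    if first_threshold <= second_threshold:
        raise ValueError("first threshold must be higher than the second")
    first, second = [], []
    for weights in cores:
        # Assumed core response: dot product of its weights with the signal.
        response = sum(w * s for w, s in zip(weights, signal))
        first.append(heaviside(response, first_threshold))    # robust element
        second.append(heaviside(response, second_threshold))  # second element
    return first, second


signal = [0.5, -1.0, 2.0, 0.0, 1.5, -0.5, 0.25, 1.0]
cores = make_cores(num_cores=16, signal_len=len(signal), seed=1)
first_sig, second_sig = generate_signatures(signal, cores, 1.0, 0.0)
print(first_sig)
print(second_sig)
```

Because the first threshold is higher than the second, every core that sets its bit in the first signature also sets it in the second. Under these assumptions the first signature is therefore the sparser of the two.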

Assignees

Cortica Ltd

Inventors

Classifications

  • De-duplication implemented within the file system, e.g. based on file segments (de-duplication techniques in storage systems for the management of data blocks G06F3/0641)

  • Indexing structures

  • Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually

  • Matching video sequences

  • using classification, e.g. of video objects

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US8990125B2 cover?
Content-based clustering, recognition, classification and search of high volumes of multimedia data in real-time. The embodiments disclosed herein are dedicated to real-time fast generation of signatures to high-volume of multimedia content-segments, based on relevant audio and visual signals, and to scalable matching of signatures of high-volume database of content-segments' signatures. The em…
Who is the assignee on this patent?
Cortica Ltd
What technology area does this patent fall under?
Primary CPC classification G06F16/41. Mapped technology areas include Physics.
When was this patent published?
Publication date Mar 24, 2015 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).