System and method for fingerprinting datasets

US9679042B2 · US · B2

Patent metadata
Publication number: US-9679042-B2
Application number: US-201414169218-A
Country: US
Kind code: B2
Filing date: Jan 31, 2014
Priority date: Mar 28, 2012
Publication date: Jun 13, 2017
Grant date: Jun 13, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods for the matching of datasets, such as input audio segments, with known datasets in a database are disclosed. In an illustrative embodiment, the use of the presently disclosed systems and methods is described in conjunction with recognizing known network message recordings encountered during an outbound telephone call. The methodologies include creation of a ternary fingerprint bitmap to make the comparison process more efficient. Also disclosed are automated methodologies for creating the database of known datasets from a larger collection of datasets.
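
The abstract does not spell out how the ternary fingerprint bitmap is compared against an input. As a minimal sketch of one plausible reading, the code below treats a 0 entry as "don't care" and compares only the remaining positions against a 1-bit fingerprint of the input. The function name `masked_mismatch_rate`, the sign convention (+1 for a bit expected to be 0, −1 for a bit expected to be 1, following claims 9 and 10 on this page), and the mismatch-rate score are all assumptions made for illustration, not the patent's stated comparison procedure.

```python
from typing import List, Optional


def masked_mismatch_rate(
    ternary: List[List[int]], bits: List[List[int]]
) -> Optional[float]:
    """Fraction of compared positions where `bits` disagrees with `ternary`.

    Assumed convention (hypothetical, see lead-in):
      +1 -> the stored fingerprint expects bit 0
      -1 -> the stored fingerprint expects bit 1
       0 -> unreliable position, skipped entirely
    Returns None when every position is masked out.
    """
    compared = 0
    mismatched = 0
    for t_row, b_row in zip(ternary, bits):
        for t, b in zip(t_row, b_row):
            if t == 0:
                continue  # "don't care" position: never compared
            expected = 0 if t == 1 else 1
            compared += 1
            if b != expected:
                mismatched += 1
    return mismatched / compared if compared else None
```

Skipping the 0 entries means positions that were unstable when the stored fingerprint was built cannot count against a candidate, which is one way a ternary bitmap could reduce false mismatches.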

First claim

Opening claim text (preview).

What is claimed is:

1. A method for creating a weighted compressed representation of a dataset, the method comprising the steps of:
    a. computing a compressed representation of the dataset;
    b. applying a transformation to the dataset to create a transformed dataset;
    c. after step (b), computing another compressed representation of the transformed dataset;
    d. repeating steps (b) and (c) a plurality of times to create a plurality of compressed representations of the dataset; and
    e. combining the plurality of compressed representations into a weighted compressed representation of the dataset, where each weight in the weighted compressed representation is computed as a function of elements in the plurality of compressed representations.

2. The method of claim 1, wherein said compressed representation is a two-dimensional vector of first numbers.

3. The method of claim 2, wherein said first numbers are each represented with a first number of bits.

4. The method of claim 3, wherein said first number of bits is one.

5. The method of claim 1, wherein said weighted compressed representation is a two-dimensional vector of second numbers.

6. The method of claim 5, wherein said second numbers are represented with a second number of bits each.

7. The method of claim 1, wherein said function computes said each weight as a correlation coefficient of the elements in said plurality of compressed representations.

8. The method of claim 7, wherein said correlation coefficient is computed as follows:
    (1) If an element is below a first predetermined threshold in a first predetermined number of said plurality of compressed representations, set the correlation coefficient to a first value;
    (2) If an element is above a second predetermined threshold in a second predetermined number of said plurality of compressed representations, set the correlation coefficient to a second value;
    (3) Otherwise, set the correlation coefficient to a third value.

9. The method of claim 8, wherein said first value is +1, said second value is −1, and said third value is 0 (zero).

10. The method of claim 8, wherein said first predetermined threshold is 1 (one) and said second predetermined threshold is 0 (zero).

11. The method of claim 8, wherein said first predetermined number comprises all of the plurality of compressed representations and said second predetermined number comprises all of the plurality of compressed representations.

12. The method of claim 1, wherein said transformation comprises deleting a first number of elements from said dataset.

13. The method of claim 12, wherein said first number is one.
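
The steps of claim 1, with the specific choices from claims 4 and 8 to 13, can be sketched roughly as follows. This is an illustration under stated assumptions, not the patent's implementation: `toy_fingerprint` is a hypothetical stand-in for whatever compressed representation is actually used (the claims only require a two-dimensional vector of numbers), and the names `combine`, `weighted_fingerprint`, and the `rounds` parameter are invented for this example.

```python
from typing import Callable, List, Sequence

Bitmap = List[List[int]]


def toy_fingerprint(data: Sequence[float], frames: int = 4, bands: int = 4) -> Bitmap:
    """Hypothetical 1-bit representation (NOT the patent's fingerprint).

    Each frame is cut into bands + 1 parts; a bit is 1 when a part's sum
    exceeds the sum of the part that follows it.
    """
    frame_len = len(data) // frames
    part_len = frame_len // (bands + 1)
    bitmap = []
    for f in range(frames):
        frame = data[f * frame_len:(f + 1) * frame_len]
        sums = [sum(frame[p * part_len:(p + 1) * part_len]) for p in range(bands + 1)]
        bitmap.append([1 if sums[b] > sums[b + 1] else 0 for b in range(bands)])
    return bitmap


def combine(reps: List[Bitmap]) -> Bitmap:
    """Step (e), using the rule of claim 8 with the values of claims 9-11."""
    rows, cols = len(reps[0]), len(reps[0][0])
    weighted = []
    for r in range(rows):
        row = []
        for c in range(cols):
            values = [rep[r][c] for rep in reps]
            if all(v < 1 for v in values):    # below threshold 1 in all reps
                row.append(1)
            elif all(v > 0 for v in values):  # above threshold 0 in all reps
                row.append(-1)
            else:                             # element changed between reps
                row.append(0)
        weighted.append(row)
    return weighted


def weighted_fingerprint(
    data: Sequence[float],
    rounds: int = 8,
    fingerprint: Callable[[Sequence[float]], Bitmap] = toy_fingerprint,
) -> Bitmap:
    reps = [fingerprint(data)]             # step (a)
    current = list(data)
    for _ in range(rounds):                # step (d): repeat (b) and (c)
        current = current[1:]              # step (b): delete one element (claims 12-13)
        reps.append(fingerprint(current))  # step (c)
    return combine(reps)                   # step (e)
```

With 1-bit inputs, the result is ternary: +1 where a bit stayed 0 across every shifted copy, −1 where it stayed 1, and 0 where it flipped at least once. Which element gets deleted in step (b) is not specified by claims 12 and 13; dropping the first one is an assumption of this sketch.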

Assignees

Interactive Intelligence Inc, Interactive Intelligence Group Inc

Inventors

Classifications

  • by means of a mask or a bit-map · CPC title

  • Vector coding (for television signals, see H04N19/94) · CPC title

  • using metadata automatically derived from the content · CPC title

  • G06F16/284 · Primary

    Relational databases · CPC title

  • H04M3/5158 · Primary

    in combination with automated outdialling systems (devices for signalling identity of wanted subscriber H04M1/26) · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

Who is the assignee on this patent?
Interactive Intelligence Inc, Interactive Intelligence Group Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/284. Mapped technology areas include Physics.
When was this patent published?
Publication date Jun 13, 2017 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).