Multiple stage indexing of audio content

US12346373B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12346373-B2
Application numberUS-202418587628-A
CountryUS
Kind codeB2
Filing dateFeb 26, 2024
Priority dateMar 31, 2017
Publication dateJul 1, 2025
Grant dateJul 1, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques of content unification are disclosed. In some example embodiments, a computer-implemented method comprises: determining clusters based a comparison of a plurality of audio content using a first matching criteria, each cluster of the plurality of clusters comprising at least two audio content from the plurality of audio content; for each cluster of the plurality of clusters, determining a representative audio content for the cluster from the at least two audio content of the cluster; loading the corresponding representative audio content of each cluster into an index; matching the query audio content to one of the representative audio contents using a first matching criteria; determining the corresponding cluster of the matched representative audio content; and identifying a match between the query audio content and at least one of the audio content of the cluster of the matched representative audio content based on a comparison using a second matching criteria.

First claim

Opening claim text (preview).

The invention claimed is: 1. A tangible, non-transitory computer-readable storage medium, comprising instructions that, when executed, cause one or more processors to perform a set of operations comprising: generating an index comprising a plurality of representative audio content, wherein each representative audio content of the plurality of representative audio content is stored in association with a hash value, and wherein the hash value is associated with a candidate reference identifier; removing candidate reference identifiers that appear less than a threshold number of times; in response to removing the candidate reference identifiers that appear less than a threshold number of times, generating a first comparison of query audio content to each representative audio content associated with a remaining set of candidate reference identifiers, wherein the remaining set of candidate reference identifiers does not include the removed candidate reference identifiers, and wherein the first comparison is generated using a first matching criteria; and matching the query audio content to at least one representative audio content of the plurality of representative audio content based on the generated first comparison. 2. The tangible, non-transitory computer-readable storage medium of claim 1 , wherein the first comparison comprises a comparison of at least one of: (i) a content duration ratio; (ii) a bit error rate at a matching location; and (iii) or a length of matching positions. 3. The tangible, non-transitory computer-readable storage medium of claim 1 , wherein the hash value is based on permutations of a binary vector formed using a spectral representation of the representative audio content. 4. The tangible, non-transitory computer-readable storage medium of claim 1 , wherein matching the query audio content to at least one representative audio content comprises comparing fingerprint data of the query audio content with fingerprint data of each representative audio content in the index using the first matching criteria. 5. The tangible, non-transitory computer-readable storage medium of claim 1 , wherein each representative audio content of the plurality of representative audio content comprises a song. 6. The tangible, non-transitory computer-readable storage medium of claim 1 , wherein each representative audio content comprises representative audio content for a cluster. 7. The tangible, non-transitory computer-readable storage medium of claim 6 , wherein the cluster comprises at least two audio contents. 8. The tangible, non-transitory computer-readable storage medium of claim 7 , wherein the set of operations further comprises determining a corresponding cluster of the matched at least one representative audio content. 9. The tangible, non-transitory computer-readable storage medium of claim 8 , wherein the set of operation further comprises generating a second comparison of the query audio content to the least two audio contents of a corresponding cluster of the matched at least one representative audio content using a second matching criteria. 10. The tangible, non-transitory computer-readable storage medium of claim 9 , wherein the second matching criteria is different from the first matching criteria. 11. A computer-implemented method comprising: generating an index comprising a plurality of representative audio content, wherein each representative audio content of the plurality of representative audio content is stored in association with a hash value, and wherein the hash value is associated with a candidate reference identifier; removing candidate reference identifiers that appear less than a threshold number of times; in response to removing the candidate reference identifiers that appear less than a threshold number of times, generating a first comparison of query audio content to each representative audio content associated with a remaining set of candidate reference identifiers, wherein the remaining set of candidate reference identifiers does not include the removed candidate reference identifiers, and wherein the first comparison is generated using a first matching criteria; and matching the query audio content to at least one representative audio content of the plurality of representative audio content based on the generated first comparison. 12. The computer-implemented method of claim 11 , wherein the first comparison comprises a comparison of at least one of: (i) a content duration ratio; (ii) a bit error rate at a matching location; and (iii) or a length of matching positions. 13. The computer-implemented method of claim 11 , wherein the hash value is based on permutations of a binary vector formed using a spectral representation of the representative audio content. 14. The computer-implemented method of claim 11 , wherein matching the query audio content to at least one representative audio content comprises comparing fingerprint data of the query audio content with fingerprint data of each representative audio content in the index using the first matching criteria. 15. The computer-implemented method of claim 11 , wherein each representative audio content of the plurality of representative audio content comprises a song. 16. The computer-implemented method of claim 11 , wherein each representative audio content comprises representative audio content for a cluster. 17. The computer-implemented method of claim 16 , wherein the cluster comprises at least two audio contents. 18. The computer-implemented method of claim 17 , further comprising determining a corresponding cluster of the matched at least one representative audio content. 19. The computer-implemented method of claim 18 , further comprising generating a second comparison of the query audio content to the least two audio contents of a corresponding cluster of the matched at least one representative audio content using a second matching criteria, wherein the second matching criteria is different from the first matching criteria. 20. A computing device comprising: one or more processors; and a tangible, non-transitory computer-readable storage medium, comprising instructions that, when executed, cause the one or more processors to perform a set of operations comprising: generating an index comprising a plurality of representative audio content, wherein each representative audio content of the plurality of representative audio content is stored in association with a hash value, and wherein the hash value is associated with a candidate reference identifier; removing candidate reference identifiers that appear less than a threshold number of times; in response to removing the candidate reference identifiers that appear less than a threshold number of times, generating a first comparison of query audio content to each representative audio content associated with a remaining set of candidate reference identifiers, wherein the remaining set of candidate reference identifiers does not include the removed candidate reference identifiers, and wherein the first comparison is generated using a first matching criteria; and matching the query audio content to at least one representative audio content of the plurality of representative audio content based on the generated first comparison.

Assignees

Inventors

Classifications

  • Indexing; Data structures therefor; Storage structures · CPC title

  • G06F16/683Primary

    using metadata automatically derived from the content · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12346373B2 cover?
Techniques of content unification are disclosed. In some example embodiments, a computer-implemented method comprises: determining clusters based a comparison of a plurality of audio content using a first matching criteria, each cluster of the plurality of clusters comprising at least two audio content from the plurality of audio content; for each cluster of the plurality of clusters, determini…
Who is the assignee on this patent?
Gracenote Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/683. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 01 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).