Sound representation via winner-take-all coding of auditory spectra
US-9158842-B1 · Oct 13, 2015 · US
US12346373B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12346373-B2 |
| Application number | US-202418587628-A |
| Country | US |
| Kind code | B2 |
| Filing date | Feb 26, 2024 |
| Priority date | Mar 31, 2017 |
| Publication date | Jul 1, 2025 |
| Grant date | Jul 1, 2025 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Techniques of content unification are disclosed. In some example embodiments, a computer-implemented method comprises: determining clusters based a comparison of a plurality of audio content using a first matching criteria, each cluster of the plurality of clusters comprising at least two audio content from the plurality of audio content; for each cluster of the plurality of clusters, determining a representative audio content for the cluster from the at least two audio content of the cluster; loading the corresponding representative audio content of each cluster into an index; matching the query audio content to one of the representative audio contents using a first matching criteria; determining the corresponding cluster of the matched representative audio content; and identifying a match between the query audio content and at least one of the audio content of the cluster of the matched representative audio content based on a comparison using a second matching criteria.
Opening claim text (preview).
The invention claimed is: 1. A tangible, non-transitory computer-readable storage medium, comprising instructions that, when executed, cause one or more processors to perform a set of operations comprising: generating an index comprising a plurality of representative audio content, wherein each representative audio content of the plurality of representative audio content is stored in association with a hash value, and wherein the hash value is associated with a candidate reference identifier; removing candidate reference identifiers that appear less than a threshold number of times; in response to removing the candidate reference identifiers that appear less than a threshold number of times, generating a first comparison of query audio content to each representative audio content associated with a remaining set of candidate reference identifiers, wherein the remaining set of candidate reference identifiers does not include the removed candidate reference identifiers, and wherein the first comparison is generated using a first matching criteria; and matching the query audio content to at least one representative audio content of the plurality of representative audio content based on the generated first comparison. 2. The tangible, non-transitory computer-readable storage medium of claim 1 , wherein the first comparison comprises a comparison of at least one of: (i) a content duration ratio; (ii) a bit error rate at a matching location; and (iii) or a length of matching positions. 3. The tangible, non-transitory computer-readable storage medium of claim 1 , wherein the hash value is based on permutations of a binary vector formed using a spectral representation of the representative audio content. 4. The tangible, non-transitory computer-readable storage medium of claim 1 , wherein matching the query audio content to at least one representative audio content comprises comparing fingerprint data of the query audio content with fingerprint data of each representative audio content in the index using the first matching criteria. 5. The tangible, non-transitory computer-readable storage medium of claim 1 , wherein each representative audio content of the plurality of representative audio content comprises a song. 6. The tangible, non-transitory computer-readable storage medium of claim 1 , wherein each representative audio content comprises representative audio content for a cluster. 7. The tangible, non-transitory computer-readable storage medium of claim 6 , wherein the cluster comprises at least two audio contents. 8. The tangible, non-transitory computer-readable storage medium of claim 7 , wherein the set of operations further comprises determining a corresponding cluster of the matched at least one representative audio content. 9. The tangible, non-transitory computer-readable storage medium of claim 8 , wherein the set of operation further comprises generating a second comparison of the query audio content to the least two audio contents of a corresponding cluster of the matched at least one representative audio content using a second matching criteria. 10. The tangible, non-transitory computer-readable storage medium of claim 9 , wherein the second matching criteria is different from the first matching criteria. 11. A computer-implemented method comprising: generating an index comprising a plurality of representative audio content, wherein each representative audio content of the plurality of representative audio content is stored in association with a hash value, and wherein the hash value is associated with a candidate reference identifier; removing candidate reference identifiers that appear less than a threshold number of times; in response to removing the candidate reference identifiers that appear less than a threshold number of times, generating a first comparison of query audio content to each representative audio content associated with a remaining set of candidate reference identifiers, wherein the remaining set of candidate reference identifiers does not include the removed candidate reference identifiers, and wherein the first comparison is generated using a first matching criteria; and matching the query audio content to at least one representative audio content of the plurality of representative audio content based on the generated first comparison. 12. The computer-implemented method of claim 11 , wherein the first comparison comprises a comparison of at least one of: (i) a content duration ratio; (ii) a bit error rate at a matching location; and (iii) or a length of matching positions. 13. The computer-implemented method of claim 11 , wherein the hash value is based on permutations of a binary vector formed using a spectral representation of the representative audio content. 14. The computer-implemented method of claim 11 , wherein matching the query audio content to at least one representative audio content comprises comparing fingerprint data of the query audio content with fingerprint data of each representative audio content in the index using the first matching criteria. 15. The computer-implemented method of claim 11 , wherein each representative audio content of the plurality of representative audio content comprises a song. 16. The computer-implemented method of claim 11 , wherein each representative audio content comprises representative audio content for a cluster. 17. The computer-implemented method of claim 16 , wherein the cluster comprises at least two audio contents. 18. The computer-implemented method of claim 17 , further comprising determining a corresponding cluster of the matched at least one representative audio content. 19. The computer-implemented method of claim 18 , further comprising generating a second comparison of the query audio content to the least two audio contents of a corresponding cluster of the matched at least one representative audio content using a second matching criteria, wherein the second matching criteria is different from the first matching criteria. 20. A computing device comprising: one or more processors; and a tangible, non-transitory computer-readable storage medium, comprising instructions that, when executed, cause the one or more processors to perform a set of operations comprising: generating an index comprising a plurality of representative audio content, wherein each representative audio content of the plurality of representative audio content is stored in association with a hash value, and wherein the hash value is associated with a candidate reference identifier; removing candidate reference identifiers that appear less than a threshold number of times; in response to removing the candidate reference identifiers that appear less than a threshold number of times, generating a first comparison of query audio content to each representative audio content associated with a remaining set of candidate reference identifiers, wherein the remaining set of candidate reference identifiers does not include the removed candidate reference identifiers, and wherein the first comparison is generated using a first matching criteria; and matching the query audio content to at least one representative audio content of the plurality of representative audio content based on the generated first comparison.
Indexing; Data structures therefor; Storage structures · CPC title
using metadata automatically derived from the content · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.