Extraction and matching of characteristic fingerprints from audio signals
US-9208790-B2 · Dec 8, 2015 · US
US9529907B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9529907-B2 |
| Application number | US-201213732108-A |
| Country | US |
| Kind code | B2 |
| Filing date | Dec 31, 2012 |
| Priority date | Dec 31, 2012 |
| Publication date | Dec 27, 2016 |
| Grant date | Dec 27, 2016 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A matching system receives probe audio samples for comparison to references of a data store. Comparisons are generated to determine a sufficient match for a portion or a first amount of the probe sample. Ranking scores are assigned to the resulting match references. The match references are retained, unless meeting a score threshold. Comparisons are continually generated with second amounts of the probe sample and the retained references are updated with further matching references assigned ranking scores. The retained results are merged and determined to satisfy a score threshold for release as outputted results for matching references.
Opening claim text (preview).
What is claimed is: 1. A method, comprising: using a processor to execute computer executable instructions stored on a non-transitory computer readable medium to perform operations comprising: receiving, from an audio streaming system, a probe audio sample; comparing a first time segment of the probe audio sample to corresponding time segments of a plurality of reference audio samples to identify a plurality of sufficiently matching reference audio samples based upon a first set of consistency scores generated between one or more feature vectors of the first time segment of the probe audio sample and corresponding feature vectors of the first time segment of each of the reference audio samples; determining that the sufficiently matching reference audio samples do not meet a predetermined score threshold; retaining the sufficiently matching reference audio samples; assigning respective ranking scores to the sufficiently matching reference audio samples based on the first set of consistency scores; comparing a second time segment of the probe audio sample to corresponding time segments of the sufficiently matching reference audio samples to identify a plurality of additional matching reference audio samples based upon a second set of consistency scores generated between one or more features vectors of the second time segment of the probe audio sample and corresponding feature vectors of the second time segment of each of the sufficiently matching reference audio samples; assigning respective ranking scores to the additional matching reference audio samples based on the second set of consistency scores; separately weighting the first set of ranking scores and the second set of ranking scores; and outputting at least one of the reference audio samples based on the weighted first and second set of ranking scores. 2. The method of claim 1 , further comprising outputting ones of the matching reference audio samples that have respective assigned ranking scores that exceed a defined ranking threshold. 3. The method of claim 2 , further comprising, after a defined amount of time of outputting ones of the matching reference audio samples that have respective ranking scores that exceed the defined ranking threshold, outputting the plurality of matching reference audio samples with respective assigned ranking scores. 4. The method of claim 3 , wherein the defined amount of time is based on an amount of the probe audio sample received. 5. The method of claim 1 , wherein the comparing the first time segment of the probe audio sample to the corresponding time segment of the reference audio samples to identify the at least one matching reference audio sample comprises: generating a fingerprint of the first time segment of the probe audio sample; and identifying respective fingerprints of the corresponding time segment of the reference audio samples that match the fingerprint of the first time segment of the probe audio sample by at least a defined matching threshold. 6. The method of claim 1 , wherein the respective ranking scores are generated based on at least one from the group consisting of: respective availability measures indicating whether the respective matching reference audio samples are individually available for sale; respective popularity measures indicating respective popularities of the matching reference audio samples; respective compilation measures indicating whether the respective matching reference audio samples are from an original album or a compilation album; and respective sales measures indicating respective sales amounts of the matching reference audio samples. 7. The method of claim 6 , wherein the respective ranking scores comprise a sequence of bit vectors comprising a set of components that each comprise a set of bits occupying most to least significant bit positions based on an importance of a corresponding measure. 8. The method of claim 6 , wherein the respective popularity measures are generated from heuristics associated with at least one parameter related to the respective matching reference audio samples. 9. A system, comprising: a non-transitory memory that stores computer executable components; and a microprocessor that executes the following computer executable components stored in the memory, the computer executable components comprising: a receiving component that receives, from an audio streaming service, a probe audio sample; a comparing component that compares a first time segment of the probe audio sample to corresponding time segments of a plurality of reference audio samples; a matching component that identifies a plurality of sufficiently matching reference audio samples based upon a first set of consistency scores generated between one or more feature vectors of the first time segment of the probe audio sample and corresponding feature vectors of the first time segment of each of the reference audio samples; a retaining component that determines that the sufficiently matching reference audio samples do not meet a predetermined score threshold and retains the sufficiently matching reference audio samples; a ranking component that assigns respective ranking scores to the sufficiently matching reference audio samples based on the first set of consistency scores; and wherein the comparing component is further configured to compare a second time segment of the probe audio sample to corresponding time segments of the sufficiently matching reference audio samples to identify a plurality of additional matching reference audio samples based upon a second set of consistency scores generated between one or more feature vectors of the second time segment of the probe audio sample and corresponding feature vectors of the second time segment of each of the sufficiently matching reference audio samples; wherein the ranking component is further configured to assign respective ranking scores to the additional matching reference audio samples based on the second set of consistency scores; and a scoring component configured to separately weight the first set of ranking scores and the second set of ranking scores, wherein the scoring component is further configured to output at least one of the reference audio samples based on the weighted first and second set of ranking scores. 10. The system of claim 9 , wherein the second time segment includes the first time segment of the audio streaming content. 11. The system of claim 9 , wherein the scoring component dynamically determines which matching reference audio samples to output based on the score comparison. 12. The system of claim 9 , wherein the retaining component retains the matching reference audio samples based on a set of defined parameters that include at least one of a time for comparison operations, or an amount of the audio streaming content compared. 13. The system of claim 9 , wherein the respective ranking scores are generated based on at least one from the group consisting of: whether the respective matching reference audio samples are individually available for sale, respective popularities of the matching reference audio samples, whether the respective matching reference audio samples are from an original album or a compilation album, and respective sales amounts of the matching reference audio samples. 14. The system of claim 9 , further comprising: a merging component that merges the matching reference audio sample with at least one additional matching reference audio sample having second ranking scores from a sufficient match with a second time segment of the audio streaming content. 15. A non-transitory
Physics · mapped topic
for retrieval · CPC title
Physics · mapped topic
Presentation of query results · CPC title
Query by example, e.g. query by humming · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.