Pitch shift resistant audio matching
US-9052986-B1 · Jun 9, 2015 · US
US9396312B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9396312-B2 |
| Application number | US-201514808888-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jul 24, 2015 |
| Priority date | Nov 12, 2010 |
| Publication date | Jul 19, 2016 |
| Grant date | Jul 19, 2016 |
Official abstract text for this publication.
A syndication system facilitates rights management services between media content owners and media hosting services that elect to participate in the syndication system and mutually elect to participate with each other. The syndication system utilizes a content recognition system to identify hosted media content and ownership rights associated with the hosted content. By applying melody recognition, the content recognition system can identify compositions embodied in hosted media content even when these compositions do not precisely match any known sound recording. Thus, the content recognition system is beneficially able to detect, for example, recorded cover performances and recorded live performances embodied in hosted media content. Once identified, ownership information is determined and the syndication system can facilitate rights management policies associated with the content such as monetizing or blocking the protected content.
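The claims describe the melody fingerprint as a sequence of pitch intervals between dominant tones. A minimal sketch of why an interval sequence survives transposition, assuming the dominant tones have already been extracted and are expressed as MIDI note numbers (both the representation and the function name `melody_intervals` are illustrative assumptions, not taken from the patent):

```python
def melody_intervals(midi_pitches):
    """Return the successive pitch intervals, in semitones, between tones.

    Assumption: tones are given as MIDI note numbers; the patent does not
    specify a representation.
    """
    return [b - a for a, b in zip(midi_pitches, midi_pitches[1:])]


original = [60, 62, 64, 65, 67]          # C D E F G
transposed = [p + 3 for p in original]   # same tune, three semitones higher

# Both performances yield the same interval sequence.
print(melody_intervals(original))    # [2, 2, 1, 2]
print(melody_intervals(transposed))  # [2, 2, 1, 2]
```

Because only the differences between tones are kept, a cover performed in another key maps to the same sequence, which is consistent with the abstract's statement that compositions can be identified even when no known sound recording matches exactly.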
Opening claim text (preview).
The invention claimed is:

1. A computer-implemented method for facilitating a rights management service between a composition owner and a media hosting service, the method comprising: receiving from the media hosting service, a representation of an input media file, the input media file including an audio clip; processing the audio clip to extract dominant musical tones present in the audio clip; generating, by a processing device, an input melody fingerprint for the audio clip, the input melody fingerprint representing a sequence of pitch intervals between the dominant musical tones present in the processed audio clip; determining a matching reference melody fingerprint from a melody fingerprint reference database that matches the input melody fingerprint, the matching reference melody fingerprint representing a melody of a composition that is embodied in the audio clip; determining a strength of the match between the matching reference melody fingerprint and the input melody fingerprint; generating an input audio fingerprint for the audio clip representing features that uniquely represent a sound recording embodied by the audio clip; comparing the input audio fingerprint to a plurality of reference audio fingerprints in an audio fingerprint reference database; and identifying a matching reference audio fingerprint that matches the input audio fingerprint responsive to the comparing and the determined strength of the match between the matching reference melody fingerprint and the input melody fingerprint; identifying the composition owner as having ownership of the composition based on a reference sound recording associated with the determined matching reference melody fingerprint and with the identified matching reference audio fingerprint; and facilitating an ownership management policy between the media hosting service and the composition owner, the ownership management policy associated with use of the input media file by the media hosting service.

2. The computer-implemented method of claim 1, further comprising: receiving an agreement over a first online interface from the composition owner to participate in the rights management service; presenting to the media hosting service via a second online interface, an option to participate with the composition owner in the rights management service; responsive to receiving from the media hosting service via the second online interface, an affirmative response to participate with the composition owner, presenting to the composition owner via the first online interface, an option to participate with the media hosting service; and responsive to receiving from the composition owner via the first online interface, an affirmative response to participate with the media hosting service storing an indication of an agreement between the composition owner and the media hosting service.

3. The computer-implemented method of claim 1, wherein facilitating an ownership management policy comprises: determining an ownership policy associated with use of the composition by the media hosting service; and responsive to the ownership policy indicating a block on the composition, causing the media hosting service to block access to the input media file.

4. The computer-implemented method of claim 1, further comprising: determining an ownership policy associated with use of the composition by the media hosting service; and responsive to the ownership policy indicating monetization of the composition, allocating a portion of revenue associated with use of the input media file by the media hosting service to the composition owner.

5. The computer-implemented method of claim 1, further comprising: generating an input audio fingerprint for the audio clip representing features that uniquely represent a sound recording embodied by the audio clip; determining a matching reference audio fingerprint from an audio fingerprint reference database that matches the input audio fingerprint; identifying a sound recording owner as having ownership of the sound recording based on metadata associated with the matching reference audio fingerprint; and facilitating an ownership management policy between the media hosting service and the sound recording owner, the ownership management policy associated with use of the input media file by the media hosting service.

6. The computer-implemented method of claim 1, wherein determining a matching reference melody fingerprint comprises: dividing the input melody fingerprint into a plurality of bands determined using locality-sensitive hashing (LSH); using each band as an index into the melody fingerprint reference database to identify a set of candidate reference fingerprints; and comparing the input melody fingerprint to the candidate reference fingerprints in the set to identify the matching reference melody fingerprint.

7. The computer-implemented method of claim 1, wherein the input melody fingerprint is invariant to parameters used to perform or record a performance of the composition that is embodied in the audio clip.

8. A non-transitory computer-readable storage medium storing instructions for facilitating a rights management service between a composition owner and a media hosting service, the instructions when executed by a processor causing the processor to perform steps comprising: receiving from the media hosting service, a representation of an input media file, the input media file including an audio clip; processing the audio clip to extract dominant musical tones present in the audio clip; generating an input melody fingerprint for the audio clip, the input melody fingerprint representing a sequence of pitch intervals between the dominant musical tones present in the processed audio clip; determining a matching reference melody fingerprint from a melody fingerprint reference database that matches the input melody fingerprint, the matching reference melody fingerprint representing a melody of a composition that is embodied in the audio clip; determining a strength of the match between the matching reference melody fingerprint and the input melody fingerprint; generating an input audio fingerprint for the audio clip representing features that uniquely represent a sound recording embodied by the audio clip; comparing the input audio fingerprint to a plurality of reference audio fingerprints in an audio fingerprint reference database; and identifying a matching reference audio fingerprint that matches the input audio fingerprint responsive to the comparing and the determined strength of the match between the matching reference melody fingerprint and the input melody fingerprint; identifying the composition owner as having ownership of the composition based on a reference sound recording associated with the determined matching reference melody fingerprint and with the identified matching reference audio fingerprint; and facilitating an ownership management policy between the media hosting service and the composition owner, the ownership management policy associated with use of the input media file by the media hosting service.

9. The non-transitory computer-readable storage medium of claim 8, further comprising instructions for: receiving an agreement over a first online interface from the composition owner to participate in the rights management service; presenting to the media hosting service via a second online interface, an option to participate with the composition owner in the rights management service; responsive to receiving from the media hosting service via the second online interface, an affirmative response to participate with the composition owner, presenting to the composition owner via the first online interfa
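Claim 6 recites a band-based candidate lookup followed by a full comparison. The following is a rough sketch of that two-stage shape, not the patented implementation: it is a simplified stand-in for LSH in which each fixed-size band of the interval sequence is used directly as a dictionary key. The band size, the similarity measure, the reference data, and every function name here are hypothetical.

```python
from collections import defaultdict

BAND_SIZE = 4  # hypothetical band width; the claim does not give one


def bands(fingerprint, band_size=BAND_SIZE):
    """Split a fingerprint into (band_position, band_values) keys."""
    return [
        (i // band_size, tuple(fingerprint[i:i + band_size]))
        for i in range(0, len(fingerprint) - band_size + 1, band_size)
    ]


def build_index(references):
    """Map every band key to the ids of the references that contain it."""
    index = defaultdict(set)
    for ref_id, fingerprint in references.items():
        for key in bands(fingerprint):
            index[key].add(ref_id)
    return index


def similarity(a, b):
    """Fraction of aligned positions that agree (an assumed match strength)."""
    n = min(len(a), len(b))
    return sum(x == y for x, y in zip(a, b)) / n if n else 0.0


def best_match(query, references, index):
    """Gather candidates that share a band, then compare full fingerprints."""
    candidates = set()
    for key in bands(query):
        candidates |= index.get(key, set())
    if not candidates:
        return None, 0.0
    best = max(sorted(candidates),
               key=lambda ref_id: similarity(query, references[ref_id]))
    return best, similarity(query, references[best])


# Made-up reference fingerprints (interval sequences in semitones).
references = {
    "song_a": [2, 2, 1, 2, 2, 2, 1, -1],
    "song_b": [-2, -2, 4, 1, 0, 0, 3, -3],
}
index = build_index(references)
query = [2, 2, 1, 2, 0, 5, 1, -1]  # shares its first band with song_a
print(best_match(query, references, index))  # ('song_a', 0.75)
```

Only references that share at least one band with the query are compared in full, which keeps the expensive comparison off most of the database. The returned score plays the role of the "strength of the match" that the independent claims feed into the audio fingerprint comparison.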
Collaborative creation, e.g. joint development of products or services · CPC title
by using information signals recorded by the same method as the main recording {(G11B27/22 takes precedence)} · CPC title
for managing network security; network security policies in general (filtering policies H04L63/0227) · CPC title
Physics · mapped topic
Protecting distributed programs or content, e.g. vending or licensing of copyrighted material (protection in video systems or pay television H04N7/16) {; Digital rights management [DRM]} · CPC title