Musical analysis platform

US9804818B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9804818-B2
Application numberUS-201514871897-A
CountryUS
Kind codeB2
Filing dateSep 30, 2015
Priority dateSep 30, 2015
Publication dateOct 31, 2017
Grant dateOct 31, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A platform or system is disclosed for performing musical analysis to detect musical properties in received live or pre-recorded audio data. The analysis can include a synchronous analysis for generating estimated one or more transitory musical properties and an asynchronous analysis for generating one or more aggregate musical properties which can be applied to the transitory musical properties to generate confirmed musical properties, which can be stored as metadata associated with an audio file. In some cases, live audio data can be received, recorded, dynamically analyzed to provide realtime metadata (e.g., to a display), then the realtime metadata can be analyzed to provide confirmed, updated, or validated metadata. In some cases, initial analysis (e.g., dynamic analysis) can determine chord estimates, usable in further analysis (e.g., offline analysis) to estimate a musical key, which can then be applied to the chord estimates to determine the most likely chord estimates and determine chord progressions.

First claim

Opening claim text (preview).

What is claimed is: 1. A system, comprising: one or more data processors; and a non-transitory computer-readable storage medium containing instructions which, when executed on the one or more data processors, cause the one or more data processors to perform operations including: receiving incoming audio data; detecting one or more transients in the audio data; splitting the audio data into a plurality of audio segments, wherein splitting the audio data includes using the one or more transients; processing the plurality of audio segments by a frequency analyzer to identify one or more peak frequencies in one of the plurality of audio segments; selecting a target frequency of an audio segment by a target frequency selector from the one or more peak frequencies, wherein the target frequency corresponds to a reference note having a reference frequency; processing the audio segment to determine a tuning offset including calculating a difference in frequency between the target frequency and the reference frequency; and applying the tuning offset using a frequency adjustor when the audio data includes untuned audio data, wherein applying the tuning offset includes applying the tuning offset to the plurality of audio segments. 2. The system of claim 1 , wherein selecting the target frequency includes: determining correlation scores between each of the one or more peak frequencies and each of one or more reference notes, wherein determining a correlation score includes determining a likelihood that a peak frequency corresponds to a reference note; and choosing the target frequency form the one or more peak frequencies as having the highest of the correlation scores. 3. The system of claim 1 , wherein selecting the target frequency includes: choosing a target frequency from the one or more peak frequencies; determining correlation scores between the target frequency and each of a plurality of reference notes, wherein determining a correlation score includes determining a likelihood that the target frequency corresponds to a reference note; and selecting the reference note from the plurality of references notes as having the highest of the correlation scores. 4. The system of claim 1 , wherein selecting the target frequency further includes identifying that the target frequency falls within a preset frequency window, and wherein applying the tuning offset includes adjusting one or more benchmark frequency values when the one or more benchmark frequency values falls within the preset frequency window. 5. The system of claim 4 , wherein the operations further include: determining an additional tuning offset for an additional target frequency that falls within an additional preset frequency window, wherein the additional preset frequency window corresponds to notes that do not fall within the preset frequency window, and wherein applying the tuning offset includes adjusting an additional one or more benchmark frequency values when the additional one or more benchmark frequency values falls within the additional preset frequency window. 6. The system of claim 1 , wherein the operations further include: selecting an additional frequency from one or more additional peak frequencies of the one of the plurality of audio segments or another of the plurality of audio segments, wherein the selected additional frequency corresponds to an additional reference note; determining an additional tuning offset by calculating a difference in frequency between the selected additional frequency and an additional reference frequency of the additional reference note; and updating the tuning offset using the additional tuning offset. 7. The system of claim 1 , wherein applying the tuning offset includes adjusting one or more benchmark frequency values, and wherein the operations further include: using the one or more adjusted benchmark frequency values to estimate one or more semitones present in the audio data; and determining a musical property of the audio data using the one or more estimated semitones. 8. The system of claim 7 , wherein the musical property is a chord transcription of the audio data corresponding to one or more chords present across a duration of the audio data, wherein determining the musical property includes determining a set of chord candidates having one or more notes matching the one or more estimated semitones. 9. The system of claim 7 , wherein determining the musical property includes determining a confidence score using the one or more semitones and wherein the confidence score is influenced by a difference between the one or more semitones and one or more expected values. 10. The system of claim 1 , wherein applying the tuning offset including adjusting one or more benchmark frequency values or adjusting one or more frequencies of the plurality of audio segments. 11. A computer-program product tangibly embodied in a non-transitory machine-readable storage medium, including instructions configured to cause a data processing apparatus to perform operations including: receiving incoming audio data; detecting one or more transients in the audio data; splitting the audio data into a plurality of audio segments, wherein splitting the audio data includes using the one or more transients; processing the plurality of audio segments by a frequency analyzer to identify one or more peak frequencies in one of the plurality of audio segments; selecting a target frequency of an audio segment by a target frequency selector from the one or more peak frequencies, wherein the target frequency corresponds to a reference note having a reference frequency; processing the audio segment to determine a tuning offset including calculating a difference in frequency between the target frequency and the reference frequency; and applying the tuning offset using a frequency adjustor when the audio data includes untuned audio data, wherein applying the tuning offset includes applying the tuning offset to the plurality of audio segments. 12. The computer-program product of claim 11 , wherein selecting the target frequency includes: determining correlation scores between each of the one or more peak frequencies and each of one or more reference notes, wherein determining a correlation score includes determining a likelihood that a peak frequency corresponds to a reference note; and choosing the target frequency form the one or more peak frequencies as having the highest of the correlation scores. 13. The computer-program product of claim 11 , wherein selecting the target frequency includes: choosing a target frequency from the one or more peak frequencies; determining correlation scores between the target frequency and each of a plurality of reference notes, wherein determining a correlation score includes determining a likelihood that the target frequency corresponds to a reference note; and selecting the reference note from the plurality of references notes as having the highest of the correlation scores. 14. The computer-program product of claim 11 , wherein selecting the target frequency further includes identifying that the target frequency falls within a preset frequency window, and wherein applying the tuning offset includes adjusting one or more benchmark frequency values when the one or more benchmark frequency values falls within the preset frequency window. 15. The computer-program product of claim 14 , wherein the operations further include: determining an additional tuning offset for an additional target frequency that falls within an additional preset frequency window, wherein the additional preset

Assignees

Inventors

Classifications

  • G10G1/00Primary

    Means for the representation of music · CPC title

  • G06F3/165Primary

    Management of the audio stream, e.g. setting of volume, audio stream path · CPC title

  • for automatic key or tonality recognition, e.g. using musical rules or a knowledge base · CPC title

  • Chord detection and/or recognition, e.g. for correction, or automatic bass generation · CPC title

  • for extraction of timing, tempo; Beat detection · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9804818B2 cover?
A platform or system is disclosed for performing musical analysis to detect musical properties in received live or pre-recorded audio data. The analysis can include a synchronous analysis for generating estimated one or more transitory musical properties and an asynchronous analysis for generating one or more aggregate musical properties which can be applied to the transitory musical properties…
Who is the assignee on this patent?
Apple Inc
What technology area does this patent fall under?
Primary CPC classification G10G1/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Oct 31 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).