Systems and methods for sound recognition

US9280598B2 · US · B2

Patent metadata
Publication number: US-9280598-B2
Application number: US-77375310-A
Country: US
Kind code: B2
Filing date: May 4, 2010
Priority date: May 4, 2010
Publication date: Mar 8, 2016
Grant date: Mar 8, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods for recognizing sounds are provided herein. User input relating to one or more sounds is received from a computing device. Instructions, which are stored in memory, are executed by a processor to discriminate the one or more sounds, extract music features from the one or more sounds, analyze the music features using one or more databases, and obtain information regarding the music features based on the analysis. Further, information regarding the music features of the one or more sounds may be transmitted to display on the computing device.
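
The abstract's "extract music features, analyze the music features using one or more databases, and obtain information" steps can be illustrated with a small sketch. The abstract does not say which features are used, so this example swaps in Parsons code (a melodic contour of Up/Down/Repeat steps), a well-known representation for query by humming. The function names, the `MELODY_DB` contents, and the use of `difflib` similarity are all assumptions made for illustration, and the sketch assumes pitches have already been estimated from the recording.

```python
from difflib import SequenceMatcher


def parsons_code(pitches):
    """Encode a pitch sequence as a contour string: U (up), D (down), R (repeat)."""
    steps = []
    for prev, cur in zip(pitches, pitches[1:]):
        if cur > prev:
            steps.append("U")
        elif cur < prev:
            steps.append("D")
        else:
            steps.append("R")
    return "".join(steps)


# Hypothetical database mapping a stored contour to an information record.
# Both entries are invented placeholders, not data from the patent.
MELODY_DB = {
    "RURURD": {"title": "Example Song A", "artist": "Example Artist A"},
    "DDDUUU": {"title": "Example Song B", "artist": "Example Artist B"},
}


def similarity(a, b):
    """Similarity ratio in [0, 1] between two contour strings."""
    return SequenceMatcher(None, a, b).ratio()


def analyze(contour, database):
    """Return (info, score) for the stored contour most similar to the query."""
    best = max(database, key=lambda stored: similarity(contour, stored))
    return database[best], similarity(contour, best)


def recognize(pitches):
    """Extract features, analyze them against the database, and return information."""
    contour = parsons_code(pitches)
    info, score = analyze(contour, MELODY_DB)
    return {"contour": contour, "score": score, **info}


if __name__ == "__main__":
    # MIDI-style note numbers standing in for an already pitch-tracked hummed query.
    print(recognize([60, 60, 67, 67, 69, 69, 67]))
```

A contour is deliberately coarse: it tolerates a singer who is off-key as long as the direction of each step is right, which is why it is a common stand-in for hummed input. A production system would need far richer features and indexing than this.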

First claim

Opening claim text (preview).

What is claimed is:

1. A method for recognizing one or more sounds on a server computer, the method comprising: receiving on the server computer a user input from a computing device, the user input providing a search query comprising the one or more sounds, the one or more sounds having been recorded on the computing device using at least one microphone; executing instructions stored in memory on the server computer, wherein execution of the instructions by a processor on the server computer: provides an available set of databases, each database comprising a sound type and associated data, the available set of databases comprising a first database for recorded audio and second database for speech, discriminates the one or more sounds by classifying the one or more sounds according to a sound type and routing the one or more sounds to a corresponding sound recognition application for processing based on the sound type, the corresponding sound recognition application being selected from a first sound recognition application for recorded audio and second sound recognition application for speech, extracts features from the one or more sounds, analyzes the extracted features using a database selected from the available set of databases based on the sound type, and obtains information regarding the extracted features based on the analysis; and in response to the search query, transmitting for display the information regarding the extracted features of the one or more sounds to the computing device.

2. The method of claim 1, wherein the method discriminates the one or more sounds by classifying the one or more sounds of the search query as at least one of a singing or humming sound, a recorded music sound, and a speech sound.

3. The method of claim 1, wherein the one or more sounds includes any of a song, a song clip, a song snippet, a singing or humming sound, voice, spoken words or any combination thereof.

4. The method of claim 1, wherein information regarding the extracted features of the one or more sounds comprises a song title, a name of an artist, an artist's biographical information, identification of similar artists, a link to download a song, a link to download a video related to the song, or any combination thereof.

5. The method of claim 1, further comprising executing instructions stored in memory, wherein execution of the instructions by a processor separates sound artifacts from the sound by noise cancellation.

6. The method of claim 1, further comprising: executing instructions stored in memory, wherein execution of the instructions by a processor determines whether the extracted features of the one or more sounds are already included in a local music database of the computing device; and transmitting to display a flag on the display of the computing device, the flag depicting the determination of whether the extracted features of the one or more sounds are already included in a local music database of the computing device.

7. The method of claim 1, further comprising: executing instructions stored in memory, wherein execution of the instructions by a processor provides relevant content relating to the extracted features; and transmitting to display a premium row on the display of the computing device, the premium row depicting the relevant content relating to the extracted features.

8. A method for recognizing one or more sounds on a server computer, the method comprising: receiving on the server computer a user input from a computing device, the user input providing a search query comprising the one or more sounds, the one or more sounds having been recorded on the computing device using at least one microphone; executing instructions stored in memory on the server computer, wherein execution of the instructions by a processor on the server computer: provides an available set of sound recognition applications, the available set of sound recognition applications comprising a first sound recognition application for recorded audio and second sound recognition application for speech; provides an available set of databases, each database comprising a sound type and associated data, the available set of databases comprising a first database for recorded audio and second database for speech; discriminates the one or more sounds by classifying the one or more sounds according to a sound type and routing the one or more sounds to a corresponding sound recognition application for processing based on the sound type, the corresponding sound recognition application being selected from the available set of sound recognition applications, analyzes a databases for searching based on the sound type using data included in the discriminated one or more sounds, the database being selected from the available set of databases, and obtains information regarding the discriminated one or more sounds based on the analysis; and in response to the search query, transmitting for display the information regarding the discriminated one or more sounds to the display of the computing device.

9. A non-transitory computer readable storage medium having embodied thereon a program, the program being executable by a processor to perform a method for recognizing one or more sounds, the method comprising: receiving on a server computer a user input from a computing device, the user input providing a search query comprising the one or more sounds, the one or more sounds having been recorded on the computing device using at least one microphone; providing an available set of sound recognition applications on the server computer, the available set of sound recognition applications comprising a first sound recognition application for recorded audio and second sound recognition application for speech; providing an available set of databases on the server computer, each database comprising a sound type and associated data, the available set of databases comprising a first database for recorded audio and second database for speech; discriminating the one or more sounds on the server computer by classifying the sound according to a sound type and routing the sound to a corresponding sound recognition application for processing based on the sound type, the corresponding sound recognition application being selected from of the available set of sound recognition applications; analyzing a database on the server computer based on the sound type using data included in the discriminated sound, the database being selected from the available set of databases; obtaining information regarding the discriminated one or more sounds, from the database, based on the analysis; and in response to the search query, transmitting for display the information regarding the discriminated one or more sounds to the computing device.

10. The non-transitory computer readable storage medium of claim 9, wherein discriminating the one or more sounds further comprises classifying the one or more sounds of the search query as at least one of a singing or humming sound, a recorded music sound, and a speech sound.

11. The non-transitory computer readable storage medium of claim 9, wherein the one or more sounds includes any of a song, a song clip, a song snippet, a humming sound, voice, or any combination thereof.

12. The non-transitory computer readable storage medium of claim 9, wherein information regarding the discriminated one or more sounds comprises a song title, a name of an artist, an artist's biographical information, identification of similar artists, a link to download a song, a link to download a video related to the song, or any combination thereof.

13. The non-transitory compu
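
Claims 1, 8, and 9 share one structural idea: the server classifies the incoming sound by type, routes it to the recognition application for that type, and searches the database paired with that type. A minimal sketch of that dispatch, assuming a dictionary-based query and placeholder data, might look like the following. Every name here (`Route`, `ROUTES`, `classify`, `handle_search_query`, the query keys, and the database entries) is hypothetical, and the classifier is a trivial stand-in for whatever audio analysis a real system would perform.

```python
from typing import Callable, Dict, NamedTuple


class Route(NamedTuple):
    """Pairs a recognition application with the database for one sound type."""
    application: Callable[[dict, dict], dict]
    database: dict


def recognize_recorded_audio(query: dict, database: dict) -> dict:
    # Hypothetical: look up a precomputed audio fingerprint.
    return database.get(query["fingerprint"], {"match": None})


def recognize_speech(query: dict, database: dict) -> dict:
    # Hypothetical: look up a key derived from the spoken words.
    return database.get(query["transcript_key"], {"match": None})


# "Available set" of applications and databases, keyed by sound type.
# The entries are invented placeholders for illustration only.
ROUTES: Dict[str, Route] = {
    "recorded_audio": Route(
        recognize_recorded_audio,
        {"fp-001": {"title": "Example Song", "artist": "Example Artist"}},
    ),
    "speech": Route(
        recognize_speech,
        {"play example": {"intent": "play", "target": "example"}},
    ),
}


def classify(query: dict) -> str:
    """Stand-in classifier; a real system would inspect the audio signal itself."""
    return "speech" if "transcript_key" in query else "recorded_audio"


def handle_search_query(query: dict) -> dict:
    """Classify the sound, route it, and return the information to display."""
    sound_type = classify(query)
    route = ROUTES[sound_type]
    info = route.application(query, route.database)
    return {"sound_type": sound_type, "info": info}


if __name__ == "__main__":
    print(handle_search_query({"fingerprint": "fp-001"}))
    print(handle_search_query({"transcript_key": "play example"}))
```

Keeping the application and its database together in one table entry mirrors the claim language, where both are selected "based on the sound type"; adding a third sound type (for example, singing or humming, as in claims 2 and 10) would be one more entry in `ROUTES`.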

Assignees

Inventors

Classifications

  • Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually · CPC title

  • G06F16/683 · Primary

    using metadata automatically derived from the content · CPC title

  • Query by example, e.g. query by humming · CPC title

  • for comparison or discrimination · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9280598B2 cover?
Systems and methods for recognizing sounds are provided herein. User input relating to one or more sounds is received from a computing device. Instructions, which are stored in memory, are executed by a processor to discriminate the one or more sounds, extract music features from the one or more sounds, analyze the music features using one or more databases, and obtain information regarding the…
Who is the assignee on this patent?
Master Aaron Steven, Stonehocker Timothy P, Levitt Benjamin John, and 3 more
What technology area does this patent fall under?
Primary CPC classification G06F16/683. Mapped technology areas include Physics.
When was this patent published?
Publication date: Mar 8, 2016 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).