Modified mel filter bank structure using spectral characteristics for sound analysis

US9704495B2 · US · B2

Patent metadata
Field               Value
Publication number  US-9704495-B2
Application number  US-201314380297-A
Country             US
Kind code           B2
Filing date         Feb 11, 2013
Priority date       Feb 21, 2012
Publication date    Jul 11, 2017
Grant date          Jul 11, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system and method for detection of sound of interest amongst plurality of other dynamically varying sounds is disclosed. In one embodiment, a spectrum detector identifies dominant spectrum energy frequency by detecting the dominant spectrum energy band present in spectrum of sound energy. A modified mel filter bank is designed by revising spectral positioning of the first mel filter bank and the second mel filter bank according to the identified dominant frequency. A feature extractor extracts the features from first mel filter bank, second mel filter bank and the modified mel filter bank which are further classified in order to detect the sound of interest.
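The spectrum-detector step described in the abstract can be illustrated with a minimal sketch. It assumes the dominant frequency is the peak bin inside the fixed-width band that carries the most energy; the function names, the band count, and the naive DFT are illustrative choices, not details taken from the patent.

```python
import cmath
import math

def power_spectrum(frame):
    """Naive O(n^2) DFT power spectrum for bins 0..n//2 (illustrative only)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum

def dominant_frequency(frame, sample_rate, num_bands=16):
    """Find the band with the most energy, then return its peak-bin frequency in Hz.

    num_bands is a hypothetical parameter; the source does not specify a band layout.
    """
    spectrum = power_spectrum(frame)
    bins_per_band = max(1, len(spectrum) // num_bands)
    best_start, best_energy = 0, -1.0
    for start in range(0, len(spectrum), bins_per_band):
        energy = sum(spectrum[start:start + bins_per_band])
        if energy > best_energy:
            best_start, best_energy = start, energy
    end = min(best_start + bins_per_band, len(spectrum))
    peak_bin = max(range(best_start, end), key=lambda k: spectrum[k])
    return peak_bin * sample_rate / len(frame)

# Synthetic example: a 1 kHz tone sampled at 8 kHz (256 samples)
sample_rate = 8000
frame = [math.sin(2 * math.pi * 1000 * t / sample_rate) for t in range(256)]
f_dom = dominant_frequency(frame, sample_rate)
```

For this synthetic tone the detected value falls on the 1 kHz bin. A practical system would more likely use an FFT library and average the spectrum over several frames.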

First claim

Opening claim text (preview).

We claim:

1. A system for detection of a sound of interest amongst a plurality of dynamically varying sounds, the system comprising: a spectrum detector to identify a dominant frequency by detecting a dominant spectrum energy band present in a spectrum of sound energy of dynamically varying sounds; a first mel filter bank and a second mel filter bank that each comprises mel filters that filter a frequency band of the sound energy for detecting the sound of interest; a modified mel filter bank modified according to the dominant frequency includes a revised spectral positioning of the first mel filter bank ranging from the dominant frequency to a maximum frequency and the second mel filter bank ranging from a minimum frequency to the dominant frequency for detection of the dynamically varying sound of interest; a feature extractor, coupled with the modified mel filter bank, to extract a plurality of spectral characteristics of sound received from the modified filter bank; and a classifier to classify the plurality of spectral characteristics of the sound according to the dominant frequency to detect the sound of interest.

2. The system as claimed in claim 1, wherein the second mel filter bank is an inverse of the first mel filter bank.

3. The system as claimed in claim 1, wherein the classifier includes a Gaussian Mixture Model (GMM) to classify the spectral characteristics of the sound of interest.

4. The system as claimed in claim 1, wherein the dynamically varying sounds includes a horn sound in an automobile.

5. The system as claimed in claim 1, wherein the system further comprises: a fuser to fuse the features extracted from the first mel filter bank, the second mel filter bank, and the modified mel filter bank to provide a performance evaluation of the system.

6. A method for detection of a sound of interest amongst a plurality of dynamically varying sounds, the method comprising steps of: identifying a dominant frequency present in a spectrum of sound energy; modifying a mel filter bank according to the dominant frequency by revising a spectral position of a first mel filter bank ranging from the dominant frequency to the maximum frequency and a second mel filter bank ranging from the minimum frequency to the dominant frequency for detection of a dynamically varying sound of interest; extracting a plurality of spectral characteristic of a sound received from the modified filter bank; and classifying the plurality of spectral characteristics of the sound to detect the sound of interest according to the dominant frequency, wherein the identifying, the modifying, the extracting, and the classifying are performed by a processor by executing programmed instructions stored in a memory coupled with said processor.

7. The method as claimed in claim 6, wherein the dominant frequency includes a frequency of band with maximum energy in the energy spectrum of the sound of interest.

8. The method as claimed in claim 6, wherein the method further comprises: fusing, by the processor, the features extracted from the first mel filter bank, the second mel filter bank, and the modified mel filter bank in order to provide a performance evaluation while detecting the sound of interest.

9. A non-transitory computer-readable medium storing instructions that, when executed by a processor, cause the processor to perform a method, the method comprising steps of: identifying a dominant frequency present in a spectrum of sound energy; modifying a mel filter bank according to the dominant frequency by revising a spectral position of a first mel filter bank ranging from the dominant frequency to the maximum frequency and a second mel filter bank ranging from the minimum frequency to the dominant frequency for detection of a dynamically varying sound of interest; extracting a plurality of spectral characteristic of a sound received from the modified filter bank; and classifying the plurality of spectral characteristics of the sound to detect the sound of interest according to the dominant frequency.

10. The non-transitory computer-readable medium as claimed in claim 9, wherein the dominant frequency includes a frequency of band with maximum energy in the energy spectrum of the sound of interest.

11. The non-transitory computer-readable medium as claimed in claim 9, wherein the method further comprises: fusing the features extracted from the first mel filter bank, the second mel filter bank, and the modified mel filter bank in order to provide a performance evaluation while detecting the sound of interest.
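One possible reading of the modified filter bank in claims 1, 2, and 6 is sketched below: a mel-spaced bank covers the range from the dominant frequency up to the maximum frequency, and a mirrored ("inverse") mel bank covers the range from the minimum frequency up to the dominant frequency, so that filters are densest on either side of the dominant frequency. The mel formula, the mirroring construction, the filter counts, and all function names are assumptions made for illustration; the claims do not fix any of them.

```python
import math

def hz_to_mel(f):
    """A commonly used mel mapping (assumed here; the claims give no formula)."""
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_edges(f_low, f_high, num_filters):
    """Edge frequencies for num_filters triangles, mel-spaced (densest near f_low)."""
    m_low, m_high = hz_to_mel(f_low), hz_to_mel(f_high)
    step = (m_high - m_low) / (num_filters + 1)
    return [mel_to_hz(m_low + i * step) for i in range(num_filters + 2)]

def inverse_mel_edges(f_low, f_high, num_filters):
    """Mirror image of mel_edges over the same range (densest near f_high)."""
    return sorted(f_low + f_high - f for f in mel_edges(f_low, f_high, num_filters))

def triangular_weights(edges, freqs):
    """One row of triangular weights per filter, evaluated at the bin frequencies."""
    bank = []
    for lo, mid, hi in zip(edges, edges[1:], edges[2:]):
        row = []
        for f in freqs:
            if lo < f <= mid:
                row.append((f - lo) / (mid - lo))
            elif mid < f < hi:
                row.append((hi - f) / (hi - mid))
            else:
                row.append(0.0)
        bank.append(row)
    return bank

def log_energies(bank, power):
    """Log filter-bank energies; a small floor avoids log(0)."""
    return [math.log(sum(w * p for w, p in zip(row, power)) + 1e-10)
            for row in bank]

# Hypothetical settings: dominant frequency at 1 kHz, analysis range 0 to 4 kHz
f_min, f_dom, f_max = 0.0, 1000.0, 4000.0
below = inverse_mel_edges(f_min, f_dom, 5)  # second (inverse) bank: f_min to f_dom
above = mel_edges(f_dom, f_max, 5)          # first (mel) bank: f_dom to f_max

freqs = [k * 31.25 for k in range(129)]     # bin frequencies for 256 samples at 8 kHz
power = [1.0] * len(freqs)                  # flat placeholder power spectrum
bank = triangular_weights(below, freqs) + triangular_weights(above, freqs)
features = log_energies(bank, power)        # one value per filter (10 in total)
```

In this sketch the filter spacing narrows toward 1 kHz from below and widens away from it above. The resulting feature vector would then be passed to a classifier such as the GMM named in claim 3, which is not shown here.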

Assignees

Tata Consultancy Services Ltd

Inventors

Classifications

  • G10L25/51 (Primary)

    for comparison or discrimination · CPC title

  • the extracted parameters being spectral information of each sub-band · CPC title

  • G10L19/02 (Primary)

    using spectral analysis, e.g. transform vocoders or subband vocoders · CPC title

Patent family

Related publications grouped by family.

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9704495B2 cover?
A system and method for detection of sound of interest amongst plurality of other dynamically varying sounds is disclosed. In one embodiment, a spectrum detector identifies dominant spectrum energy frequency by detecting the dominant spectrum energy band present in spectrum of sound energy. A modified mel filter bank is designed by revising spectral positioning of the first mel filter bank and …
Who is the assignee on this patent?
Tata Consultancy Services Ltd
What technology area does this patent fall under?
Primary CPC classification G10L25/51. Mapped technology areas include Physics.
When was this patent published?
Publication date: Jul 11, 2017 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).