Voice data transmission method and apparatus
US-2024363120-A1 · Oct 31, 2024 · US
US9704495B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9704495-B2 |
| Application number | US-201314380297-A |
| Country | US |
| Kind code | B2 |
| Filing date | Feb 11, 2013 |
| Priority date | Feb 21, 2012 |
| Publication date | Jul 11, 2017 |
| Grant date | Jul 11, 2017 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A system and method for detection of sound of interest amongst plurality of other dynamically varying sounds is disclosed. In one embodiment, a spectrum detector identifies dominant spectrum energy frequency by detecting the dominant spectrum energy band present in spectrum of sound energy. A modified mel filter bank is designed by revising spectral positioning of the first mel filter bank and the second mel filter bank according to the identified dominant frequency. A feature extractor extracts the features from first mel filter bank, second mel filter bank and the modified mel filter bank which are further classified in order to detect the sound of interest.
Opening claim text (preview).
We claim: 1. A system for detection of a sound of interest amongst a plurality of dynamically varying sounds, the system comprising: a spectrum detector to identify a dominant frequency by detecting a dominant spectrum energy band present in a spectrum of sound energy of dynamically varying sounds; a first mel filter bank and a second mel filter bank that each comprises mel filters that filter a frequency band of the sound energy for detecting the sound of interest; a modified mel filter bank modified according to the dominant frequency includes a revised spectral positioning of the first mel filter bank ranging from the dominant frequency to a maximum frequency and the second mel filter bank ranging from a minimum frequency to the dominant frequency for detection of the dynamically varying sound of interest; a feature extractor, coupled with the modified mel filter bank, to extract a plurality of spectral characteristics of sound received from the modified filter bank; and a classifier to classify the plurality of spectral characteristics of the sound according to the dominant frequency to detect the sound of interest. 2. The system as claimed in claim 1 , wherein the second mel filter bank is an inverse of the first mel filter bank. 3. The system as claimed in claim 1 , wherein the classifier includes a Gaussian Mixture Model (GMM) to classify the spectral characteristics of the sound of interest. 4. The system as claimed in claim 1 , wherein the dynamically varying sounds includes a horn sound in an automobile. 5. The system as claimed in claim 1 , wherein the system further comprises: a fuser to fuse the features extracted from the first mel filter bank, the second mel filter bank, and the modified mel filter bank to provide a performance evaluation of the system. 6. A method for detection of a sound of interest amongst a plurality of dynamically varying sounds, the method comprising steps of: identifying a dominant frequency present in a spectrum of sound energy; modifying a mel filter bank according to the dominant frequency by revising a spectral position of a first mel filter bank ranging from the dominant s frequency to the maximum frequency and a second mel filter bank ranging from the minimum frequency to the dominant frequency for detection of a dynamically varying sound of interest; extracting a plurality of spectral characteristic of a sound received from the modified filter bank; and classifying the plurality of spectral characteristics of the sound to detect the sound of interest according to the dominant frequency, wherein the identifying, the modifying, the extracting, and the classifying are performed by a processor by executing programmed instructions stored in a memory coupled with said processor. 7. The method as claimed in claim 6 , wherein the dominant frequency includes a frequency of band with maximum energy in the energy spectrum of the sound of interest. 8. The method as claimed in claim 6 , wherein the method further comprises: fusing, by the processor, the features extracted from the first mel filter bank, the second mel filter bank, and the modified mel filter bank in order to provide a performance evaluation while detecting the sound of interest. 9. A non-transitory computer-readable medium storing instructions that, when executed by a processor, cause the processor to perform a method, the method comprising steps of: identifying a dominant frequency present in a spectrum of sound energy; modifying a mel filter bank according to the dominant frequency by revising a spectral position of a first mel filter bank ranging from the dominant frequency to the maximum frequency and a second mel filter bank ranging from the minimum frequency to the dominant frequency for detection of a dynamically varying sound of interest; extracting a plurality of spectral characteristic of a sound received from the modified filter bank; and classifying the plurality of spectral characteristics of the sound to detect the sound of interest according to the dominant frequency. 10. The non-transitory computer-readable medium as claimed in claim 9 , wherein the dominant frequency includes a frequency of band with maximum energy in the energy spectrum of the sound of interest. 11. The non-transitory computer-readable medium as claimed in claim 9 , wherein the method further comprises: fusing the features extracted from the first mel filter bank, the second mel filter bank, and the modified mel filter bank in order to provide a performance evaluation while detecting the sound of interest.
Related publications grouped by family.
Answers are generated from the same data shown on this page.