Method and apparatus for sound event detection robust to frequency change

US10540988B2 · US · B2

Patent metadata
Publication number: US-10540988-B2
Application number: US-201816196356-A
Country: US
Kind code: B2
Filing date: Nov 20, 2018
Priority date: Mar 15, 2018
Publication date: Jan 21, 2020
Grant date: Jan 21, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Disclosed is a sound event detecting method including receiving an audio signal, transforming the audio signal into a two-dimensional (2D) signal, extracting a feature map by training a convolutional neural network (CNN) using the 2D signal, pooling the feature map based on a frequency, and determining whether a sound event occurs with respect to each of at least one time interval based on a result of the pooling.
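
The transform and frequency-pooling steps named in the abstract can be illustrated with a minimal NumPy sketch. This is not the patented implementation: the function names `stft_magnitude` and `pool_over_frequency`, the frame sizes, and the choice of max pooling are assumptions, and a random array stands in for the CNN feature map.

```python
import numpy as np

def stft_magnitude(signal, frame_len=256, hop=128):
    """Turn a 1D audio signal into a 2D (frequency x time) magnitude array."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([signal[i * hop : i * hop + frame_len] * window
                       for i in range(n_frames)])
    # rfft over each frame yields frame_len // 2 + 1 frequency bins.
    return np.abs(np.fft.rfft(frames, axis=1)).T

def pool_over_frequency(feature_map):
    """Collapse the frequency axis of a (channels, freq, time) feature map.

    Max pooling is an assumption; the source only says that pooling is
    performed "based on a frequency".
    """
    return feature_map.max(axis=1)  # shape: (channels, time)

# Hypothetical input: one second of a 440 Hz tone sampled at 8 kHz.
sr = 8000
t = np.arange(sr) / sr
spec = stft_magnitude(np.sin(2 * np.pi * 440 * t))

# Stand-in for a CNN feature map: 4 channels on the same freq/time grid.
rng = np.random.default_rng(0)
feature_map = rng.random((4, *spec.shape))
pooled = pool_over_frequency(feature_map)

# Idealized frequency change: a circular shift along the frequency axis
# leaves the pooled result unchanged.
shifted = np.roll(feature_map, 3, axis=1)
same_after_shift = np.array_equal(pool_over_frequency(shifted), pooled)
```

Because the pooled output keeps only the time axis, a feature that moves to a neighboring frequency bin still produces the same per-frame value in this sketch, which is one plausible reading of "robust to frequency change" in the title.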

First claim

Opening claim text (preview).

What is claimed is:
1. A sound event detecting method performed by a sound event detecting apparatus, the sound event detecting method comprising: receiving an audio signal; transforming the audio signal into a two-dimensional (2D) time-frequency signal; extracting a feature map from the 2D signal using a trained convolutional neural network (CNN); pooling the feature map based on a frequency; and determining whether a sound event occurs with respect to each of one or more time intervals based on a result of the pooling.
2. The sound event detecting method of claim 1, wherein the determining comprises: calculating a probability value of a sound event occurring with respect to each of the one or more time intervals based on the result of the pooling; and determining whether a sound event occurs with respect to each of the one or more time intervals based on the probability value.
3. The sound event detecting method of claim 2, wherein the determining of whether a sound event occurs with respect to each of the one or more time intervals based on the probability value comprises determining that a sound event occurs at a time interval if a probability value corresponding to the time interval is greater than or equal to a predetermined value.
4. The sound event detecting method of claim 1, further comprising: classifying a sound event occurring at each time interval based on predefined sound event information.
5. The sound event detecting method of claim 1, wherein the audio signal is transformed into the 2D signal using one of fast Fourier transform (FFT), constant Q transform (CQT), and Wavelet.
6. A non-transitory computer-readable medium storing instructions that when executed by one or more processors, cause the one or more processors to perform the method of claim 1.
7. A sound event detecting apparatus, comprising: a memory configured to store a control program; one or more processors configured to operate based on the control program; and a receiver configured to receive an audio signal from an outside, wherein the control program is configured to perform: receiving an audio signal from an outside, transforming the audio signal into a two-dimensional (2D) time-frequency signal, extracting a feature map from the 2D signal using a trained convolutional neural network (CNN), pooling the feature map based on a frequency, and determining whether a sound event occurs with respect to each of one or more time intervals based on a result of the pooling.
8. The sound event detecting apparatus of claim 7, wherein the determining comprises: calculating a probability value of a sound event occurring with respect to each of the one or more time intervals based on the result of the pooling; and determining whether a sound event occurs with respect to each of the one or more time intervals based on the probability value.
9. The sound event detecting apparatus of claim 8, wherein the determining of whether a sound event occurs with respect to each of the one or more time intervals based on the probability value comprises determining that a sound event occurs at a time interval if a probability value corresponding to the time interval is greater than or equal to a predetermined value.
10. The sound event detecting apparatus of claim 7, wherein the control program is further configured to perform classifying a sound event occurring at each time interval based on predefined sound event information.
11. The sound event detecting apparatus of claim 7, wherein the audio signal is transformed into the 2D signal using one of fast Fourier transform (FFT), constant Q transform (CQT), and Wavelet.
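
The decision rule in claims 2, 3, 8, and 9 (an event occurs in a time interval if its probability is greater than or equal to a predetermined value) can be sketched as follows. The function names `detect_events` and `active_segments`, the threshold of 0.5, and the sample probabilities are all hypothetical; grouping flagged intervals into segments is an added convenience, not something the claims recite.

```python
import numpy as np

def detect_events(probabilities, threshold=0.5):
    """Return one boolean per time interval: True where p >= threshold."""
    return np.asarray(probabilities) >= threshold

def active_segments(flags):
    """Group consecutive True flags into (start, end) pairs, end exclusive."""
    segments, start = [], None
    for i, flag in enumerate(flags):
        if flag and start is None:
            start = i                      # a new active run begins
        elif not flag and start is not None:
            segments.append((start, i))    # the run ended at the previous index
            start = None
    if start is not None:
        segments.append((start, len(flags)))  # run extends to the last interval
    return segments

# Hypothetical per-interval probabilities.
probs = [0.1, 0.5, 0.9, 0.4, 0.2, 0.7, 0.8]
flags = detect_events(probs, threshold=0.5)
segments = active_segments(flags)
```

Note that the comparison is inclusive, so the interval with probability exactly 0.5 is flagged, matching the "greater than or equal to" wording of claims 3 and 9.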

Assignees

Electronics & Telecommunications Res Inst

Inventors

Classifications

  • Learning methods

  • for comparison or discrimination

  • the extracted parameters being spectral information of each sub-band

  • Combinations of networks

  • Probabilistic graphical models, e.g. probabilistic networks

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10540988B2 cover?
Disclosed is a sound event detecting method including receiving an audio signal, transforming the audio signal into a two-dimensional (2D) signal, extracting a feature map by training a convolutional neural network (CNN) using the 2D signal, pooling the feature map based on a frequency, and determining whether a sound event occurs with respect to each of at least one time interval based on a re…
Who is the assignee on this patent?
Electronics & Telecommunications Res Inst
What technology area does this patent fall under?
Primary CPC classification G10L21/14. Mapped technology areas include Physics.
When was this patent published?
Publication date Jan 21, 2020 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).