Who is the assignee on this patent?

Univ Hanyang Ind Univ Coop Found

What technology area does this patent fall under?

Primary CPC classification G10L21/0216. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Dec 01 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Multichannel microphone-based reverberation time estimation method and device which use deep neural network

US10854218B2 · US · B2

Patent metadata
Field	Value
Publication number	US-10854218-B2
Application number	US-201716469938-A
Country	US
Kind code	B2
Filing date	Dec 15, 2017
Priority date	Dec 15, 2016
Publication date	Dec 1, 2020
Grant date	Dec 1, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A multichannel microphone-based reverberation time estimation method and device which use a deep neural network (DNN) are disclosed. A multichannel microphone-based reverberation time estimation method using a DNN, according to one embodiment, comprises the steps of: receiving a voice signal through a multichannel microphone; deriving a feature vector including spatial information by using the inputted voice signal; and estimating the degree of reverberation by applying the feature vector to the DNN.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for multichannel microphone-based reverberation time estimation using a deep neural network (DNN), the method comprising: receiving an input of a voice signal through a multichannel microphone; deriving a feature vector that includes spatial information using the input voice signal, and estimating a degree of reverberation by applying the feature vector to the DNN, wherein the deriving of the feature vector comprises: deriving a negative-side variance (NSV) by deriving time and frequency information from the input voice signal using a short-time Fourier transform (STFT) and by deriving distribution of envelopes for each frequency band based on the derived time and frequency information; and deriving a cross-correlation function representing a correlation between two microphones in the input voice signal, and wherein the estimating of the degree of reverberation comprises estimating a reverberation time by using the derived NSV and the cross-correlation function as an input of the DNN. 2. The method of claim 1 , wherein the receiving of the voice signal through the multichannel microphone comprises estimating relative spatial information between voice signals input using the multichannel microphone. 3. The method of claim 1 , wherein the deriving of the NSV comprises: deriving a log-energy envelope from a domain of the STFT; deriving a gradient from the log-energy envelope using a least squares linear fitting; and deriving an NSV for estimating a reverberation time having a negative gradient, excluding a reverberation time having a positive gradient. 4. The method of claim 1 , wherein the estimating of the degree of reverberation by applying the feature vector to the DNN comprises estimating a reverberation time by using the derived NSV as an input of the DNN. 5. The method of claim 1 , wherein the DNN comprises three hidden layers, and each of the hidden layers is configured to be finely adjusted through a pre-training process using a plurality of epochs. 6. An apparatus for multichannel microphone-based reverberation time estimation using a deep neural network (DNN), the apparatus comprising: an inputter configured to receive an input of a voice signal through a multichannel microphone; a feature vector extractor configured to derive a feature vector that includes spatial information using the input voice signal; and a reverberation estimator configured to estimate a degree of reverberation by applying the feature vector to the DNN, wherein the feature vector extractor comprises: a negative-side variance (NSV) deriver configured to derive an NSV by deriving time and frequency information from the input voice signal using a short-time Fourier transform (STFT) and by deriving distribution of envelopes for each frequency band based on the derived time and frequency information; and a cross-correlation function deriver configured to derive a cross-correlation function representing a correlation between two microphones in the input voice signal, and wherein the reverberation estimator is configured to estimate a reverberation time by using the derived NSV the cross-correlation function as an input of the DNN. 7. The apparatus of claim 6 , wherein the NSV deriver is configured to derive a log-energy envelope from a domain of the STFT, to derive a gradient from the log-energy envelope using a least squares linear fitting, and to derive an NSV for estimating a reverberation having a negative gradient, excluding a reverberation time having a positive gradient. 8. The apparatus of claim 6 , wherein the reverberation estimator is configured to estimate a reverberation time by using the derived NSV as an input of the DNN. 9. The apparatus of claim 6 , wherein the DNN comprises three hidden layers, and each of the hidden layers is configured to be finely adjusted through a pre-training process using a plurality of epochs.

Assignees

Univ Hanyang Ind Univ Coop Found

Inventors

Classifications

G06N3/09
Supervised learning · CPC title
G06N3/0499
Feedforward networks · CPC title
G10L2021/02165
Two microphones, one receiving mainly the noise signal and the other one mainly the speech signal · CPC title
G10L2021/02082
the noise being echo, reverberation of the speech · CPC title
G10L21/0216Primary
characterised by the method used for estimating noise · CPC title

Patent family

Related publications grouped by family.

View patent family 62558970

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10854218B2 cover?: A multichannel microphone-based reverberation time estimation method and device which use a deep neural network (DNN) are disclosed. A multichannel microphone-based reverberation time estimation method using a DNN, according to one embodiment, comprises the steps of: receiving a voice signal through a multichannel microphone; deriving a feature vector including spatial information by using the …
Who is the assignee on this patent?: Univ Hanyang Ind Univ Coop Found
What technology area does this patent fall under?: Primary CPC classification G10L21/0216. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Dec 01 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).