Audio processing for an acoustical environment
US-2017278519-A1 · Sep 28, 2017 · US
US11183199B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11183199-B2 |
| Application number | US-201916415392-A |
| Country | US |
| Kind code | B2 |
| Filing date | May 17, 2019 |
| Priority date | Nov 17, 2016 |
| Publication date | Nov 23, 2021 |
| Grant date | Nov 23, 2021 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
An apparatus for decomposing an audio signal into a background component signal and a foreground component signal includes: a block generator for generating a time sequence of blocks of audio signal values; an audio signal analyzer for determining a block characteristic of a current block of the audio signal and for determining an average characteristic for a group of blocks, the group of blocks including at least two blocks; and a separator for separating the current block into a background portion and a foreground portion in response to a ratio of the block characteristic of the current block and the average characteristic of the group of blocks, wherein the background component signal includes the background portion of the current block and the foreground component signal includes the foreground portion of the current block.
Opening claim text (preview).
The invention claimed is: 1. An apparatus for decomposing an audio signal into a background component signal and a foreground component signal, the apparatus comprising: a block generator for generating a time sequence of blocks of audio signal values; an audio signal analyzer for determining a block characteristic of a current block of the audio signal and for determining an average characteristic for a group of blocks, the group of blocks comprising at least two blocks; and a separator for separating the current block into a background portion and a foreground portion in response to a ratio of the block characteristic of the current block and the average characteristic of the group of blocks, wherein the background component signal comprises the background portion of the current block and the foreground component signal comprises the foreground portion of the current block. 2. The apparatus of claim 1 , wherein the audio signal analyzer is configured for analyzing an amplitude-related measure as the block characteristic of the current block and the amplitude-related measure as the average characteristic for the group of blocks. 3. The apparatus of claim 1 , wherein the audio signal analyzer is configured for analyzing a power measure or an energy measure for the current block and an average power measure or an average energy measure for the group of blocks. 4. The apparatus of claim 1 , wherein the separator is configured to calculate a separation gain from the ratio, to weight the audio signal values of the current block using the separation gain to acquire the foreground portion of the current block, and to determine the background portion so that the background component signal constitutes a remaining signal, or wherein the separator is configured to calculate the separation gain from the ratio, to weight the audio signal values of the current block using the separation gain to acquire the background portion of the current block, and to determine the foreground portion so that the foreground component signal constitutes a remaining signal. 5. The apparatus of claim 1 , wherein the separator is configured to calculate a separation gain using weighting the ratio using a predetermined weighting factor different from zero. 6. The apparatus of claim 5 , wherein the separator is configured to calculate the separation gain using a term 1−(g N /ψ(n) P or (max(1−(g N /ψ(n))) P , wherein g N is the predetermined weighting factor, ψ(n) is the ratio and p is a power greater than zero and being an integer or a non-integer number, and wherein n is a block index, and wherein max is a maximum function for selecting a greater value of 1 and (g N /ψ(n) P . 7. The apparatus of claim 1 , wherein the separator is configured to compare the ratio of the current block to a separation threshold and to separate the current block, when the ratio of the current block is in a predetermined relation to the separation threshold, and wherein the separator is configured to not separate a further block, the further block comprising a ratio not exhibiting the predetermined relation to the separation threshold, so that the further block fully belongs to the background component signal. 8. The apparatus of claim 7 , wherein the separator is configured to separate a following block following the current block in time using comparing a ratio of the following block to a release threshold, and wherein the release threshold is set such that the ratio that is not in the predetermined relation to the separation threshold is in the predetermined relation to the release threshold. 9. The apparatus of claim 8 , wherein the predetermined relation is “greater than” and wherein the release threshold is lower than the separation threshold, or wherein the predetermined relation is “lower than” and wherein the release threshold is greater than the separation threshold. 10. The apparatus of claim 1 , wherein the block generator is configured to determine temporally overlapping blocks of audio signal values, or wherein the temporally overlapping blocks comprise a number of sampling values being less than or equal to 600. 11. The apparatus of claim 1 , wherein the block generator is configured to perform a block-wise conversion of the audio signal being a time domain audio signal into a frequency domain to acquire a spectral representation for each block, wherein the audio signal analyzer is configured to calculate the block characteristic or the average characteristic using the spectral representation of the current block, and wherein the separator is configured to separate the spectral representation into the background portion and the foreground portion so that, for spectral bins of the background portion and the foreground portion corresponding to a same frequency, each comprises a spectral value different from zero, wherein a relation of the spectral value of the foreground portion and the spectral value of the background portion within a same frequency bin depends on the ratio of the block characteristic of the current block and the average characteristic of the group of blocks. 12. The apparatus of claim 1 , wherein the block generator is configured to perform a block-wise conversion of a time domain into a frequency domain to acquire a spectral representation for each block, wherein time adjacent blocks are overlapping in an overlapping range, wherein the apparatus further comprises a signal composer for composing the background component signal and for composing the foreground component signal, and wherein the signal composer is configured for performing a frequency-time conversion for the background component signal and for the foreground component signal and for cross-fading time representations of the time-adjacent blocks within the overlapping range to acquire a time domain foreground component signal and a separate time domain background component signal. 13. The apparatus of claim 1 , wherein the audio signal analyzer is configured to determine the average characteristic for the group of blocks using a weighted addition of individual block characteristics of blocks in the group of blocks. 14. The apparatus of claim 1 , wherein the audio signal analyzer is configured to perform a weighted addition of individual block characteristics of blocks in the group of blocks, wherein a weighting value for a block characteristic of a block close in time to the current block is greater than a weighting value for a block characteristic of a further block less close in time to the current block. 15. The apparatus of claim 13 , wherein the audio signal analyzer is configured to determine the group of blocks so that the group of blocks comprises at least twenty blocks before the current block or at least twenty blocks subsequent to the current block. 16. The apparatus of claim 1 , wherein the audio signal analyzer is configured to use a normalization value depending on a number of blocks in the group of blocks or depending on weighting values for blocks in the group of blocks. 17. The apparatus of claim 1 , further comprising a signal characteristic measurer for measuring a signal characteristic of at least one of the background component signals and the foreground component signal. 18. The apparatus of claim 17 , wherein the signal characteristic measurer is configured to determine a foreground density using the foreground component signal or to determine a foreground prominence using the foreground component signal and the audio signal.
for differentiation between music and non-music signals, based on the identification of musical parameters, e.g. based on tempo detection · CPC title
Crossfade, i.e. time domain amplitude envelope control of the transition between musical sounds or melodies, obtained for musical purposes, e.g. for ADSR tone generation, articulations, medley, remix · CPC title
Fourier transform; Discrete Fourier Transform [DFT]; Fast Fourier Transform [FFT] · CPC title
using properties of sound source · CPC title
in which the audio signals are in digital form, i.e. employing more than two discrete digital channels (data reduction aspects thereof based on psychoacoustics G10L19/02) · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.