Apparatus and method for decomposing an audio signal using a ratio as a separation characteristic

US11183199B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11183199-B2
Application numberUS-201916415392-A
CountryUS
Kind codeB2
Filing dateMay 17, 2019
Priority dateNov 17, 2016
Publication dateNov 23, 2021
Grant dateNov 23, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An apparatus for decomposing an audio signal into a background component signal and a foreground component signal includes: a block generator for generating a time sequence of blocks of audio signal values; an audio signal analyzer for determining a block characteristic of a current block of the audio signal and for determining an average characteristic for a group of blocks, the group of blocks including at least two blocks; and a separator for separating the current block into a background portion and a foreground portion in response to a ratio of the block characteristic of the current block and the average characteristic of the group of blocks, wherein the background component signal includes the background portion of the current block and the foreground component signal includes the foreground portion of the current block.

First claim

Opening claim text (preview).

The invention claimed is: 1. An apparatus for decomposing an audio signal into a background component signal and a foreground component signal, the apparatus comprising: a block generator for generating a time sequence of blocks of audio signal values; an audio signal analyzer for determining a block characteristic of a current block of the audio signal and for determining an average characteristic for a group of blocks, the group of blocks comprising at least two blocks; and a separator for separating the current block into a background portion and a foreground portion in response to a ratio of the block characteristic of the current block and the average characteristic of the group of blocks, wherein the background component signal comprises the background portion of the current block and the foreground component signal comprises the foreground portion of the current block. 2. The apparatus of claim 1 , wherein the audio signal analyzer is configured for analyzing an amplitude-related measure as the block characteristic of the current block and the amplitude-related measure as the average characteristic for the group of blocks. 3. The apparatus of claim 1 , wherein the audio signal analyzer is configured for analyzing a power measure or an energy measure for the current block and an average power measure or an average energy measure for the group of blocks. 4. The apparatus of claim 1 , wherein the separator is configured to calculate a separation gain from the ratio, to weight the audio signal values of the current block using the separation gain to acquire the foreground portion of the current block, and to determine the background portion so that the background component signal constitutes a remaining signal, or wherein the separator is configured to calculate the separation gain from the ratio, to weight the audio signal values of the current block using the separation gain to acquire the background portion of the current block, and to determine the foreground portion so that the foreground component signal constitutes a remaining signal. 5. The apparatus of claim 1 , wherein the separator is configured to calculate a separation gain using weighting the ratio using a predetermined weighting factor different from zero. 6. The apparatus of claim 5 , wherein the separator is configured to calculate the separation gain using a term 1−(g N /ψ(n) P or (max(1−(g N /ψ(n))) P , wherein g N is the predetermined weighting factor, ψ(n) is the ratio and p is a power greater than zero and being an integer or a non-integer number, and wherein n is a block index, and wherein max is a maximum function for selecting a greater value of 1 and (g N /ψ(n) P . 7. The apparatus of claim 1 , wherein the separator is configured to compare the ratio of the current block to a separation threshold and to separate the current block, when the ratio of the current block is in a predetermined relation to the separation threshold, and wherein the separator is configured to not separate a further block, the further block comprising a ratio not exhibiting the predetermined relation to the separation threshold, so that the further block fully belongs to the background component signal. 8. The apparatus of claim 7 , wherein the separator is configured to separate a following block following the current block in time using comparing a ratio of the following block to a release threshold, and wherein the release threshold is set such that the ratio that is not in the predetermined relation to the separation threshold is in the predetermined relation to the release threshold. 9. The apparatus of claim 8 , wherein the predetermined relation is “greater than” and wherein the release threshold is lower than the separation threshold, or wherein the predetermined relation is “lower than” and wherein the release threshold is greater than the separation threshold. 10. The apparatus of claim 1 , wherein the block generator is configured to determine temporally overlapping blocks of audio signal values, or wherein the temporally overlapping blocks comprise a number of sampling values being less than or equal to 600. 11. The apparatus of claim 1 , wherein the block generator is configured to perform a block-wise conversion of the audio signal being a time domain audio signal into a frequency domain to acquire a spectral representation for each block, wherein the audio signal analyzer is configured to calculate the block characteristic or the average characteristic using the spectral representation of the current block, and wherein the separator is configured to separate the spectral representation into the background portion and the foreground portion so that, for spectral bins of the background portion and the foreground portion corresponding to a same frequency, each comprises a spectral value different from zero, wherein a relation of the spectral value of the foreground portion and the spectral value of the background portion within a same frequency bin depends on the ratio of the block characteristic of the current block and the average characteristic of the group of blocks. 12. The apparatus of claim 1 , wherein the block generator is configured to perform a block-wise conversion of a time domain into a frequency domain to acquire a spectral representation for each block, wherein time adjacent blocks are overlapping in an overlapping range, wherein the apparatus further comprises a signal composer for composing the background component signal and for composing the foreground component signal, and wherein the signal composer is configured for performing a frequency-time conversion for the background component signal and for the foreground component signal and for cross-fading time representations of the time-adjacent blocks within the overlapping range to acquire a time domain foreground component signal and a separate time domain background component signal. 13. The apparatus of claim 1 , wherein the audio signal analyzer is configured to determine the average characteristic for the group of blocks using a weighted addition of individual block characteristics of blocks in the group of blocks. 14. The apparatus of claim 1 , wherein the audio signal analyzer is configured to perform a weighted addition of individual block characteristics of blocks in the group of blocks, wherein a weighting value for a block characteristic of a block close in time to the current block is greater than a weighting value for a block characteristic of a further block less close in time to the current block. 15. The apparatus of claim 13 , wherein the audio signal analyzer is configured to determine the group of blocks so that the group of blocks comprises at least twenty blocks before the current block or at least twenty blocks subsequent to the current block. 16. The apparatus of claim 1 , wherein the audio signal analyzer is configured to use a normalization value depending on a number of blocks in the group of blocks or depending on weighting values for blocks in the group of blocks. 17. The apparatus of claim 1 , further comprising a signal characteristic measurer for measuring a signal characteristic of at least one of the background component signals and the foreground component signal. 18. The apparatus of claim 17 , wherein the signal characteristic measurer is configured to determine a foreground density using the foreground component signal or to determine a foreground prominence using the foreground component signal and the audio signal.

Assignees

Inventors

Classifications

  • for differentiation between music and non-music signals, based on the identification of musical parameters, e.g. based on tempo detection · CPC title

  • Crossfade, i.e. time domain amplitude envelope control of the transition between musical sounds or melodies, obtained for musical purposes, e.g. for ADSR tone generation, articulations, medley, remix · CPC title

  • Fourier transform; Discrete Fourier Transform [DFT]; Fast Fourier Transform [FFT] · CPC title

  • G10L21/028Primary

    using properties of sound source · CPC title

  • in which the audio signals are in digital form, i.e. employing more than two discrete digital channels (data reduction aspects thereof based on psychoacoustics G10L19/02) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11183199B2 cover?
An apparatus for decomposing an audio signal into a background component signal and a foreground component signal includes: a block generator for generating a time sequence of blocks of audio signal values; an audio signal analyzer for determining a block characteristic of a current block of the audio signal and for determining an average characteristic for a group of blocks, the group of block…
Who is the assignee on this patent?
Fraunhofer Ges Forschung
What technology area does this patent fall under?
Primary CPC classification G10L21/028. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 23 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).