Audio Classification Based on Perceptual Quality for Low or Medium Bit Rates
US-2017116999-A1 · Apr 27, 2017 · US
US11769512B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11769512-B2 |
| Application number | US-202117217533-A |
| Country | US |
| Kind code | B2 |
| Filing date | Mar 30, 2021 |
| Priority date | Jul 22, 2013 |
| Publication date | Sep 26, 2023 |
| Grant date | Sep 26, 2023 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
An apparatus for decoding an encoded signal includes: an audio decoder for decoding an encoded representation of a first set of first spectral portions to obtain a decoded first set of first spectral portions; a parametric decoder for decoding an encoded parametric representation of a second set of second spectral portions to obtain a decoded representation of the parametric representation, wherein the parametric information includes, for each target frequency tile, a source region identification as a matching information; and a frequency regenerator for regenerating a target frequency tile using a source region from the first set of first spectral portions identified by the matching information.
Opening claim text (preview).
The invention claimed is: 1. An apparatus for decoding an encoded audio signal to obtain a decoded audio signal, the apparatus comprising: an audio decoder configured for decoding an encoded representation of a first set of first spectral portions of the encoded audio signal to acquire a decoded first set of first spectral portions; a parametric decoder configured for decoding an encoded parametric representation of a second set of second spectral portions of the encoded audio signal to acquire a decoded parametric representation; and a frequency regenerator configured for regenerating a target frequency tile using a source region from the decoded first set of first spectral portions, wherein the decoded audio signal comprises the target frequency tile, wherein the frequency regenerator is configured for applying a whitening filter to the source region, wherein the frequency regenerator is configured, when applying the whitening filter, for calculating a spectral envelope estimate of the source region and for dividing a spectrum of the source region by a spectral envelope indicated by the spectral envelope estimate. 2. The apparatus of claim 1 , wherein the audio decoder is a spectral domain audio decoder, and wherein the apparatus further comprises a spectrum-time converter configured for converting a spectral representation of the decoded first set of first spectral portions and reconstructed second spectral portions comprising the target frequency tile into a time representation. 3. The apparatus of claim 1 , wherein the frequency regenerator comprises the whitening filter, the whitening filter being configured as a controllable whitening filter, wherein the decoded parametric representation comprises a whitening information, and wherein the frequency regenerator is configured for applying the whitening filter to the source region identified by a matching information before performing a spectral envelope adjustment, when the whitening information for the source region indicates that the source region is to be whitened. 4. The apparatus of claim 3 , wherein the whitening information comprises, for a tile or a group of tiles, a whitening level information indicating a whitening level to be applied to a source frequency tile of the source region, when regenerating the target frequency tile, and wherein the frequency regenerator is configured for selecting the whitening filter from a group of different whitening filters in response to the whitening information, before applying the whitening filter. 5. The apparatus of claim 1 , wherein the frequency regenerator comprises a source region modifier, wherein the decoded parametric representation comprises, in addition to the source region identification, a sign information, and wherein the source region modifier is configured for applying an operation to acquire a phase shift of the source region spectral values in accordance with the sign information. 6. The apparatus of claim 1 , wherein the frequency regenerator comprises a tile modulator, wherein the decoded parametric representation comprises a correlation lag in addition to the source region identification, and wherein the tile modulator is configured for applying a tile modulation in accordance with the correlation lag associated with the source region identification. 7. The apparatus of claim 1 , wherein the frequency regenerator comprises a tile modulator, wherein the decoded parametric representation comprises a correlation lag in addition to the source region identification, and wherein the tile modulator is configured for applying a tile modulation using an alternating temporal sequence of −1/1 when the correlation lag is an odd number. 8. A method of decoding an encoded audio signal to obtain a decoded audio signal, the method comprising: decoding an encoded representation of a first set of first spectral portions to acquire a decoded first set of first spectral portions of the encoded audio signal; decoding an encoded parametric representation of a second set of second spectral portions to acquire a decoded parametric representation; and regenerating a target frequency tile using a source region from the decoded first set of first spectral portions, wherein the decoded audio signal comprises the target frequency tile, wherein the regenerating comprises applying a whitening filter to the source region identified, wherein the applying the whitening filter comprises calculating a spectral envelope estimate of the source region and dividing a spectrum of the source region by a spectral envelope indicated by the spectral envelope estimate. 9. A non-transitory digital storage medium having a computer program stored thereon to perform, when said computer program is run by a computer, a method of decoding an encoded audio signal to obtain a decoded audio signal, the method comprising: decoding an encoded representation of a first set of first spectral portions of the encoded audio signal to acquire a decoded first set of first spectral portions; decoding an encoded parametric representation of a second set of second spectral portions to acquire a decoded parametric representation; and regenerating a target frequency tile using a source region from the decoded first set of first spectral portions, wherein the decoded audio signal comprises the target frequency tile, wherein the regenerating comprises applying a whitening filter to the source region, wherein the applying the whitening filter comprises calculating a spectral envelope estimate of the source region and dividing a spectrum of the source region by a spectral envelope indicated by the spectral envelope estimate.
Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring · CPC title
using band spreading techniques · CPC title
Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title
using subband decomposition · CPC title
Detection of transients or attacks for time/frequency resolution switching · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.