Apparatus and method for decoding and encoding an audio signal using adaptive spectral tile selection

US10147430B2 · US · B2

Patent metadata
Field               Value
Publication number  US-10147430-B2
Application number  US-201615003334-A
Country             US
Kind code           B2
Filing date         Jan 21, 2016
Priority date       Jul 22, 2013
Publication date    Dec 4, 2018
Grant date          Dec 4, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An apparatus for decoding an encoded signal includes: an audio decoder for decoding an encoded representation of a first set of first spectral portions to obtain a decoded first set of first spectral portions; a parametric decoder for decoding an encoded parametric representation of a second set of second spectral portions to obtain a decoded representation of the parametric representation, wherein the parametric information includes, for each target frequency tile, a source region identification as a matching information; and a frequency regenerator for regenerating a target frequency tile using a source region from the first set of first spectral portions identified by the matching information.
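The regeneration step in the abstract can be illustrated with a minimal sketch. It assumes a real-valued spectrum held in a plain list, fixed-width tiles, and a matching-information layout that maps each target tile's start bin to the index of a predefined source region. The names `regenerate_tiles`, `target_tiles`, `source_starts`, and `tile_width` are hypothetical and do not come from the patent text, and the sketch leaves out whitening and spectral envelope adjustment.

```python
from typing import Dict, List


def regenerate_tiles(
    core_spectrum: List[float],
    target_tiles: Dict[int, int],
    tile_width: int,
    source_starts: List[int],
) -> List[float]:
    """Fill each target tile from the source region named by its matching info.

    core_spectrum : decoded first spectral portions (zeros where nothing was coded)
    target_tiles  : {target_start_bin: source_region_index} (hypothetical layout)
    tile_width    : number of bins per tile (assumed constant here)
    source_starts : start bin of each predefined source region (hypothetical)
    """
    out = list(core_spectrum)  # leave the caller's spectrum untouched
    for target_start, source_idx in target_tiles.items():
        src = source_starts[source_idx]
        for k in range(tile_width):
            # Always read from the decoded core spectrum, never from a
            # tile that this loop has already regenerated.
            out[target_start + k] = core_spectrum[src + k]
    return out
```

For example, with `core_spectrum = [1.0, 2.0, 3.0, 4.0, 0.0, 0.0, 0.0, 0.0]`, `tile_width = 2`, `source_starts = [0, 2]`, and `target_tiles = {4: 1, 6: 0}`, the call returns `[1.0, 2.0, 3.0, 4.0, 3.0, 4.0, 1.0, 2.0]`: the tile at bin 4 is filled from source region 1 and the tile at bin 6 from source region 0.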

First claim

Opening claim text (preview).

The invention claimed is:

1. An audio decoder for decoding an encoded audio signal to obtain a decoded audio signal, comprising: an audio decoder element configured for decoding an encoded representation of a first set of first spectral portions of the encoded audio signal to acquire a decoded first set of first spectral portions; a parametric decoder configured for decoding an encoded parametric representation of a second set of second spectral portions of the encoded audio signal to acquire a decoded representation of the parametric representation, wherein the decoded representation of the parametric representation comprises, for each target frequency tile, a source region identification as a matching information; and a frequency regenerator configured for regenerating a target frequency tile using a source region from the first set of first spectral portions identified by the matching information, wherein the decoded audio signal comprises the target frequency tile, wherein the frequency regenerator comprises a controllable whitening filter, wherein the decoded representation of the parametric representation comprises a whitening information, wherein the frequency regenerator is configured for applying the whitening filter to a source region selected in accordance with the matching information before performing a spectral envelope adjustment, when the whitening information for the source region indicates that the source region is to be whitened, wherein the applying the whitening filter comprises calculating a spectral envelope estimate of the source region and dividing a spectrum of the source region by a spectral envelope indicated by the spectral envelope estimate, and wherein one or more of the audio decoder element, the parametric decoder and the frequency regenerator is implemented, at least in part, by one or more hardware elements of the audio decoder.

2. The audio decoder of claim 1, wherein the audio decoder element is a spectral domain audio decoder, and wherein the audio decoder further comprises a spectrum-time converter configured for converting a spectral representation of the first spectral portions and reconstructed second spectral portions into a time representation.

3. The audio decoder of claim 1, wherein the whitening information comprises, for a tile or a group of tiles, a whitening level information indicating a whitening level to be applied to a source frequency tile when regenerating the target frequency tile, and wherein the frequency regenerator is configured for applying a whitening filter selected from a group of different whitening filters in response to the whitening information.

4. The audio decoder in accordance with claim 1, wherein the frequency regenerator comprises a source region modifier, wherein the decoded representation of the parametric representation comprises, in addition to the source region identification, a sign information, and wherein the source region modifier is configured for applying an operation to acquire a phase shift of the source region spectral values in accordance with the sign information.

5. The audio decoder in accordance with claim 1, wherein the frequency regenerator comprises a tile modulator, wherein the decoded representation of the parametric representation comprises a correlation lag in addition to the source region identification, and wherein the tile modulator is configured for applying a tile modulation in accordance with the correlation lag associated with the source region identification.

6. The audio decoder in accordance with claim 1, wherein the frequency regenerator comprises a tile modulator, wherein the decoded representation of the parametric representation comprises a correlation lag in addition to the source region identification, and wherein the tile modulator is configured for applying a tile modulation using an alternating temporal sequence of −1/1 when the correlation lag is an odd number.

7. An audio encoder for encoding an audio signal to obtain an encoded audio signal, comprising: a time-spectrum converter configured for converting the audio signal into a spectral representation; a spectral analyzer configured for analyzing the spectral representation to determine a first set of first spectral portions to be encoded with a first spectral resolution, and a second set of second spectral portions to be encoded with a second spectral resolution, wherein the second spectral resolution is lower than the first spectral resolution; a parameter calculator configured for calculating similarities between predefined source regions and target regions using a correlation processing, a source region comprising a first spectral portion of the first set of first spectral portions and a target region comprising a second spectral portion of the second set of second spectral portions, wherein the parameter calculator is configured for comparing matching results for different pairs of a first spectral portion of the source region and a second spectral portion of the target region to determine a selected matching pair and for providing matching information identifying the selected matching pair; a core coder configured for encoding the first set of first spectral portions, wherein the first set of first spectral portions comprises the predefined source regions and spectral portions different from the predefined source regions; and a parametric coder for encoding the second set of second spectral portions, wherein the encoded audio signal comprises an encoded first set of first spectral portions, an encoded representation of the second set of second spectral portions, and the matching information, wherein the parameter calculator is configured for spectrally whitening the first or the second spectral portion of the pairs before performing the correlation processing to acquire the matching identification, wherein the spectrally whitening comprises calculating a spectral envelope estimate of the first or the second spectral portion and dividing a spectrum of the first or the second spectral portion, respectively, by a spectral envelope indicated by the spectral envelope estimate, and wherein one or more of the time-spectrum converter, the spectral analyzer, and the parameter calculator is implemented, at least in part, by one or more hardware elements of the audio encoder.

8. The audio encoder of claim 7, wherein the parameter calculator is configured for using predefined target regions in the second set of second spectral portions or predefined source regions in the first set of first spectral portions.

9. The audio encoder of claim 7, wherein the parameter calculator is configured so that the predefined target regions are non-overlapping, or the predefined source regions are overlapping, or wherein the predefined source regions are a subset of the first set of the first spectral portions below a gap filling start frequency, or wherein a predefined target region covers a lowest spectral region coinciding with the gap filling start frequency.

10. The audio encoder in accordance with claim 7, wherein the parameter calculator is configured for comparing pairs of a target region and a source region and a pair of the target region and the same source region, wherein the same source region is shifted by a correlation lag to provide information on the correlation lag of a selected pair as an additional matching information.

11. The audio encoder of claim 7, wherein the parameter calculator is configured for performing a correlation processing to acquire a matching result for a pair of the first spectral portion and the second spectral portion, the matching result having a negat
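The whitening and matching steps named in the claims (estimate a spectral envelope, divide the spectrum by it, then correlate candidate source regions against a target region, optionally shifted by a lag) can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: the moving-average envelope, the normalized correlation measure, the choice to whiten both regions of a pair, and all function and parameter names (`envelope_estimate`, `whiten`, `best_match`, `half_width`, `max_lag`) are hypothetical choices made for the example.

```python
from typing import List, Tuple


def envelope_estimate(spectrum: List[float], half_width: int = 2) -> List[float]:
    """Crude envelope: moving average of magnitudes (an assumed smoother)."""
    n = len(spectrum)
    env = []
    for k in range(n):
        lo, hi = max(0, k - half_width), min(n, k + half_width + 1)
        window = [abs(x) for x in spectrum[lo:hi]]
        env.append(sum(window) / len(window))
    return env


def whiten(spectrum: List[float], half_width: int = 2, eps: float = 1e-12) -> List[float]:
    """Divide each bin by the envelope estimate; eps guards against division by zero."""
    env = envelope_estimate(spectrum, half_width)
    return [x / (e + eps) for x, e in zip(spectrum, env)]


def normalized_correlation(a: List[float], b: List[float]) -> float:
    """Zero-lag correlation of two equal-length vectors, scaled to [-1, 1]."""
    num = sum(x * y for x, y in zip(a, b))
    den = (sum(x * x for x in a) * sum(y * y for y in b)) ** 0.5
    return num / den if den > 0 else 0.0


def best_match(
    low_band: List[float],
    target: List[float],
    source_starts: List[int],
    max_lag: int,
) -> Tuple[int, int, float]:
    """Return (source_index, lag, correlation) for the best source/target pair.

    Each predefined source region starts at source_starts[i] in low_band and
    is tried at lags 0..max_lag; both regions are whitened before correlating.
    """
    width = len(target)
    target_w = whiten(target)
    best = (-1, 0, float("-inf"))
    for idx, start in enumerate(source_starts):
        for lag in range(max_lag + 1):
            lo = start + lag
            if lo + width > len(low_band):
                break  # shifted region would run past the available band
            source_w = whiten(low_band[lo:lo + width])
            corr = normalized_correlation(source_w, target_w)
            if corr > best[2]:
                best = (idx, lag, corr)
    return best
```

The returned source index and lag correspond to what the claims call the matching information and the correlation lag. Because both regions are whitened before correlation, the comparison depends on fine spectral structure rather than on overall spectral tilt.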

Assignees

Fraunhofer Ges Forschung

Inventors

Classifications

  • Details of processing therefor · CPC title

  • G10L19/03 · Primary

    Spectral prediction for preventing pre-echo; Temporary noise shaping [TNS], e.g. in MPEG2 or MPEG4 · CPC title

  • Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring · CPC title

  • Noise substitution, i.e. substituting non-tonal spectral components by noisy source (comfort noise for discontinuous speech transmission G10L19/012) · CPC title

  • Detection of transients or attacks for time/frequency resolution switching · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10147430B2 cover?
An apparatus for decoding an encoded signal includes: an audio decoder for decoding an encoded representation of a first set of first spectral portions to obtain a decoded first set of first spectral portions; a parametric decoder for decoding an encoded parametric representation of a second set of second spectral portions to obtain a decoded representation of the parametric representation, whe…
Who is the assignee on this patent?
Fraunhofer Ges Forschung
What technology area does this patent fall under?
Primary CPC classification G10L21/0388. Mapped technology areas include Physics.
When was this patent published?
Publication date: Dec 4, 2018 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).