Apparatus and method for encoding or decoding an audio signal with intelligent gap filling in the spectral domain

US11922956B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11922956-B2
Application numberUS-202217653332-A
CountryUS
Kind codeB2
Filing dateMar 3, 2022
Priority dateJul 22, 2013
Publication dateMar 5, 2024
Grant dateMar 5, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An apparatus for decoding an encoded audio signal, includes a spectral domain audio decoder for generating a first decoded representation of a first set of first spectral portions, the decoded representation having a first spectral resolution; a parametric decoder for generating a second decoded representation of a second set of second spectral portions having a second spectral resolution being lower than the first spectral resolution; a frequency regenerator for regenerating every constructed second spectral portion having the first spectral resolution using a first spectral portion and spectral envelope information for the second spectral portion; and a spectrum time converter for converting the first decoded representation and the reconstructed second spectral portion into a time representation.

First claim

Opening claim text (preview).

The invention claimed is: 1. An apparatus for decoding an encoded audio signal, comprising: a spectral domain audio decoder for generating a first decoded representation of a first set of first spectral portions, the first decoded representation comprising a first spectral resolution; a parametric decoder for generating a second decoded representation of a second set of second spectral portions, the second decoded representation comprising spectral envelope information comprising a second spectral resolution being lower than the first spectral resolution; a frequency regenerator for regenerating a reconstructed second spectral portion comprising the first spectral resolution using a first spectral portion of the first set of first spectral portions and spectral envelope information for a second spectral portion of the second set of second spectral portions; and a spectrum time converter for converting the first decoded representation and the reconstructed second spectral portion into a time representation, wherein a first encoded representation of the first set of first spectral portions comprises encoded spectral lines in the first set of first spectral portions and scale factors for each scale factor band in a core range below an intelligent gap filling start frequency and for each scale factor band in a reconstruction band above the intelligent gap filling start frequency until a maximum frequency, the maximum frequency being smaller than or equal to a half of a sampling frequency of an audio signal, or wherein the spectral envelope information for the second set of second spectral portions corresponds to energy information values for scale factor bands above the intelligent gap filling start frequency, wherein each energy information value represents a single spectral value per scale factor band above the intelligent gap filling start frequency until a maximum frequency, or wherein the spectral domain audio decoder is configured to generate the first decoded representation so that a first spectral portion of the first set of first spectral portions is placed, with respect to frequency, between two second spectral portions of the second set of second spectral portions, and wherein the first spectral portion of the first set of first spectral portions and at least one of the two second spectral portions of the second set of second spectral portions belong to a scale factor band in a reconstruction band above the intelligent gap filling start frequency, or wherein the reconstructed second spectral portion comprises the scale factor bands in a reconstruction band above the intelligent gap filling start frequency, wherein the frequency regenerator comprises: a frequency tile generator for performing a tile filling operation using a source band identification for identifying the first spectral portion of the first set of first spectral portions and a target band identification identifying a target band to generate a raw second portion of spectral lines comprising the first spectral resolution; a gain factor calculator for calculating a gain factor using an analysis of a source band identified by the source band identification or the raw second portion and an analysis of the first spectral portion of the first set of first spectral portions in a reconstruction band; and an adjuster for adjusting an energy of the raw second portion so that an energy of a scale factor band in the reconstruction band in an adjusted frame comprises an energy as indicated by the energy information value for the corresponding scale factor band, wherein the first spectral portion of the first set of first spectral portions in the reconstruction band in the adjusted frame is not influenced by the adjuster, or wherein the frequency regenerator comprises a surviving energy calculator for determining a survive energy information comprising an accumulated energy of the first spectral portion of the first set of first spectral portions in the specific scale factor band; a tile energy calculator for determining a tile energy information of the two second spectral portions of the second set of second spectral portions belonging to the specific scale factor band, wherein the two second spectral portions of the second set of second spectral portions belonging to the specific scale factor band are to be generated by frequency regeneration using a first source spectral portion different from the first spectral portion in the specific scale factor band; a missing energy calculator for calculating a missing energy in the specific scale factor band, wherein the missing energy calculator is configured to operate using an energy information value for the specific scale factor band and the survive energy information generated by the surviving energy calculator; and a spectral envelope adjuster for adjusting the two second spectral portions of the second set of second spectral portions belonging to the specific scale factor band based on the missing energy information acquired by the missing energy calculator and the tile energy information acquired by the tile energy calculator to acquire the reconstructed second spectral portion comprising the first spectral resolution, wherein the first spectral portion of the first set of first spectral portions in the specific scale factor band is not influenced by the spectral envelope adjuster, or wherein the apparatus comprises an inverse scaling block for inverse scaling dequantized spectral values corresponding to first spectral portions of the first set of spectral portions using the scale factors for the scale factor bands to provide the first spectral portions below the frequency gap filling start frequency and the first spectral portions above the frequency gap filling start frequency in a reconstruction band, and wherein a spectral envelope adjuster of the frequency regenerator is configured to receive a first source spectral portion used for frequency tile filling in the reconstruction band, wherein adjusted spectral values for the second spectral portions in the reconstruction band acquired by the spectral envelope adjuster and the first spectral portion in the reconstruction band jointly represent a spectral representation of the reconstruction band. 2. The apparatus according to claim 1 , wherein the spectral domain audio decoder is configured to output a sequence of decoded frames of spectral values, a decoded frame of the sequence of decoded frames being the first decoded representation, wherein the frame comprises spectral values for the first set of first spectral portions and zero indications for the second set of second spectral portions, wherein the apparatus for decoding further comprises a combiner configured for combining spectral values generated by the frequency regenerator for the second set of second spectral portions and spectral values of the first set of first spectral portions in the reconstruction band to acquire a reconstructed spectral frame comprising spectral values for the first set of the first spectral portions and the second set of second spectral portion, and wherein the spectrum-time converter is configured to convert the reconstructed spectral frame into the time representation. 3. The apparatus according to claim 1 , wherein the spectrum-time converter is configured to perform an inverse modified discrete cosine transform, and further comprises an overlap-add stage configured for overlapping and adding subsequent time domain frames, each subsequent time domain frame originating from a spectrum representation comprising the first decoded representation and the reconstructed second spectral portion. 4. The apparatus according to claim 1 , wherein the spectral domain audio decoder is configured to generate the first decoded representation so that the first decoded representation comprises a Nyquist frequency def

Assignees

Inventors

Classifications

  • Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring · CPC title

  • using band spreading techniques · CPC title

  • G10L19/008Primary

    Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title

  • using subband decomposition · CPC title

  • Subband vocoders · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11922956B2 cover?
An apparatus for decoding an encoded audio signal, includes a spectral domain audio decoder for generating a first decoded representation of a first set of first spectral portions, the decoded representation having a first spectral resolution; a parametric decoder for generating a second decoded representation of a second set of second spectral portions having a second spectral resolution being…
Who is the assignee on this patent?
Fraunhofer Ges Forschung
What technology area does this patent fall under?
Primary CPC classification G10L19/008. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 05 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).