Audio decoder, apparatus for generating encoded audio output data and methods permitting initializing a decoder

US12094478B2 · US · B2

Patent metadata
Field                 Value
Publication number    US-12094478-B2
Application number    US-202418593784-A
Country               US
Kind code             B2
Filing date           Mar 1, 2024
Priority date         Oct 18, 2013
Publication date      Sep 17, 2024
Grant date            Sep 17, 2024

Abstract

Official abstract text for this publication.

An audio decoder decodes a bit stream of encoded audio data, which bit stream represents a sequence of audio sample values and includes a plurality of frames, wherein each frame includes associated encoded audio sample values. The audio decoder includes a determiner configured to determine whether a frame of the encoded audio data is a special frame including encoded audio sample values associated with the special frame and additional information, wherein the additional information include encoded audio sample values of a number of frames preceding the special frame, wherein the encoded audio sample values of the preceding frames are encoded using the same codec configuration as the special frame, wherein the number of preceding frames is sufficient to initialize the decoder to be in a position to decode the audio sample values associated with the special frame if the special frame is the first frame upon start-up of the decoder.
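The start-up behaviour described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption for illustration and is not taken from the patent: the `Frame` and `ToyDecoder` classes, the `pre_roll` field standing in for the "additional information", and the toy rule that each decoded sample is the sum of the current and the previous frame's payload (a stand-in for any decoder whose output depends on earlier frames).

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Frame:
    payload: List[float]                 # stand-in for encoded audio sample values
    config: str                          # hypothetical codec configuration label
    pre_roll: List["Frame"] = field(default_factory=list)  # "additional information"

    @property
    def is_special(self) -> bool:
        # Assumption: a frame counts as special when it carries preceding frames.
        return bool(self.pre_roll)


class ToyDecoder:
    """Toy stateful decoder: output depends on the previous frame's payload."""

    def __init__(self, config: str) -> None:
        self.config = config
        self.prev: Optional[List[float]] = None  # history; empty right after start-up

    def decode(self, frame: Frame) -> List[float]:
        prev = self.prev if self.prev is not None else [0.0] * len(frame.payload)
        self.prev = list(frame.payload)
        return [p + c for p, c in zip(prev, frame.payload)]


def decode_from_start(first: Frame) -> List[float]:
    """Start a fresh decoder at `first`, using the pre-roll frames if present."""
    dec = ToyDecoder(first.config)
    if first.is_special:
        for pre in first.pre_roll:
            dec.decode(pre)              # builds decoder state; output is discarded
    return dec.decode(first)


f1 = Frame([2.0, 2.0], "A")
plain = Frame([3.0, 3.0], "A")
special = Frame([3.0, 3.0], "A", pre_roll=[f1])

print(decode_from_start(plain))    # [3.0, 3.0]: history missing, output is wrong
print(decode_from_start(special))  # [5.0, 5.0]: same as decoding f1 and then the frame
```

In this toy model one preceding frame is enough to initialize the decoder; a real codec may need more, which is why the abstract speaks of "a number of frames" that is "sufficient".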

First claim

Opening claim text (preview).

The invention claimed is:

1. An audio decoder for decoding a bit stream of encoded audio data, wherein the bit stream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values, the audio decoder comprising: a determiner configured to determine whether a frame of the bit stream of encoded audio data is a special frame comprising encoded audio sample values associated with the current frame and additional information, wherein the additional information comprise encoded audio sample values of a number of frames preceding the special frame, wherein the encoded audio sample values of the preceding frames are encoded using the same codec configuration as the current frame, wherein the number of preceding frames is sufficient to initialize the decoder to be in a position to decode the audio sample values associated with the current frame if the special frame is the first frame upon start-up of the decoder; and an initializer configured to initialize the decoder if the determiner determines that the frame is a special frame, wherein initializing the decoder comprises decoding the encoded audio sample values included in the additional information before decoding the encoded audio sample values associated with the current frame wherein the initializer is configured to switch the audio decoder from a first codec configuration to a different second codec configuration if the determiner determines that the frame is a special frame and if the audio sample values of the current frame have been encoded using the different second codec configuration, and wherein, prior to being initialized, the decoder is flushed upon detection of a codec configuration change and the resulting output samples are stored in a flush buffer.

2. The audio decoder of claim 1, wherein output samples resulting from flushing the decoder are stored in a first buffer, wherein at least a portion of the output samples resulting from flushing the decoder are input to a crossfade process as a first input and output samples acquired by decoding the encoded audio sample values associated with the special frame are stored in a second buffer and input to the crossfade process as a second input.

3. The audio decoder of claim 1, configured to decode the current frame using the first codec configuration and to discard the additional information if the determiner determines that the frame is a special frame and if the audio sample values of the special frame have been encoded using the first codec configuration.

4. The audio decoder of claim 1, wherein the additional information comprise information on the codec configuration used for encoding the audio sample values associated with the current frame, wherein the determiner is configured to determine whether the codec configuration of the additional information is different from the first codec configuration.

5. The audio decoder of claim 1, comprising a crossfader configured to perform crossfading between a plurality of output sample values acquired using the current codec configuration and a plurality of output sample values acquired by decoding the encoded audio sample values associated with the current frame.

6. The audio decoder of claim 5, wherein the crossfader is configured to perform crossfading of output sample values obtained by flushing the decoder in the first codec configuration and output sample values acquired by decoding the encoded audio sample values associated with the current frame using the different second codec configuration.

7. The audio decoder of claim 1, wherein an earliest frame of the number of frames comprised in the additional information is not time-differentially encoded or entropy encoded relative to any frame previous to the earliest frame and wherein the special frame is not time-differentially encoded or entropy encoded relative to any frame previous to the earliest frame of the number of frames preceding the special frame or relative to any frame previous to the special frame.

8. The audio decoder of claim 1, wherein the additional information is implemented as an extension payload type defined to carry the encoded audio sample values of the number of frames preceding the special frame.

9. A method for decoding a bit stream of encoded audio data, wherein the bit stream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values, comprising: determining whether a frame of the bitstream of encoded audio data is a special frame comprising encoded audio sample values associated with a current frame and additional information, wherein the additional information comprise encoded audio sample values of a number of frames preceding the special frame, wherein the encoded audio sample values of the preceding frames are encoded using the same codec configuration as the special frame, wherein the number of preceding frames is sufficient to initialize a decoder to be in a position to decode the audio sample values associated with the current frame if the special frame is the first frame upon start-up of the decoder; and initializing the decoder if it is determined that the frame is a special frame, wherein the initializing comprises decoding the encoded audio sample values included in the additional information before decoding the encoded audio sample values associated with the current frame, switching the audio decoder from a first codec configuration to a different second codec configuration if it is determined that the frame is a special frame and if the audio sample values of the special frame have been encoded using the different second codec configuration, wherein, prior to being initialized, the decoder is flushed upon detection of a codec configuration change and the resulting output samples are stored in a flush buffer.

10. The method of claim 9, wherein output samples resulting from flushing the decoder are stored in a first buffer, wherein at least a portion of the output samples resulting from flushing the decoder are input to a crossfade process as a first input and output samples acquired by decoding the encoded audio sample values associated with the special frame are stored in a second buffer and input to the crossfade process as a second input.

11. The method of claim 9, wherein the bit stream of audio data comprises a first number of frames encoded using the first codec configuration and a second number of frames following the first number of frames and encoded using the second codec configuration, wherein the first frame of the second number of frames is the special frame.

12. The method of claim 9, wherein the additional information comprise information on the codec configuration used for encoding the audio sample values associated with the current frame, the method comprising determining whether the codec configuration of the additional information is different from a first current codec configuration using which encoded audio sample values of frames in the bit stream, which precede the special frame, are encoded.

13. A non-transitory digital storage medium having a computer program stored thereon to perform a method for decoding a bit stream of encoded audio data, wherein the bit stream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values, comprising: determining whether a frame of the bitstream of encoded audio data is a special frame comprising encoded audio sample values associa
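The configuration switch recited in claims 1 to 3 (flush, re-initialize from the additional information, crossfade) might be sketched as follows. This is an illustrative toy, not the claimed implementation: `Frame`, `ToyDecoder`, `handle_frame`, the `pre_roll` field, the additive decoding rule, and the linear fade shape are all assumptions, since the claims do not prescribe any of them.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Frame:
    payload: List[float]                 # stand-in for encoded audio sample values
    config: str                          # hypothetical codec configuration label
    pre_roll: List["Frame"] = field(default_factory=list)  # "additional information"


class ToyDecoder:
    """Toy stateful decoder: output is current payload plus previous payload."""

    def __init__(self, config: str) -> None:
        self.config = config
        self.prev: Optional[List[float]] = None

    def decode(self, frame: Frame) -> List[float]:
        prev = self.prev if self.prev is not None else [0.0] * len(frame.payload)
        self.prev = list(frame.payload)
        return [p + c for p, c in zip(prev, frame.payload)]

    def flush(self) -> List[float]:
        """Emit whatever the remaining internal state still yields, then reset."""
        out = list(self.prev) if self.prev is not None else []
        self.prev = None
        return out


def crossfade(first_buf: List[float], second_buf: List[float]) -> List[float]:
    """Linear crossfade: fade out first_buf while fading in second_buf."""
    n = min(len(first_buf), len(second_buf))
    out = []
    for i in range(n):
        w = (i + 1) / (n + 1)            # fade-in weight, strictly between 0 and 1
        out.append((1.0 - w) * first_buf[i] + w * second_buf[i])
    out.extend(second_buf[n:])           # past the overlap, new output only
    return out


def handle_frame(dec: ToyDecoder, frame: Frame) -> Tuple[ToyDecoder, List[float]]:
    """Decode one frame, switching configuration on a special frame if needed."""
    if frame.pre_roll and frame.config != dec.config:
        flush_buf = dec.flush()          # first buffer: output of the old configuration
        dec = ToyDecoder(frame.config)   # switch to the second configuration
        for pre in frame.pre_roll:
            dec.decode(pre)              # initialize; this output is discarded
        new_buf = dec.decode(frame)      # second buffer: output of the new configuration
        return dec, crossfade(flush_buf, new_buf)
    # Same configuration (or no additional information): pre-roll is ignored.
    return dec, dec.decode(frame)


dec = ToyDecoder("A")
dec.decode(Frame([4.0, 4.0], "A"))
switch = Frame([1.0, 1.0], "B", pre_roll=[Frame([1.0, 1.0], "B")])
dec, out = handle_frame(dec, switch)
print(dec.config, out)
```

In this run the flush buffer holds `[4.0, 4.0]` and the new configuration yields `[2.0, 2.0]`, so the printed output ramps from the old level toward the new one instead of jumping. A special frame whose configuration matches the running decoder takes the second branch, where the additional information is simply not used.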

Assignees

Inventors

Classifications

  • Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding · CPC title

  • Mode decision, i.e. based on audio signal content versus external parameters · CPC title

  • Compression (speech analysis-synthesis for redundancy reduction G10L19/00; for image communication H04N); Expansion; Suppression of unnecessary data, e.g. redundancy reduction · CPC title

  • G10L19/167 · Primary

    Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes · CPC title

  • G10L19/00 · Primary

    Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis (in musical instruments G10H) · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12094478B2 cover?
An audio decoder decodes a bit stream of encoded audio data, which bit stream represents a sequence of audio sample values and includes a plurality of frames, wherein each frame includes associated encoded audio sample values. The audio decoder includes a determiner configured to determine whether a frame of the encoded audio data is a special frame including encoded audio sample values associa…
Who is the assignee on this patent?
Fraunhofer Ges Forschung
What technology area does this patent fall under?
Primary CPC classification G10L19/167. Mapped technology areas include Physics.
When was this patent published?
Publication date Sep 17, 2024 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 11 related publications on this page (citations in our corpus or others sharing the same primary CPC).