Audio decoder, apparatus for generating encoded audio output data and methods permitting initializing a decoder
US-2023335146-A1 · Oct 19, 2023 · US
US12094478B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12094478-B2 |
| Application number | US-202418593784-A |
| Country | US |
| Kind code | B2 |
| Filing date | Mar 1, 2024 |
| Priority date | Oct 18, 2013 |
| Publication date | Sep 17, 2024 |
| Grant date | Sep 17, 2024 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
An audio decoder decodes a bit stream of encoded audio data, which bit stream represents a sequence of audio sample values and includes a plurality of frames, wherein each frame includes associated encoded audio sample values. The audio decoder includes a determiner configured to determine whether a frame of the encoded audio data is a special frame including encoded audio sample values associated with the special frame and additional information, wherein the additional information include encoded audio sample values of a number of frames preceding the special frame, wherein the encoded audio sample values of the preceding frames are encoded using the same codec configuration as the special frame, wherein the number of preceding frames is sufficient to initialize the decoder to be in a position to decode the audio sample values associated with the special frame if the special frame is the first frame upon start-up of the decoder.
Opening claim text (preview).
The invention claimed is: 1. An audio decoder for decoding a bit stream of encoded audio data, wherein the bit stream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values, the audio decoder comprising: a determiner configured to determine whether a frame of the bit stream of encoded audio data is a special frame comprising encoded audio sample values associated with the current frame and additional information, wherein the additional information comprise encoded audio sample values of a number of frames preceding the special frame, wherein the encoded audio sample values of the preceding frames are encoded using the same codec configuration as the current frame, wherein the number of preceding frames is sufficient to initialize the decoder to be in a position to decode the audio sample values associated with the current frame if the special frame is the first frame upon start-up of the decoder; and an initializer configured to initialize the decoder if the determiner determines that the frame is a special frame, wherein initializing the decoder comprises decoding the encoded audio sample values included in the additional information before decoding the encoded audio sample values associated with the current frame wherein the initializer is configured to switch the audio decoder from a first codec configuration to a different second codec configuration if the determiner determines that the frame is a special frame and if the audio sample values of the current frame have been encoded using the different second codec configuration, and wherein, prior to being initialized, the decoder is flushed upon detection of a codec configuration change and the resulting output samples are stored in a flush buffer. 2. The audio decoder of claim 1 , wherein output samples resulting from flushing the decoder are stored in a first buffer, wherein at least a portion of the output samples resulting from flushing the decoder are input to a crossfade process as a first input and output samples acquired by decoding the encoded audio sample values associated with the special frame are stored in a second buffer and input to the crossfade process as a second input. 3. The audio decoder of claim 1 , configured to decode the current frame using the first codec configuration and to discard the additional information if the determiner determines that the frame is a special frame and if the audio sample values of the special frame have been encoded using the first codec configuration. 4. The audio decoder of claim 1 , wherein the additional information comprise information on the codec configuration used for encoding the audio sample values associated with the current frame, wherein the determiner is configured to determine whether the codec configuration of the additional information is different from the first codec configuration. 5. The audio decoder of claim 1 , comprising a crossfader configured to perform crossfading between a plurality of output sample values acquired using the current codec configuration and a plurality of output sample values acquired by decoding the encoded audio sample values associated with the current frame. 6. The audio decoder of claim 5 , wherein the crossfader is configured to perform crossfading of output sample values obtained by flushing the decoder in the first codec configuration and output sample values acquired by decoding the encoded audio sample values associated with the current frame using the different second codec configuration. 7. The audio decoder of claim 1 , wherein an earliest frame of the number of frames comprised in the additional information is not time-differentially encoded or entropy encoded relative to any frame previous to the earliest frame and wherein the special frame is not time-differentially encoded or entropy encoded relative to any frame previous to the earliest frame of the number of frames preceding the special frame or relative to any frame previous to the special frame. 8. The audio decoder of claim 1 , wherein the additional information is implemented as an extension payload type defined to carry the encoded audio sample values of the number of frames preceding the special frame. 9. A method for decoding a bit stream of encoded audio data, wherein the bit stream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values, comprising: determining whether a frame of the bitstream of encoded audio data is a special frame comprising encoded audio sample values associated with a current frame and additional information, wherein the additional information comprise encoded audio sample values of a number of frames preceding the special frame, wherein the encoded audio sample values of the preceding frames are encoded using the same codec configuration as the special frame, wherein the number of preceding frames is sufficient to initialize a decoder to be in a position to decode the audio sample values associated with the current frame if the special frame is the first frame upon start-up of the decoder; and initializing the decoder if it is determined that the frame is a special frame, wherein the initializing comprises decoding the encoded audio sample values included in the additional information before decoding the encoded audio sample values associated with the current frame, switching the audio decoder from a first codec configuration to a different second codec configuration if it is determined that the frame is a special frame and if the audio sample values of the special frame have been encoded using the different second codec configuration, wherein, prior to being initialized, the decoder is flushed upon detection of a codec configuration change and the resulting output samples are stored in a flush buffer. 10. The method of claim 9 , wherein output samples resulting from flushing the decoder are stored in a first buffer, wherein at least a portion of the output samples resulting from flushing the decoder are input to a crossfade process as a first input and output samples acquired by decoding the encoded audio sample values associated with the special frame are stored in a second buffer and input to the crossfade process as a second input. 11. The method of claim 9 , wherein the bit stream of audio data comprises a first number of frames encoded using the first codec configuration and a second number of frames following the first number of frames and encoded using the second codec configuration, wherein the first frame of the second number of frames is the special frame. 12. The method of claim 9 , wherein the additional information comprise information on the codec configuration used for encoding the audio sample values associated with the current frame, the method comprising determining whether the codec configuration of the additional information is different from a first current codec configuration using which encoded audio sample values of frames in the bit stream, which precede the special frame, are encoded. 13. A non-transitory digital storage medium having a computer program stored thereon to perform a method for decoding a bit stream of encoded audio data, wherein the bit stream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values, comprising: determining whether a frame of the bitstream of encoded audio data is a special frame comprising encoded audio sample values associa
Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding · CPC title
Mode decision, i.e. based on audio signal content versus external parameters · CPC title
Compression (speech analysis-synthesis for redundancy reduction G10L19/00; for image communication H04N); Expansion; Suppression of unnecessary data, e.g. redundancy reduction · CPC title
Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes · CPC title
Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis (in musical instruments G10H) · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.