Fragment-aligned audio coding

US12149762B2 · US · B2

Patent metadata
Field: Value
Publication number: US-12149762-B2
Application number: US-202318447279-A
Country: US
Kind code: B2
Filing date: Aug 9, 2023
Priority date: Mar 9, 2015
Publication date: Nov 19, 2024
Grant date: Nov 19, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Audio video synchronization and alignment or alignment of audio to some other external clock are rendered more effective or easier by treating fragment grid and frame grid as independent values, but, nevertheless, for each fragment the frame grid is aligned to the respective fragment's beginning. A compression effectiveness loss may be kept low when appropriately selecting the fragment size. On the other hand, the alignment of the frame grid with respect to the fragments' beginnings allows for an easy and fragment-synchronized way of handling the fragments in connection with, for example, parallel audio video streaming, bitrate adaptive streaming or the like.
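
The relationship between the two grids can be illustrated with a little arithmetic. The following Python sketch is not taken from the patent; the function name `frames_for_fragment` and the use of sample counts as the time unit are assumptions chosen for illustration. It restarts the frame grid at the fragment's beginning, counts how many whole frames are needed to cover the fragment, and reports how many samples of the trailing frame extend past the fragment's end.

```python
from typing import Tuple


def frames_for_fragment(fragment_samples: int, frame_samples: int) -> Tuple[int, int]:
    """Return (frame_count, overhang_samples) for one fragment.

    Hypothetical helper: the frame grid starts at the fragment's first
    sample, so the first frame and the fragment begin together. The last
    frame may run past the fragment's end by `overhang_samples`.
    """
    if fragment_samples <= 0 or frame_samples <= 0:
        raise ValueError("lengths must be positive")
    # Integer ceiling division: whole frames needed to cover the fragment.
    frame_count = -(-fragment_samples // frame_samples)
    overhang_samples = frame_count * frame_samples - fragment_samples
    return frame_count, overhang_samples


# Assumed example values: a 2-second fragment at 48 kHz, 1024-sample frames.
count, overhang = frames_for_fragment(96000, 1024)
print(count, overhang)  # 94 256
```

In this assumed example the trailing frame overhangs by 256 samples, roughly 0.27% of the fragment, which suggests why a suitably chosen fragment size keeps the coding overhead small.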

First claim

Opening claim text (preview).

The invention claimed is:

1. A method for decoding audio content from an encoded data stream, wherein the encoded data stream comprises encoded representations of temporal fragments of the audio content, each of which has encoded thereinto a respective temporal fragment of the audio content in units of audio frames temporally aligned to a beginning of the respective temporal fragment so that the beginning of the respective temporal fragment coincides with a beginning of a first audio frame of the audio frames, wherein the method comprises decoding reconstructed versions of the temporal fragments of the audio content from the encoded representations of the temporal fragments; and joining, for playout, the reconstructed versions of the temporal fragments of the audio content together by truncating the reconstructed version of a predetermined temporal fragment at a portion of a trailing audio frame of the audio frames in units of which the predetermined temporal fragment is coded into the encoded representation of the predetermined temporal fragment, which temporally exceeds a trailing end of the predetermined temporal fragment, determining the portion of the trailing audio frame on the basis of truncation information in the encoded data stream, wherein the truncation information comprises a frame length value indicating a temporal length of the audio frames in units of which the predetermined temporal fragment is coded into the encoded representation of the predetermined temporal fragment, and a fragment length value indicating a temporal length of the predetermined temporal fragment from the beginning of the reconstructed version of the predetermined fragment to the fragment boundary with which the beginning of the reconstructed version of the succeeding temporal fragment coincides, and/or a truncation length value indicating a temporal length of the portion of the trailing audio frame or the difference between the temporal length of the portion of the trailing audio frame and the temporal length of the trailing audio frame.

2. The method according to claim 1, wherein the decoding a further predetermined temporal fragment from the encoded representation of the further predetermined temporal fragment comprises generating the reconstructed version of the further predetermined temporal fragment within a portion of a trailing audio frame of the audio frames in units of which the further predetermined temporal fragment is coded into the encoded representation of the further predetermined temporal fragment, which portion extends from a leading end of the trailing audio frame up to the fragment boundary at which a reconstructed version of a succeeding temporal fragment abuts, by flushing internal decoder states as manifesting themselves up to an audio frame immediately preceding the trailing audio frame.

3. The method according to claim 1, comprising deriving immediate playout information from the encoded representations of an even further predetermined temporal fragment, the immediate play-out information being related to the audio content at one or more pre-roll audio frames of the audio content which temporally precede(s) a beginning of the even further predetermined temporal fragment and use the immediate playout information so as to reconstruct the audio content at one or more audio frames of the even further predetermined temporal fragment immediately succeeding the beginning of the even further predetermined temporal fragment.

4. The method according to claim 3, wherein the immediate playout information is a reconstruction of the audio content at the one or more pre-roll audio frames.

5. The method according to claim 3, the immediate playout information is used in reconstructing the audio content at the one or more audio frames of the even further predetermined temporal fragment immediately succeeding the beginning of the even further predetermined temporal fragment for time domain aliasing cancellation.

6. The method according to claim 1, comprising decoding the audio frames individually using an inverse of a lapped transform causing aliasing and incurring transform windows extending beyond the frames' boundaries.

7. The method according to claim 1, wherein the decoding reconstructed versions of the temporal fragments of the audio content from the encoded representations of the temporal fragments comprises decoding the reconstructed versions of the temporal fragment of the audio content alternatively from two decoding cores.
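
As a rough illustration of the joining step in claim 1, the Python sketch below truncates each decoded fragment at the part of its trailing frame that runs past the fragment's end and then concatenates the results. It is a simplified model, not the patented decoder: `TruncationInfo`, `samples_to_keep`, and `join_fragments` are hypothetical names, all lengths are counted in samples, and the truncation length is assumed to mean the number of samples to discard from the trailing frame.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass
class TruncationInfo:
    """Hypothetical container for the truncation information of one fragment."""
    frame_length: int                        # samples per audio frame
    fragment_length: Optional[int] = None    # samples from fragment start to its boundary
    truncation_length: Optional[int] = None  # assumed: samples to drop from the trailing frame


def samples_to_keep(decoded_length: int, info: TruncationInfo) -> int:
    """Work out how many decoded samples belong to the fragment."""
    if info.truncation_length is not None:
        keep = decoded_length - info.truncation_length
    elif info.fragment_length is not None:
        keep = info.fragment_length
    else:
        raise ValueError("need a fragment length or a truncation length")
    discarded = decoded_length - keep
    # Only part of the trailing frame may be cut away, never a whole frame.
    if not 0 <= discarded < info.frame_length:
        raise ValueError("truncation must stay within the trailing frame")
    return keep


def join_fragments(decoded: Sequence[Sequence[float]],
                   infos: Sequence[TruncationInfo]) -> List[float]:
    """Truncate each decoded fragment at its boundary and join for playout."""
    if len(decoded) != len(infos):
        raise ValueError("one TruncationInfo per fragment is required")
    playout: List[float] = []
    for samples, info in zip(decoded, infos):
        playout.extend(samples[:samples_to_keep(len(samples), info)])
    return playout
```

With 4-sample frames, for example, a fragment decoded to 16 samples with a fragment length of 14 contributes its first 14 samples, and the next fragment then starts exactly at the fragment boundary.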

Assignees

Inventors

Classifications

  • using video transcoding, i.e. partial or full decoding of a coded input stream followed by re-encoding of the decoded output stream · CPC title

  • for synchronising with other signals, e.g. video signals · CPC title

  • involving reformatting operations of audio signals, e.g. by converting from one coding standard to another (details of audio signal transcoding G10L19/173) · CPC title

  • for processing of video signals · CPC title

  • G10L19/167 · Primary

    Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12149762B2 cover?
Audio video synchronization and alignment or alignment of audio to some other external clock are rendered more effective or easier by treating fragment grid and frame grid as independent values, but, nevertheless, for each fragment the frame grid is aligned to the respective fragment's beginning. A compression effectiveness loss may be kept low when appropriately selecting the fragment size. On…
Who is the assignee on this patent?
Fraunhofer Ges Forschung
What technology area does this patent fall under?
Primary CPC classification H04N21/2335. Mapped technology areas include Electricity.
When was this patent published?
Publication date Nov 19, 2024 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).