Jitter buffer control, audio decoder, method and computer program

US9997167B2 · US · B2

Patent metadata
Field: Value
Publication number: US-9997167-B2
Application number: US-201514973729-A
Country: US
Kind code: B2
Filing date: Dec 18, 2015
Priority date: Jun 21, 2013
Publication date: Jun 12, 2018
Grant date: Jun 12, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A jitter buffer control for controlling a provision of a decoded audio content on the basis of an input audio content is configured to select a frame-based time scaling or a sample-based time scaling in a signal-adaptive manner. An audio decoder uses such a jitter buffer control.
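The two kinds of time scaling named in the abstract can be contrasted with a minimal Python sketch. It relies on the claims listed on this page, which describe frame-based scaling as dropping or inserting whole audio frames and sample-based scaling as a time-shifted overlap-and-add. The function names, the linear crossfade, and the choice to insert a frame at the front of the queue are illustrative assumptions, not details taken from the patent.

```python
from collections import deque


def drop_frame(jitter_buffer):
    """Frame-based shortening: discard the oldest buffered frame, if any."""
    return jitter_buffer.popleft() if jitter_buffer else None


def insert_frame(jitter_buffer, filler_frame):
    """Frame-based lengthening: queue an extra frame ahead of the others.

    Inserting at the front is an assumption made for this sketch.
    """
    jitter_buffer.appendleft(filler_frame)


def overlap_add_shorten(samples, shift, overlap):
    """Sample-based shortening by time-shifted overlap-and-add.

    The block is crossfaded (linear weights, an assumption) with a copy of
    itself advanced by `shift` samples, so the result is `shift` samples
    shorter than the input.
    """
    if shift < 0 or overlap < 1 or shift + overlap > len(samples):
        raise ValueError("shift + overlap must fit inside the block")
    shifted = samples[shift:]
    out = []
    for i in range(overlap):
        w = i / overlap  # fade-in weight for the shifted copy
        out.append((1.0 - w) * samples[i] + w * shifted[i])
    out.extend(shifted[overlap:])
    return out
```

Dropping a frame changes the buffer depth by a whole frame at once, whereas the overlap-add variant removes an arbitrary number of samples from inside a decoded block; for example, a 160-sample block with `shift=40` and `overlap=20` comes out 120 samples long.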

First claim

Opening claim text (preview).

The invention claimed is:

1. An audio decoder for providing a decoded audio content on the basis of an input audio content, the audio decoder comprising: a jitter buffer configured to buffer a plurality of audio frames representing blocks of audio samples; a decoder core configured to provide blocks of audio samples on the basis of audio frames received from the jitter buffer; a sample-based time scaler, wherein the sample based time scaler is configured to provide time-scaled blocks of audio samples on the basis of blocks of audio samples provided by the decoder core; and a jitter buffer control for controlling the provision of the decoded audio content on the basis of the input audio content, wherein the jitter buffer control is configured to select a frame-based time scaling or a sample-based time scaling in a signal-adaptive manner, such that a decision whether a frame-based time scaling or a sample-based time scaling is used is adapted to the characteristics of the audio signal, wherein the jitter buffer control is implemented using a hardware apparatus, or using a computer, or using a combination of a hardware apparatus and a computer, and wherein the jitter buffer control is configured to select a frame-based time scaling, which is performed by the jitter buffer, or a sample-based time scaling, which is performed by the sample-based time scaler, in a signal-adaptive manner.

2. The audio decoder according to claim 1, wherein the jitter buffer is configured to drop or insert audio frames in order to perform a frame-based time scaling.

3. The audio decoder according to claim 1, wherein the decoder core is configured to perform a comfort noise generation in response to a frame carrying a signaling information indicating a generation of comfort noise, and wherein the decoder core is configured to perform a concealing in response to an empty jitter buffer.

4. The audio decoder according to claim 1, wherein the sample-based time scaler is configured to perform the time scaling of the input audio signal in dependence on a computation or an estimation of the quality of the time scaled version of the input audio signal acquirable by the time scaling.

5. A method for controlling a provision of a decoded audio content on the basis of an input audio content, wherein the method comprises selecting a frame-based time scaling or a sample-based time scaling in a signal-adaptive manner; wherein audio frames are dropped or inserted to control a depth of a jitter buffer when the frame-based time scaling is used, and wherein a time-shifted overlap-and-add of audio signal portions is performed when the sample-based time-scaling is used; wherein the method comprises selecting a frame-based comfort noise insertion or a frame-based comfort noise deletion for a time scaling if a discontinuous transmission in conjunction with comfort noise generation is currently used or was used for a previous frame, selecting an overlap-add-operation using a predetermined time shift for a time scaling if a current audio signal portion is active but comprises a signal energy which is smaller than or equal to an energy threshold value, and if a jitter buffer is not empty, or if a previous audio signal portion was active but comprises a signal energy which is smaller than or equal to the energy threshold value, and if the jitter buffer is not empty; selecting an overlap-add-operation using a signal-adaptive time shift for a time scaling if a current audio signal portion is active and comprises a signal energy which is larger than or equal to the energy threshold value and if the jitter buffer is not empty, or if a previous audio signal portion was active and comprises a signal energy which is larger than or equal to the energy threshold value and if the jitter buffer is not empty; and selecting an insertion of a concealed frame for a time scaling if a current audio signal portion is active and if the jitter buffer is empty, or if a previous audio signal portion was active and if the jitter buffer is empty.

6. A method for controlling a jitter buffer for provision of a decoded audio content on the basis of an input audio content, wherein the method comprises selecting a frame-based time scaling or a sample-based time scaling in a signal-adaptive manner; wherein audio frames are dropped or inserted to control a depth of a jitter buffer when the frame-based time scaling is used, and wherein a time-shifted overlap-and-add of audio signal portions is performed when the sample-based time-scaling is used; wherein the method comprises selecting an overlap-add-operation using a signal-adaptive time shift and a quality control mechanism for a time scaling if a current audio signal portion is active and comprises a signal energy which is larger than or equal to the energy threshold value and if the jitter buffer is not empty, or if a previous audio signal portion was active and comprises a signal energy which is larger than or equal to the energy threshold value and if the jitter buffer is not empty.

7. A non-transitory computer-readable medium comprising a computer program for performing the method according to claim 5 when the computer program is running on a computer.

8. A non-transitory computer-readable medium comprising a computer program for performing the method according to claim 6 when the computer program is running on a computer.

9. A method for controlling a provision of a decoded audio content on the basis of an input audio content, wherein the method comprises selecting a frame-based time scaling or a sample-based time scaling in a signal-adaptive manner, such that a decision whether a frame-based time scaling or a sample-based time scaling is used is adapted to the characteristics of the audio signal; wherein audio frames are dropped or inserted to control a depth of a jitter buffer when the frame-based time scaling is used, and wherein a time-shifted overlap-and-add of audio signal portions is performed when the sample-based time-scaling is used; wherein a comfort noise insertion or a comfort noise deletion is selected if a previous frame was inactive; wherein a time-shifted overlap-and-add of audio signal portions is selected if a previous frame was active.

10. A non-transitory computer-readable medium comprising a computer program for performing the method according to claim 9 when the computer program is running on a computer.
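The four-way selection recited in claim 5 can be read as a small decision function. The Python sketch below is one possible reading, not the patented implementation. The names `Scaling` and `select_time_scaling` are hypothetical, the order in which the conditions are checked is an assumption, and the "current or previous" alternatives of the claim are collapsed into a single set of inputs. The claim wording covers the case where the energy equals the threshold in both overlap-add branches; the sketch arbitrarily assigns that case to the signal-adaptive branch.

```python
from enum import Enum


class Scaling(Enum):
    """Time scaling methods named in claim 5 (labels are illustrative)."""
    COMFORT_NOISE_FRAME = "frame-based comfort noise insertion or deletion"
    OLA_FIXED_SHIFT = "overlap-add with predetermined time shift"
    OLA_ADAPTIVE_SHIFT = "overlap-add with signal-adaptive time shift"
    CONCEALED_FRAME = "insertion of a concealed frame"


def select_time_scaling(dtx_comfort_noise, active, energy,
                        energy_threshold, buffer_empty):
    """Pick a time scaling method from the signal state.

    dtx_comfort_noise: discontinuous transmission with comfort noise
        generation is in use now or was for the previous frame.
    active: the current (or previous) audio signal portion is active.
    buffer_empty: the jitter buffer holds no frames.
    """
    if dtx_comfort_noise:
        return Scaling.COMFORT_NOISE_FRAME
    if not active:
        return None  # claim 5 names no method for this case
    if buffer_empty:
        return Scaling.CONCEALED_FRAME
    if energy < energy_threshold:
        return Scaling.OLA_FIXED_SHIFT
    return Scaling.OLA_ADAPTIVE_SHIFT  # equality assigned here by assumption
```

Under these assumptions, an active low-energy portion with frames still buffered maps to the predetermined-shift overlap-add, while the same portion with an empty buffer maps to concealed-frame insertion.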

Assignees

Fraunhofer Ges Forschung

Inventors

Classifications

  • G10L19/022 · Primary

    Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring · CPC title

  • using predictive techniques · CPC title

  • Synchronisation of packets and cells, e.g. transmission of voice via a packet network, circuit emulation service [CES] (queuing arrangements in packet switching elements H04L49/90; synchronising systems for the synchronous transmission of a pulse code modulated video signal with one or more other pulse code modulated signals H04N7/56) · CPC title

  • unidirectional timestamps · CPC title

  • Time compression or expansion · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9997167B2 cover?
A jitter buffer control for controlling a provision of a decoded audio content on the basis of an input audio content is configured to select a frame-based time scaling or a sample-based time scaling in a signal-adaptive manner. An audio decoder uses such a jitter buffer control.
Who is the assignee on this patent?
Fraunhofer Ges Forschung
What technology area does this patent fall under?
Primary CPC classification G10L19/022. Mapped technology areas include Physics.
When was this patent published?
Publication date Jun 12, 2018 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).