Method and Apparatus for Decoding Speech/Audio Bitstream

US2016343382A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2016343382-A1
Application numberUS-201615197364-A
CountryUS
Kind codeA1
Filing dateJun 29, 2016
Priority dateDec 31, 2013
Publication dateNov 24, 2016
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method and an apparatus for decoding a speech/audio bitstream are disclosed, where the method for decoding a speech/audio bitstream includes determining whether a current frame is a normal decoding frame or a redundancy decoding frame, obtaining a decoded parameter of the current frame by means of parsing when the current frame is a normal decoding frame or a redundancy decoding frame, performing post-processing on the decoded parameter of the current frame to obtain a post-processed decoded parameter of the current frame, and using the post-processed decoded parameter of the current frame to reconstruct a speech/audio signal.

First claim

Opening claim text (preview).

1 . A method for decoding a speech/audio bitstream, comprising: determining whether a current frame is a normal decoding frame or a redundancy decoding frame; parsing a bitstream to obtain a decoded parameter of the current frame when the current frame is the normal decoding frame or the redundancy decoding frame; performing post-processing on the decoded parameter of the current frame to obtain a post-processed decoded parameter of the current frame; and setting the post-processed decoded parameter of the current frame to recover reconstruct a speech/audio signal. 2 . The method according to claim 1 , wherein the decoded parameter of the current frame comprises a spectral pair parameter of the current frame, and wherein performing the post-processing on the decoded parameter of the current frame comprises setting the spectral pair parameter of the current frame and a spectral pair parameter of a previous frame of the current frame to obtain a post-processed spectral pair parameter of the current frame. 3 . The method according to claim 2 , wherein the post-processed spectral pair parameter of the current frame is obtained through calculation using the formula lsp[k]=α*lsp_old[k]+δ*lsp_new[k], wherein 0≦k≦M, wherein lsp[k] is the post-processed spectral pair parameter of the current frame, wherein lsp_old[k] is the spectral pair parameter of the previous frame, wherein lsp_new[k] is the spectral pair parameter of the current frame, wherein M is an order of spectral pair parameters, wherein α is a weight of the spectral pair parameter of the previous frame, wherein δ is a weight of the spectral pair parameter of the current frame, wherein α≧0 , wherein δ≧0, and wherein α+δ=1. 4 . The method according to claim 2 , wherein the post-processed spectral pair parameter of the current frame is obtained through calculation using the formula lsp[k]=α*lsp_old[k]+β*lsp_mid[k]+δ*lsp_new[k], wherein 0≦k≦M, wherein lsp[k] is the post-processed spectral pair parameter of the current frame, wherein lsp_old[k] is the spectral pair parameter of the previous frame, wherein lsp_mid[k] is a middle value of the spectral pair parameter of the current frame, wherein lsp_new[k] is the spectral pair parameter of the current frame, wherein M is an order of spectral pair parameters, wherein α is a weight of the spectral pair parameter of the previous frame, wherein β is a weight of the middle value of the spectral pair parameter of the current frame, wherein δ is a weight of the spectral pair parameter of the current frame, wherein a α≧0, wherein β≧0, wherein δ≧0, and wherein α+β+δ=1. 5 . The method according to claim 4 , wherein when the current frame is the redundancy decoding frame and a signal class of the current frame is not unvoiced, and when a signal class of a next frame of the current frame is unvoiced, a value of β is 0 or is less than a preset threshold. 6 . The method according to claim 4 , wherein when the current frame is the redundancy decoding frame and a signal class of the current frame is not unvoiced, a spectral tilt factor of the previous frame of the current frame is less than a preset spectral tilt factor threshold, a value of β is 0 or is less than a preset threshold. 7 . The method according to claim 4 , wherein when the current frame is the redundancy decoding frame and a signal class of the current frame is not unvoiced, a signal class of a next frame of the current frame is unvoiced and a spectral tilt factor of the previous frame of the current frame is less than a preset spectral tilt factor threshold, a value of β is 0 or is less than a preset threshold. 8 . The method according to claim 3 , wherein when a signal class of the current frame is unvoiced, the previous frame of the current frame is the redundancy decoding frame, and a signal class of the previous frame of the current frame is not unvoiced, a value of α is 0 or is less than a preset threshold. 9 . The method according to claim 3 , wherein when the current frame is the redundancy decoding frame and a signal class of the current frame is not unvoiced, when a signal class of a next frame of the current frame is unvoiced, a value of δ is 0 or is less than a preset threshold. 10 . The method according to claim 3 , wherein when the current frame is the redundancy decoding frame and a signal class of the current frame is not unvoiced, a spectral tilt factor of the previous frame of the current frame is less than a preset spectral tilt factor threshold, a value of δ is 0 or is less than a preset threshold. 11 . The method according to claim 3 , wherein when the current frame is the redundancy decoding frame and a signal class of the current frame is not unvoiced, a signal class of a next frame of the current frame is unvoiced and a spectral tilt factor of the previous frame of the current frame is less than a preset spectral tilt factor threshold, a value of δ is 0 or is less than a preset threshold. 12 . The method according to claim 6 , wherein the spectral tilt factor may be positive or negative, and wherein a smaller spectral tilt factor indicates the signal class, which is more inclined to be unvoiced, of a frame corresponding to the spectral tilt factor. 13 . The method according to claim 1 , wherein the decoded parameter of the current frame comprises an adaptive codebook gain of the current frame, and wherein when the current frame is the redundancy decoding frame, when a next frame of the current frame is an unvoiced frame, or a next frame of the next frame of the current frame is the unvoiced frame and an algebraic codebook of a current subframe of the current frame is a first quantity of times an algebraic codebook of a previous subframe of the current subframe or an algebraic codebook of the previous frame of the current frame, performing the post-processing on the decoded parameter of the current frame comprises attenuating an adaptive codebook gain of the current subframe of the current frame. 14 . The method according to claim 1 , wherein the decoded parameter of the current frame comprises an adaptive codebook gain of the current frame, and wherein when the current frame or a previous frame of the current frame is the redundancy decoding frame, when a signal class of the current frame is generic and a signal class of a next frame of the current frame is voiced or a signal class of the previous frame of the current frame is generic and the signal class of the current frame is voiced, and an algebraic codebook of one subframe in the current frame is different from an algebraic codebook of a previous subframe of the one subframe by a second quantity of times or the algebraic codebook of one subframe in the current frame is different from the algebraic codebook of the previous frame of the current frame by the second quantity of times, performing post-processing on the decoded parameter of the current frame comprises adjusting an adaptive codebook gain of a current subframe of the current frame according to at least one of a ratio of an algebraic codebook of the current subframe of the current frame to an algebraic codebook of a neighboring subframe of the current subframe of the current frame, a ratio of the adaptive codebook gain of the current subframe of the current frame to an adaptive codebook gain of the neighboring subframe of the current subframe of the current frame, and a ratio of the algebraic codebook of the current subframe of the current frame to the algebraic codebook of the previous frame of the current frame. 15 . The method according to claim 1 , wherein the decoded parameter of the current frame comprises an algebraic codebook of the c

Assignees

Inventors

Classifications

  • Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title

  • G10L19/005Primary

    Correction of errors induced by the transmission channel, if related to the coding algorithm · CPC title

  • G10L19/167Primary

    Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes · CPC title

  • Decision in previous or following frames · CPC title

  • Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016343382A1 cover?
A method and an apparatus for decoding a speech/audio bitstream are disclosed, where the method for decoding a speech/audio bitstream includes determining whether a current frame is a normal decoding frame or a redundancy decoding frame, obtaining a decoded parameter of the current frame by means of parsing when the current frame is a normal decoding frame or a redundancy decoding frame, perfor…
Who is the assignee on this patent?
Huawei Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G10L19/005. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Nov 24 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).