System and method for mixed codebook excitation for speech coding

US9972325B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9972325-B2
Application numberUS-201313768814-A
CountryUS
Kind codeB2
Filing dateFeb 15, 2013
Priority dateFeb 17, 2012
Publication dateMay 15, 2018
Grant dateMay 15, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In accordance with an embodiment, a method of encoding an audio/speech signal includes determining a mixed codebook vector based on an incoming audio/speech signal, where the mixed codebook vector includes a sum of a first codebook entry from a first codebook and a second codebook entry from a second codebook. The method further includes generating an encoded audio signal based on the determined mixed codebook vector, and transmitting a coded excitation index of the determined mixed codebook vector.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of encoding an audio/speech signal, the method comprising: for each frame in an incoming audio/speech signal having a low bit rate, determining a mixed excitation and an adaptive codebook excitation based on the incoming audio/speech signal, the mixed excitation comprising a sum of a first excitation entry from a first codebook and a second excitation entry from a second codebook, wherein the first and second codebooks are both fixed but different codebooks, wherein the adaptive excitation comprises an entry from an adaptive codebook, wherein the first codebook comprises pulse-like entries, wherein the pulse-like entries comprise non-periodic, signed, and unit magnitude pulses specially designed for an Algebraic Code-Excited Linear Prediction (ACELP) speech coding algorithm, and the second codebook comprises noise-like entries, wherein determining the mixed excitation is performed in time domain; applying a first filter to the first excitation entry from the first codebook; applying a second filter to the second excitation entry from the second codebook, the second filter being different from the first filter; for each subframe in each frame in the incoming audio/speech signal, searching pulse-like entries in the first codebook, by using an Analysis-By-Synthesis searching approach, to find an entry that minimizes a weighted error between a synthesized speech and the incoming audio/speech signal, and coding an index of the entry to obtain at least one coded excitation index; generating an encoded audio signal based on the determined mixed excitation and the adaptive codebook excitation; and transmitting the at least one coded excitation index of the determined mixed excitation, wherein the determining and generating are performed using a hardware-based audio encoder. 2. The method of claim 1 , wherein determining the mixed excitation comprises: computing first correlations between a filtered target vector and filtered entries in the first codebook, wherein the filtered target vector is based on the incoming audio signal; determining a first group of highest first correlations; computing second correlations between a filtered target vector and filtered entries in the second codebook; determining a second group of highest second correlations; and computing a first criterion function of combinations of the first and second groups, wherein the first criterion function comprises a function of one of the first group of highest first correlations, one of the second group of highest second correlations and an energy of corresponding entries from the first codebook and the second codebook. 3. The method of claim 2 , further comprising: determining a third group of candidate correlations based on a highest computed first criterion functions; and selecting the mixed excitation based on applying a second criterion function to the third group, wherein the mixed excitation corresponds to codebook entries from the first codebook and the second codebook associated with a highest value of the second criterion function. 4. The method of claim 3 , wherein: the first criterion function is Q ⁡ ( i , j ) = [ R CB ⁢ ⁢ 1 ⁡ ( i ) + R CB ⁢ ⁢ 2 ⁡ ( j ) ] 2 E CB ⁢ ⁢ 1 ⁡ ( i ) + E CB ⁢ ⁢ 2 ⁡ ( j ) ; i = 0 , 1 , … ⁢ , K CB ⁢ ⁢ 1 0 - 1 ; j = 0 , 1 , … ⁢ , K CB ⁢ ⁢ 2 0 - 1 ,

Assignees

Inventors

Classifications

  • Correction of errors induced by the transmission channel, if related to the coding algorithm · CPC title

  • Compression (speech analysis-synthesis for redundancy reduction G10L19/00; for image communication H04N); Expansion; Suppression of unnecessary data, e.g. redundancy reduction · CPC title

  • Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor · CPC title

  • G10L19/12Primary

    the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders · CPC title

  • Pre-filtering or post-filtering · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9972325B2 cover?
In accordance with an embodiment, a method of encoding an audio/speech signal includes determining a mixed codebook vector based on an incoming audio/speech signal, where the mixed codebook vector includes a sum of a first codebook entry from a first codebook and a second codebook entry from a second codebook. The method further includes generating an encoded audio signal based on the determine…
Who is the assignee on this patent?
Huawei Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G10L19/12. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 15 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).