Method, terminal, system for audio encoding/decoding/codec

US9997166B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9997166-B2
Application numberUS-201715790876-A
CountryUS
Kind codeB2
Filing dateOct 23, 2017
Priority dateAug 20, 2013
Publication dateJun 12, 2018
Grant dateJun 12, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Audio encoding methods/terminals, audio decoding methods/terminals, and audio codec systems are provided. A plurality of audio signals that are continuous is obtained. it is determined whether each audio signal of the plurality of audio signals includes a designated signal type, according to an audio parameter of each audio signal. A marked audio encoding stream is obtained by performing a marking to each audio signal as having or not having the designated signal type. The marking is used, at a decoding terminal, to perform an enhancement-process to one or more audio signals having the designated signal type. The enhancement-process is not performed to audio signals that do not have the designated signal type.

First claim

Opening claim text (preview).

What is claimed is: 1. An audio encoding method, comprising: obtaining a plurality of audio signals that are continuous; determining a type of each audio signal of the plurality of audio signals, according to an audio parameter of each audio signal and threshold values of corresponding categories of the audio parameter, wherein the categories of the audio parameter include logarithmic energy, a high-zero-crossing-rate-ratio (HZCRR), and a spectral flux (SF); and wherein the type of each audio signal is one of a designated signal type, a voice signal type, and a mute signal type; determining the type of the audio signal as the mute signal type when the logarithmic energy of the audio signal is less than a first threshold value; determining the type of the audio signal as the voice signal type when the logarithmic energy of the audio signal is no less than the first threshold value, and the HZCRR is more than a second threshold value; determining the type of the audio signal as the designated signal type when the logarithmic energy of the audio signal is no less than the first threshold value, the HZCRR is no more than the second threshold value, and the SF is more than a third threshold value; and obtaining a marked audio encoding stream by performing a marking to each audio signal as having or not having the designated signal type, wherein the marking is used at a decoding terminal to perform an enhancement-process to one or more audio signals having the designated signal type, and the enhancement-process is not performed to audio signals that do not have the designated signal type. 2. The method according to claim 1 , wherein an audio signal having the designated signal type is an analogous audio signal. 3. The method according to claim 1 , further comprising: obtaining the marked audio encoding stream; obtaining the plurality of audio signals from the marked audio encoding stream and obtaining the marking of at least a portion of the plurality of audio signals; performing the enhancement-process to one or more audio signals having the designated signal type according to the marking, to obtain an enhanced audio signal; and adding the enhanced audio signal into a decoding stream of the plurality of audio signals to obtain an audio decoding signal. 4. The method according to claim 3 , wherein the designated signal type is an analogous audio signal, and wherein performing the enhancement-process comprises: performing a frequency-spectrum enhancement and an acoustic-image extension to the analogous audio signal. 5. The method according to claim 4 , wherein processing the frequency-spectrum enhancement to the analogous audio signal comprises: obtaining a frequency of each audio signal; determining a frequency-spectrum enhancement coefficient of each audio signal, according to the frequency of each audio signal; and performing the frequency-spectrum enhancement to each audio signal, according to the frequency-spectrum enhancement coefficient of each audio signal. 6. The method according to claim 4 , wherein performing the acoustic-image extension to the analogous audio signal comprises: using a delaying parameter to perform the acoustic-image extension to the analogous audio signal. 7. An audio decoding method, comprising: obtaining an audio encoding stream to be decoded; obtaining a plurality of audio signals that are continuous and an audio parameter of each audio signal, from the audio encoding stream; determining whether each audio signal includes a designated signal type; for audio signals not having the designated signal type, directly performing a high frequency recovery and a stereo recovery, to obtain one or more enhanced audio signals; for the one or more audio signals having the designated signal type, performing a frequency-spectrum enhancement and an acoustic-image extension, performing the high frequency recovery after the frequency spectrum enhancement, and performing the stereo recovery after the acoustic-image extension, to obtain one or more enhanced audio signals; and adding the one or more enhanced audio signals into a decoding stream of the plurality of audio signals to obtain an audio decoding signal. 8. The method according to claim 7 , wherein determining whether each audio signal includes a designated signal type further comprises: when the audio encoding stream includes a marking for each audio signal representing a type of the audio signal, determining whether each audio signal includes the designated signal type according to the marking; and when the audio encoding stream does not include the marking for each audio signal representing the type of the audio signal, determining whether each audio signal includes the designated signal type according to the audio parameter of each audio signal. 9. The method according to claim 7 , wherein the designated signal type is an analogous audio signal, wherein the audio parameter of each audio signal comprises total frequency-spectrum energy, a spectral flatness measure (SFM), and a spectral flux (SF), and wherein determining whether each audio signal includes the designated signal type comprises: determining that an audio signal is the analogous audio signal, when the total frequency-spectrum energy of the audio signal is more than a first threshold value, the spectral flatness measure (SFM) is less than a second threshold value, and the spectral flux (SF) is more than a third threshold value. 10. The method according to claim 7 , wherein processing the frequency-spectrum enhancement comprises: obtaining a frequency of each audio signal; determining a frequency-spectrum enhancement coefficient of each audio signal, according to the frequency of each audio signal; and performing the frequency-spectrum enhancement to each audio signal, according to the frequency-spectrum enhancement coefficient of each audio signal. 11. An audio encoding apparatus, comprising a memory, and a processor coupled to the memory, the processor being configured for: obtaining a plurality of audio signals that are continuous; determining a type of each audio signal of the plurality of audio signals, according to an audio parameter of each audio signal and threshold values of corresponding categories of the audio parameter, wherein the categories of the audio parameter include logarithmic energy, a high-zero-crossing-rate-ratio (HZCRR), and a spectral flux (SF); and wherein the type of each audio signal is one of a designated signal type, a voice signal type, and a mute signal type; determining the type of the audio signal as the mute signal type when the logarithmic energy of the audio signal is less than a first threshold value; determining the type of the audio signal as the voice signal type when the logarithmic energy of the audio signal is no less than the first threshold value, and the HZCRR is more than a second threshold value; determining the type of the audio signal as the designated signal type when the logarithmic energy of the audio signal is no less than the first threshold value, the HZCRR is no more than the second threshold value, and the SF is more than a third threshold value; and obtaining a marked audio encoding stream by performing a marking to each audio signal as having or not having the designated signal type, wherein the marking is used at a decoding terminal to perform an enhancement-process to one or more audio signals having the designated signal type, and the enhancement-process is not performed to audio signals that do not have the designated signal type. 12. The apparatus according to claim 11 , wherein an audio signal having the designated signal type is an analogous

Assignees

Inventors

Classifications

  • the extracted parameters being power information · CPC title

  • Quantisation or dequantisation of spectral components · CPC title

  • the extracted parameters being zero crossing rates · CPC title

  • G10L19/02Primary

    using spectral analysis, e.g. transform vocoders or subband vocoders · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9997166B2 cover?
Audio encoding methods/terminals, audio decoding methods/terminals, and audio codec systems are provided. A plurality of audio signals that are continuous is obtained. it is determined whether each audio signal of the plurality of audio signals includes a designated signal type, according to an audio parameter of each audio signal. A marked audio encoding stream is obtained by performing a mark…
Who is the assignee on this patent?
Tencent Tech Shenzhen Co Ltd
What technology area does this patent fall under?
Primary CPC classification G10L19/02. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 12 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).