Encoding device and method, decoding device and method, and program

US9390717B2 · US · B2

Patent metadata
Publication number: US-9390717-B2
Application number: US-201214237933-A
Country: US
Kind code: B2
Filing date: Aug 14, 2012
Priority date: Aug 24, 2011
Publication date: Jul 12, 2016
Grant date: Jul 12, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present technology relates to an encoding device and method, a decoding device and method, and a program that enable audio of a high audio quality to be obtained with a smaller code amount. The encoding device multiplexes low frequency encoding data obtained by encoding a low frequency component of an input signal and high frequency encoding data obtained by encoding data including an estimation coefficient to acquire a high frequency component of the input signal by estimation and outputs multiplexed data. When the input signal is encoded, a calculation unit calculates pseudo high frequency subband power to be an estimation value of power of the high frequency component from an estimation coefficient selected in a frame immediately before a frame of a processing target and the high frequency component of the input signal. In addition, a determination unit determines whether reuse of the estimation coefficient of the immediately previous frame is enabled in the frame of the processing target, on the basis of a comparison result of the calculated pseudo high frequency subband power and actual high frequency component power. The present invention can be applied to the encoding device.
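The reuse decision summarized in the abstract can be sketched in a few lines. This is a minimal illustration, not the patented implementation: it assumes the estimator is a linear combination of low-band feature values (the page does not state the estimator's form), it uses the square-sum-of-differences criterion named in claim 4, and it assumes that the fallback search picks the coefficient set with the smallest square sum. The names `estimate_pseudo_power`, `can_reuse`, and `choose_coefficient` are hypothetical.

```python
from typing import List, Optional, Sequence, Tuple

# Hypothetical layout: one coefficient set = (weights per high subband, offset per high subband).
CoeffSet = Tuple[Sequence[Sequence[float]], Sequence[float]]


def estimate_pseudo_power(feature: Sequence[float], coeff_set: CoeffSet) -> List[float]:
    """Assumed estimator: one linear combination of low-band features per high subband."""
    weights, offsets = coeff_set
    return [sum(w * f for w, f in zip(row, feature)) + b
            for row, b in zip(weights, offsets)]


def square_sum_of_differences(pseudo: Sequence[float], actual: Sequence[float]) -> float:
    """Square sum of differences between pseudo and actual high subband power."""
    return sum((p - a) ** 2 for p, a in zip(pseudo, actual))


def can_reuse(prev_set: CoeffSet, feature: Sequence[float],
              actual: Sequence[float], threshold: float) -> bool:
    """Reuse is enabled when the square sum is the threshold value or less."""
    pseudo = estimate_pseudo_power(feature, prev_set)
    return square_sum_of_differences(pseudo, actual) <= threshold


def choose_coefficient(prev_index: Optional[int], table: Sequence[CoeffSet],
                       feature: Sequence[float], actual: Sequence[float],
                       threshold: float) -> Tuple[int, bool]:
    """Return (coefficient index, reused flag) for the frame being processed."""
    if prev_index is not None and can_reuse(table[prev_index], feature, actual, threshold):
        return prev_index, True
    # Reuse disabled: evaluate every prepared set and keep the closest one
    # (assumed selection rule; the page only says the powers are compared).
    best = min(range(len(table)),
               key=lambda i: square_sum_of_differences(
                   estimate_pseudo_power(feature, table[i]), actual))
    return best, False
```

When reuse succeeds, only the previous frame's coefficient set is evaluated, which avoids scoring every prepared set for that frame.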

First claim

Opening claim text (preview).

The invention claimed is:

1. An encoding device including: a subband division unit that performs band division of an input signal and generates high frequency subband signals of subbands of a high frequency side of the input signal; a calculation unit that calculates pseudo high frequency subband power to be an estimation value of high frequency subband power of the high frequency subband signal of a frame of a processing target, on the basis of a feature amount obtained from a low frequency signal of the input signal and an estimation coefficient selected in a frame immediately before the frame of the processing target of the input signal among a plurality of estimation coefficients prepared in advance; a generation unit that, when reuse of the estimation coefficient of the immediately previous frame is enabled in the frame of the processing target, on the basis of the pseudo high frequency subband power and the high frequency subband power obtained from the high frequency subband signal, generates data to obtain the reuse enabled estimation coefficient; a low frequency encoding unit that encodes the low frequency signal and generates low frequency encoding data; and a multiplexing unit that multiplexes the data and the low frequency encoding data and generates an output code string.

2. The encoding device according to claim 1, further comprising: a pseudo high frequency subband power calculation unit that calculates the pseudo high frequency subband power on the basis of the feature amount and the estimation coefficients, for every plurality of estimation coefficients; and a selection unit that compares the pseudo high frequency subband power calculated by the pseudo high frequency subband power calculation unit and the high frequency subband power and selects any one of the plurality of estimation coefficients, wherein the generation unit generates the data to obtain the estimation coefficient selected by the selection unit, when the reuse of the estimation coefficient of the immediately previous frame is disabled.

3. The encoding device according to claim 2, further comprising: a high frequency encoding unit that encodes the data and generates high frequency encoding data, wherein the multiplexing unit multiplexes the high frequency encoding data and the low frequency encoding data and generates the output code string.

4. The encoding device according to claim 3, wherein, when a square sum of differences of the pseudo high frequency subband power and the high frequency subband power of the subbands of the high frequency side is a predetermined threshold value or less, the reuse of the estimation coefficient is enabled.

5. The encoding device according to claim 3, wherein the reuse of the estimation coefficient is enabled according to a comparison result of an evaluation value showing a similarity degree of the pseudo high frequency subband power and the high frequency subband power, which is calculated on the basis of the pseudo high frequency subband power and the high frequency subband power of the subbands of the high frequency side, and a predetermined threshold value.

6. The encoding device according to claim 3, wherein the generation unit generates one data for a processing target section including a plurality of frames of the input signal.

7. The encoding device according to claim 6, wherein information to specify a section including continuous frames in which the same estimation coefficient is selected, in the processing target section, is included in the data.

8. The encoding device according to claim 7, wherein one information to specify the estimation coefficient is included for the section, in the data.

9. An encoding method including: performing band division of an input signal and generating high frequency subband signals of subbands of a high frequency side of the input signal; calculating pseudo high frequency subband power to be an estimation value of high frequency subband power of the high frequency subband signal of a frame of a processing target, on the basis of a feature amount obtained from a low frequency signal of the input signal and an estimation coefficient selected in a frame immediately before the frame of the processing target of the input signal among a plurality of estimation coefficients prepared in advance; when reuse of the estimation coefficient of the immediately previous frame is enabled in the frame of the processing target, on the basis of the pseudo high frequency subband power and the high frequency subband power obtained from the high frequency subband signal, generating data to obtain the reuse enabled estimation coefficient; encoding the low frequency signal and generating low frequency encoding data; and multiplexing the data and the low frequency encoding data and generating an output code string.

10. A non-transitory computer-readable storage device encoded with computer-readable instructions that, when executed by a processing device, perform a process comprising: performing band division of an input signal and generating high frequency subband signals of subbands of a high frequency side of the input signal; calculating pseudo high frequency subband power to be an estimation value of high frequency subband power of the high frequency subband signal of a frame of a processing target, on the basis of a feature amount obtained from a low frequency signal of the input signal and an estimation coefficient selected in a frame immediately before the frame of the processing target of the input signal among a plurality of estimation coefficients prepared in advance; when reuse of the estimation coefficient of the immediately previous frame is enabled in the frame of the processing target, on the basis of the pseudo high frequency subband power and the high frequency subband power obtained from the high frequency subband signal, generating data to obtain the reuse enabled estimation coefficient; encoding the low frequency signal and generating low frequency encoding data; and multiplexing the data and the low frequency encoding data and generating an output code string.

11. A decoding device including: a demultiplexing unit that demultiplexes an input code string into data to obtain an estimation coefficient and low frequency encoding data obtained by encoding a low frequency signal of an input signal, wherein the data to obtain the estimation coefficient is generated according to a determination result whether reuse of the estimation coefficient selected in a frame immediately before the frame of the processing target among a plurality of estimation coefficients prepared in advance is enabled in the frame of the processing target on the basis of an estimation value of high frequency subband power of the frame of the processing target, the estimation value being calculated based on a feature amount of the input signal, the estimation coefficient of the immediately previous frame and the high frequency subband power in the frame of the processing target of the input signal; a low frequency decoding unit that decodes the low frequency encoding data and generates the low frequency signal; a high frequency signal generating unit that generates a high frequency signal, on the basis of the estimation coefficient obtained from the data and the low frequency signal obtained by the decoding; and a synthesis unit that generates an output signal, on the basis of the high frequency signal and the low frequency signal obtained by the decoding.

12. The decoding device according to claim 11, wherein, when it is determined that the reuse of the estimation coefficient of the immediately previous frame is di
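Claims 6 to 8 describe sending one piece of data per multi-frame section, with runs of consecutive frames that share an estimation coefficient identified once. A rough sketch of that grouping follows. The `(index, run_length)` pair representation and the function names `group_frames` and `expand_sections` are assumptions made for illustration; the actual code string layout is not given on this page.

```python
from itertools import groupby
from typing import List, Sequence, Tuple


def group_frames(indices: Sequence[int]) -> List[Tuple[int, int]]:
    """Collapse per-frame coefficient indices into (index, run_length) sections."""
    return [(idx, len(list(run))) for idx, run in groupby(indices)]


def expand_sections(sections: Sequence[Tuple[int, int]]) -> List[int]:
    """Inverse step for a decoder: restore one coefficient index per frame."""
    return [idx for idx, length in sections for _ in range(length)]


# Six frames in one processing target section; the index changes twice.
per_frame = [3, 3, 3, 1, 1, 4]
sections = group_frames(per_frame)
print(sections)
print(expand_sections(sections) == per_frame)
```

The more often the previous frame's coefficient is reused, the longer the runs become and the fewer coefficient identifiers need to be stored for the section.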

Assignees

Inventors

Classifications

  • Subband vocoders · CPC title

  • using band spreading techniques · CPC title

  • G10L19/008 · Primary

    Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title

  • Mode decision, i.e. based on audio signal content versus external parameters · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9390717B2 cover?
The present technology relates to an encoding device and method, a decoding device and method, and a program that enable audio of a high audio quality to be obtained with a smaller code amount. The encoding device multiplexes low frequency encoding data obtained by encoding a low frequency component of an input signal and high frequency encoding data obtained by encoding data including an…
Who is the assignee on this patent?
Yamamoto Yuki, Chinen Toru, Sony Corp
What technology area does this patent fall under?
Primary CPC classification G10L19/0208. Mapped technology areas include Physics.
When was this patent published?
Publication date: Jul 12, 2016 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).