Method for encoding multi-channel signal and encoder

US10643625B2 · US · B2

Patent metadata
Publication number: US-10643625-B2
Application number: US-201916272394-A
Country: US
Kind code: B2
Filing date: Feb 11, 2019
Priority date: Aug 10, 2016
Publication date: May 5, 2020
Grant date: May 5, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for encoding a multi-channel signal and an encoder, where the encoding method includes obtaining a multi-channel signal of a current frame, determining an initial inter-channel time difference (ITD) value of the current frame, controlling, based on characteristic information of the multi-channel signal, a quantity of target frames that are allowed to appear continuously, where the characteristic information includes at least one of a signal-to-noise ratio of the multi-channel signal or a peak feature of cross correlation coefficients of the multi-channel signal, and an ITD value of a previous frame of the target frame is reused as an ITD value of the target frame, determining an ITD value of the current frame based on the initial ITD value and the quantity of target frames allowed to appear continuously, and encoding the multi-channel signal based on the ITD value of the current frame.
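The control flow summarized in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. It assumes that a frame becomes a "target frame" when its initial ITD estimate is judged unreliable, modeled here by the hypothetical sentinel `None`. The names `ItdState`, `resolve_itd`, `count`, and `max_reuse`, the default limit of 5, and the fallback ITD of 0 once reuse is no longer allowed are all illustrative assumptions.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ItdState:
    prev_itd: int = 0   # ITD value of the previous frame
    count: int = 0      # target frames that have appeared continuously so far
    max_reuse: int = 5  # target frames allowed to appear continuously (assumed value)


def resolve_itd(initial_itd: Optional[int], state: ItdState) -> int:
    """Pick the ITD for the current frame.

    initial_itd is None when the initial estimate is considered
    unreliable (an assumption made for this sketch).
    """
    if initial_itd is None and state.count < state.max_reuse:
        # Target frame: reuse the previous frame's ITD value.
        state.count += 1
        itd = state.prev_itd
    else:
        # Reliable estimate, or the reuse budget is exhausted.
        state.count = 0
        itd = initial_itd if initial_itd is not None else 0
    state.prev_itd = itd
    return itd


state = ItdState(max_reuse=2)
itds = [resolve_itd(x, state) for x in [12, None, None, None, 7]]
print(itds)
```

With `max_reuse=2`, the value 12 is carried over for two unreliable frames and then dropped on the third, which is the effect of limiting how many target frames may appear in a row.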

First claim

Opening claim text (preview).

What is claimed is:

1. A method for encoding a multi-channel signal, comprising: obtaining a multi-channel signal of a current frame; determining an initial inter-channel time difference (ITD) value of the current frame; controlling, based on characteristic information of the multi-channel signal, a quantity of target frames allowed to appear continuously, wherein the characteristic information comprises at least one of a signal-to-noise ratio of the multi-channel signal or a peak feature of cross correlation coefficients of the multi-channel signal, and wherein an ITD value of a previous frame of a target frame is reused as an ITD value of the target frame; determining an ITD value of the current frame based on the initial ITD value of the current frame and the quantity of target frames allowed to appear continuously; and encoding the multi-channel signal based on the ITD value of the current frame.

2. The method of claim 1, wherein before controlling the quantity of target frames allowed to appear continuously, the method further comprises determining the peak feature of the cross correlation coefficients of the multi-channel signal based on amplitude of a peak value of the cross correlation coefficients of the multi-channel signal and an index of a peak position of the cross correlation coefficients of the multi-channel signal.

3. The method of claim 2, wherein determining the peak feature of the cross correlation coefficients of the multi-channel signal comprises: determining a peak amplitude confidence parameter based on the amplitude of the peak value of the cross correlation coefficients of the multi-channel signal, wherein the peak amplitude confidence parameter represents a confidence level of the amplitude of the peak value of the cross correlation coefficients of the multi-channel signal; determining a peak position fluctuation parameter based on an ITD value corresponding to the index of the peak position of the cross correlation coefficients of the multi-channel signal and an ITD value of a previous frame of the current frame, wherein the peak position fluctuation parameter represents a difference between the ITD value corresponding to the index of the peak position of the cross correlation coefficients of the multi-channel signal and the ITD value of the previous frame of the current frame; and determining the peak feature of the cross correlation coefficients of the multi-channel signal based on the peak amplitude confidence parameter and the peak position fluctuation parameter.

4. The method of claim 3, wherein determining the peak amplitude confidence parameter comprises determining, as the peak amplitude confidence parameter, a ratio of a difference between an amplitude value of the peak value of the cross correlation coefficients of the multi-channel signal and an amplitude value of a second largest value of the cross correlation coefficients of the multi-channel signal to the amplitude value of the peak value of the cross correlation coefficients of the multi-channel signal.

5. The method of claim 3, wherein determining the peak position fluctuation parameter comprises determining, as the peak position fluctuation parameter, an absolute value of a difference between the ITD value corresponding to the index of the peak position of the cross correlation coefficients of the multi-channel signal and the ITD value of the previous frame of the current frame.

6. The method of claim 1, wherein controlling, the quantity of the target frames allowed to appear continuously comprises: controlling, based on the peak feature of the cross correlation coefficients of the multi-channel signal, the quantity of the target frames allowed to appear continuously; and reducing, by adjusting at least one of a target frame count or a threshold of the target frame count, the quantity of the target frames allowed to appear continuously when the peak feature of the cross correlation coefficients of the multi-channel signal meets a preset condition, wherein the target frame count represents a quantity of target frames that have currently appeared continuously, and wherein the threshold of the target frame count indicates the quantity of the target frames allowed to appear continuously.

7. The method of claim 6, wherein controlling the quantity of the target frames allowed to appear continuously comprises controlling, based on the peak feature of the cross correlation coefficients of the multi-channel signal, the quantity of the target frames allowed to appear continuously only when the signal-to-noise ratio of the multi-channel signal does not meet a preset signal-to-noise ratio condition, and wherein the method further comprises stopping reusing an ITD value of a previous frame of the current frame as the ITD value of the current frame when the signal-to-noise ratio of the multi-channel signal meets the preset signal-to-noise ratio condition.

8. The method of claim 1, wherein controlling the quantity of the target frames allowed to appear continuously comprises: determining whether the signal-to-noise ratio of the multi-channel signal meets a preset signal-to-noise ratio condition; controlling, based on the peak feature of the cross correlation coefficients of the multi-channel signal, the quantity of the target frames allowed to appear continuously when the signal-to-noise ratio of the multi-channel signal does not meet the preset signal-to-noise ratio condition; and stopping reusing an ITD value of a previous frame of the current frame as the ITD value of the current frame when the signal-to-noise ratio of the multi-channel signal meets the preset signal-to-noise ratio condition.

9. The method of claim 8, wherein stopping reusing the ITD value of the previous frame of the current frame as the ITD value of the current frame comprises increasing a target frame count such that a value of the target frame count is greater than or equal to a threshold of the target frame count, wherein the target frame count represents a quantity of target frames that have currently appeared continuously, and wherein the threshold of the target frame count indicates the quantity of the target frames allowed to appear continuously.

10. An encoder, comprising: a memory comprising instructions; and a processor coupled to the memory, wherein the instructions cause the processor to be configured to: obtain a multi-channel signal of a current frame; determine an initial inter-channel time difference (ITD) value of the current frame; control, based on characteristic information of the multi-channel signal, a quantity of target frames allowed to appear continuously, wherein the characteristic information comprises at least one of a signal-to-noise ratio of the multi-channel signal or a peak feature of cross correlation coefficients of the multi-channel signal, and wherein an ITD value of a previous frame of a target frame is reused as an ITD value of the target frame; determine an ITD value of the current frame based on the initial ITD value of the current frame and the quantity of target frames allowed to appear continuously; and encode the multi-channel signal based on the ITD value of the current frame.

11. The encoder of claim 10, wherein the instructions further cause the processor to be configured to determine the peak feature of the cross correlation coefficients of the multi-channel signal based on amplitude of a peak value of the cross correlation coefficients of the multi-channel signal and an index of a peak position of the cross correlation coefficients of the multi-channel signal.

12. The encoder of claim 11, wherein the instructions further cause the processor to be configure
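The quantities named in claims 4, 5, 6, 8, and 9 can be sketched as follows. This is a hedged illustration, not the patent's reference code. The function names, the index-to-ITD mapping (`peak_index - max_lag`), the use of absolute values as "amplitude", the thresholds `conf_thr` and `fluct_thr`, and the specific preset condition (both parameters above their thresholds) are assumptions made for this example.

```python
def peak_amplitude_confidence(xcorr):
    """Claim 4: (peak - second largest) / peak, computed on amplitudes."""
    amps = sorted((abs(c) for c in xcorr), reverse=True)
    peak, second = amps[0], amps[1]
    return (peak - second) / peak if peak > 0 else 0.0


def peak_position_fluctuation(xcorr, max_lag, prev_itd):
    """Claim 5: |ITD at the peak index - ITD of the previous frame|."""
    peak_index = max(range(len(xcorr)), key=lambda i: abs(xcorr[i]))
    itd_at_peak = peak_index - max_lag  # assumed mapping from index to ITD
    return abs(itd_at_peak - prev_itd)


def update_count(count, threshold, snr_ok, conf, fluct,
                 conf_thr=0.5, fluct_thr=4):
    """Adjust the target frame count (claims 6, 8, and 9)."""
    if snr_ok:
        # Claim 9: raise the count to at least the threshold so that
        # the previous ITD is no longer reused.
        return max(count, threshold)
    if conf > conf_thr and fluct > fluct_thr:  # assumed preset condition
        # Claim 6: raising the count leaves fewer target frames
        # allowed to appear continuously.
        return min(count + 1, threshold)
    return count


xcorr = [0.1, 0.2, 1.0, 0.5, 0.1]  # coefficients for lags -2..2
conf = peak_amplitude_confidence(xcorr)
fluct = peak_position_fluctuation(xcorr, max_lag=2, prev_itd=-2)
print(conf, fluct, update_count(1, 5, False, conf, fluct))
```

Adjusting the count rather than the threshold is only one of the two options claim 6 allows; lowering the threshold would shorten the permitted run of target frames in the same way.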

Assignees

  • Huawei Tech Co Ltd

Classifications

  • using subband decomposition · CPC title

  • Application of parametric coding in stereophonic audio systems · CPC title

  • G10L19/008 · Primary

    Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title

  • the extracted parameters being correlation coefficients · CPC title

  • Systems employing more than two channels, e.g. quadraphonic (H04S5/00, H04S7/00 take precedence) · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10643625B2 cover?
A method for encoding a multi-channel signal and an encoder, where the encoding method includes obtaining a multi-channel signal of a current frame, determining an initial inter-channel time difference (ITD) value of the current frame, controlling, based on characteristic information of the multi-channel signal, a quantity of target frames that are allowed to appear continuously, where the char…

Who is the assignee on this patent?
Huawei Tech Co Ltd

What technology area does this patent fall under?
Primary CPC classification G10L19/008. Mapped technology areas include Physics.

When was this patent published?
Publication date May 5, 2020 (kind code B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).