Method for encoding multi-channel signal and encoder

US11217257B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11217257-B2
Application numberUS-202016818612-A
CountryUS
Kind codeB2
Filing dateMar 13, 2020
Priority dateAug 10, 2016
Publication dateJan 4, 2022
Grant dateJan 4, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for encoding a multi-channel signal and an encoder, where the encoding method includes obtaining a multi-channel signal of a current frame, determining an initial inter-channel time difference (ITD) value of the current frame, controlling, based on characteristic information of the multi-channel signal, a quantity of target frames that are allowed to appear continuously, where the characteristic information includes at least one of a signal-to-noise ratio of the multi-channel signal or a peak feature of cross correlation coefficients of the multi-channel signal, and an ITD value of a previous frame of the target frame is reused as an ITD value of the target frame, determining an ITD value of the current frame based on the initial ITD value and the quantity of target frames allowed to appear continuously, and encoding the multi-channel signal based on the ITD value of the current frame.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer program product comprising computer-executable instructions stored on a non-transitory computer-readable medium that, when executed by a processor, cause a device to: obtain a multi-channel signal of a current frame; determine an initial inter-channel time difference (ITD) value of the current frame; control a quantity of target frames allowed to appear continuously based on characteristic information of the multi-channel signal, wherein the characteristic information comprises at least one of a signal-to-noise ratio of the multi-channel signal or a peak feature of cross correlation coefficients of the multi-channel signal, and wherein an ITD value of a previous frame of a target frame is reused as an ITD value of the target frame; determine an ITD value of the current frame based on the initial ITD value of the current frame and the quantity of target frames allowed to appear continuously; and encode the multi-channel signal based on the ITD value of the current frame. 2. The computer program product of claim 1 , wherein before the computer-executable instructions cause the device to control the quantity of target frames allowed to appear continuously, the computer-executable instructions further cause the device to determine the peak feature of the cross correlation coefficients based on an amplitude of a peak value of the cross correlation coefficients and an index of a peak position of the cross correlation coefficients. 3. The computer program product of claim 2 , wherein the computer-executable instructions further cause the device to: determine a peak amplitude confidence parameter based on the amplitude of the peak value, wherein the peak amplitude confidence parameter represents a confidence level of the amplitude of the peak value; determining a peak position fluctuation parameter based on an ITD value corresponding to the index of the peak position of the cross correlation coefficients and an ITD value of a previous frame of the current frame, wherein the peak position fluctuation parameter represents a difference between the ITD value corresponding to the index of the peak position and the ITD value of the previous frame; and determining the peak feature of the cross correlation coefficients based on the peak amplitude confidence parameter and the peak position fluctuation parameter. 4. The computer program product of claim 3 , wherein the computer-executable instructions further cause the device to determine, as the peak amplitude confidence parameter, a ratio of a difference between an amplitude value of the peak value of the cross correlation coefficients and an amplitude value of a second largest value of the cross correlation coefficients to the amplitude value of the peak value of the cross correlation coefficients. 5. The computer program product of claim 3 , wherein the computer-executable instructions further cause the device to determine, as the peak position fluctuation parameter, an absolute value of a difference between the ITD value corresponding to the index of the peak position of the cross correlation coefficients and the ITD value of the previous frame. 6. The computer program product of claim 1 , wherein the computer-executable instructions further cause the device to: control the quantity of the target frames allowed to appear continuously based on the peak feature of the cross correlation coefficients; and adjust at least one of a target frame count or a threshold of the target frame count to reduce the quantity of the target frames allowed to appear continuously when the peak feature of the cross correlation coefficients meets a preset condition, wherein the target frame count represents a quantity of target frames that have currently appeared continuously, and wherein the threshold of the target frame count indicates the quantity of the target frames allowed to appear continuously. 7. The computer program product of claim 6 , wherein the computer-executable instructions further cause the device to: control the quantity of the target frames allowed to appear continuously based on the peak feature of the cross correlation coefficients only when the signal-to-noise ratio of the multi-channel signal does not meet a preset signal-to-noise ratio condition, and stop reusing an ITD value of a previous frame as the ITD value of the current frame when the signal-to-noise ratio of the multi-channel signal meets the preset signal-to-noise ratio condition. 8. The computer program product of claim 1 , wherein the computer-executable instructions further cause the device to: determine whether the signal-to-noise ratio of the multi-channel signal meets a preset signal-to-noise ratio condition; control the quantity of the target frames allowed to appear continuously when the signal-to-noise ratio of the multi-channel signal does not meet the preset signal-to-noise ratio condition based on the peak feature of the cross correlation coefficients; and stop reusing an ITD value of a previous frame as the ITD value of the current frame when the signal-to-noise ratio of the multi-channel signal meets the preset signal-to-noise ratio condition. 9. The computer program product of claim 8 , wherein the computer-executable instructions further cause the device to increase a target frame count such that a value of the target frame count is greater than or equal to a threshold of the target frame count, wherein the target frame count represents a quantity of target frames that have currently appeared continuously, and wherein the threshold of the target frame count indicates the quantity of the target frames allowed to appear continuously. 10. An encoder, comprising: an obtaining circuit, configured to obtain a multi-channel signal of a current frame; a first determining circuit, configured to determine an initial inter-channel time difference (ITD) value of the current frame; a control circuit, configured to control a quantity of target frames allowed to appear continuously based on characteristic information of the multi-channel signal, wherein the characteristic information comprises at least one of a signal-to-noise ratio of the multi-channel signal or a peak feature of cross correlation coefficients of the multi-channel signal, and wherein an ITD value of a previous frame of a target frame is reused as an ITD value of the target frame; a second determining circuit, configured to determine an ITD value of the current frame based on the initial ITD value of the current frame and the quantity of target frames that are allowed to appear continuously; and an encoding circuit, configured to encode the multi-channel signal based on the ITD value of the current frame. 11. The encoder according to claim 10 , wherein the encoder further comprises a third determining circuit, configured to determine the peak feature of the cross correlation coefficients based on an amplitude of a peak value of the cross correlation coefficients and an index of a peak position of the cross correlation coefficients signal. 12. The encoder according to claim 11 , wherein the third determining circuit is further configured to: determine a peak amplitude confidence parameter based on the amplitude of the peak value, wherein the peak amplitude confidence parameter represents a confidence level of the amplitude of the peak value; determine a peak position fluctuation parameter based on an ITD value corresponding to the index of the peak position of the cross correlation coefficients, and an ITD value of a previous frame of the current frame, wherein the peak position fluctuation parameter represents a difference between the ITD value corresponding to t

Assignees

Inventors

Classifications

  • Application of parametric coding in stereophonic audio systems · CPC title

  • Systems employing more than two channels, e.g. quadraphonic (H04S5/00, H04S7/00 take precedence) · CPC title

  • the extracted parameters being correlation coefficients · CPC title

  • using subband decomposition · CPC title

  • G10L19/008Primary

    Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11217257B2 cover?
A method for encoding a multi-channel signal and an encoder, where the encoding method includes obtaining a multi-channel signal of a current frame, determining an initial inter-channel time difference (ITD) value of the current frame, controlling, based on characteristic information of the multi-channel signal, a quantity of target frames that are allowed to appear continuously, where the char…
Who is the assignee on this patent?
Huawei Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G10L19/008. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 04 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).