Inter-Channel Phase Difference Parameter Extraction Method and Apparatus
US-2022328053-A1 · Oct 13, 2022 · US
US12367885B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12367885-B2 |
| Application number | US-202418417518-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jan 19, 2024 |
| Priority date | May 31, 2016 |
| Publication date | Jul 22, 2025 |
| Grant date | Jul 22, 2025 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
An inter-channel phase difference (IPD) parameter extraction method includes obtaining a parameter for obtaining an information extraction manner for a current frame of a multi-channel signal; obtaining an IPD parameter extraction manner for the current frame based on the parameter for obtaining the information extraction manner, where the obtained IPD parameter extraction manner is one of at least two preset IPD parameter extraction manners; and obtaining an IPD parameter of the current frame based on the obtained IPD parameter extraction manner for the current frame.
Opening claim text (preview).
What is claimed is: 1. A method comprising: obtaining at least one of a current parameter of a current frame of an audio signal or a previous parameter of a previous frame of the audio signal, wherein the audio signal comprises at least two channels, wherein the current parameter comprises at least one of a first parameter representing a first left-right channel coherence of the current frame, a first subband inter-channel phase difference (IPD) variance of the current frame, a first signal class of the current frame, or a first inter-channel time difference (ITD) of the current frame, wherein the previous parameter comprises at least one of a second parameter representing a second left-right channel coherence of the previous frame, a second subband IPD variance of the previous frame, a second ITD of the previous frame, a first IPD parameter extraction manner for the previous frame, or a second signal class of the previous frame, wherein the second signal class is either speech or music, and wherein the first signal class is either the speech or the music; obtaining, based on the at least one of the current parameter or the previous parameter, a second IPD parameter extraction manner for the current frame, wherein the second IPD parameter extraction manner is a third IPD parameter extraction manner or a fourth IPD parameter extraction manner, wherein the third IPD parameter extraction manner comprises one of a first manner for extracting a group IPD parameter of the current frame, a second manner for not extracting the group IPD parameter, or a third manner for setting a first IPD parameter of the current frame to zero, wherein the fourth IPD parameter extraction manner comprises one of a fourth manner for extracting first subband set IPD parameters or a fifth manner for extracting second subband IPD parameters, wherein obtaining the second IPD parameter extraction manner comprises obtaining, when a value of the first parameter is greater than a first threshold, the third IPD parameter extraction manner as the second IPD parameter extraction manner, and wherein the first threshold is 0.75; performing time-to-frequency conversion on a left-channel time-domain signal and a right-channel time-domain signal of the current frame to respectively obtain a left-channel frequency-domain signal and a right-channel frequency-domain signal of the current frame; extracting, based on the second IPD parameter extraction manner, a second IPD parameter of the left-channel frequency-domain signal and the right-channel frequency-domain signal; and encoding the second IPD parameter. 2. The method of claim 1 , wherein the first parameter representing the first left-right channel coherence of the current frame and the second parameter representing the second left-right coherence of the previous frame describe a coherence between channels. 3. The method of claim 1 , wherein the first subband IPD variance of the current frame and the second subband IPD variance of the previous frame represent a horizontal orientation of a sound source. 4. The method of claim 1 , wherein the first ITD of the current frame and the second ITD of the previous frame represent a horizontal orientation of a sound source. 5. The method of claim 1 , wherein the first ITD of the current frame and the second ITD of the previous frame are spatial perception parameters. 6. The method of claim 1 , wherein the first IPD variance of the current frame and the second IPD variance of the previous frame represent a horizontal orientation of a sound source. 7. The method of claim 1 , wherein the first IPD variance of the current frame and the second IPD variance of the previous frame are spatial perception parameters. 8. An apparatus, comprising: a memory configured to store instructions; and a processor coupled to the memory and configured to execute the instructions to cause the apparatus to: obtain at least one of a current parameter of a current frame of an audio signal or a previous parameter of a previous frame of the audio signal, wherein the audio signal comprises at least two channels, wherein the current parameter comprises at least one of a first parameter representing a first left-right channel coherence of the current frame, a first subband inter-channel phase difference (IPD) variance of the current frame, a first signal class of the current frame, or a first inter-channel time difference (ITD) of the current frame, wherein the previous parameter comprises at least one of a second parameter representing a second left-right channel coherence of the previous frame, a second subband IPD variance of the previous frame, a second ITD of the previous frame, a first IPD parameter extraction manner for the previous frame, or a second signal class of the previous frame, wherein the second signal class is either speech or music, and wherein the first signal class is either the speech or the music; obtain, based on the at least one of the current parameter or the previous parameter, a second IPD parameter extraction manner for the current frame, wherein the second IPD parameter extraction manner is a third IPD parameter extraction manner or a fourth IPD parameter extraction manner, wherein the third IPD parameter extraction manner comprises one of a first manner for extracting a group IPD parameter of the current frame, a second manner for not extracting the group IPD parameter, or a third manner for setting a first IPD parameter of the current frame to zero, wherein the fourth IPD parameter extraction manner comprises one of a fourth manner for extracting first subband set IPD parameters or a fifth manner for extracting second subband IPD parameters, wherein obtaining the second IPD parameter extraction manner comprises obtaining, when a value of the first parameter is greater than a first threshold, the third IPD parameter extraction manner as the second IPD parameter extraction manner, and wherein the first threshold is 0.75; perform time-to-frequency conversion on a left-channel time-domain signal and a right-channel time-domain signal of the current frame to respectively obtain a left-channel frequency-domain signal and a right-channel frequency-domain signal of the current frame; extract, based on the second IPD parameter extraction manner, a second IPD parameter of the left-channel frequency-domain signal and the right-channel frequency-domain signal; and encode the second IPD parameter. 9. The apparatus of claim 8 , wherein the first parameter representing the first left-right channel coherence of the current frame and the second parameter representing the second left-right coherence of the previous frame describe a coherence between channels. 10. The apparatus of claim 8 , wherein the first subband IPD variance of the current frame and the second subband IPD variance of the previous frame represent a horizontal orientation of a sound source. 11. The apparatus of claim 8 , wherein the first ITD of the current frame and the second ITD of the previous frame represent a horizontal orientation of a sound source. 12. The apparatus of claim 8 , wherein the first ITD of the current frame and the second ITD of the previous frame are spatial perception parameters. 13. The apparatus of claim 8 , wherein the first IPD variance of the current frame and the second IPD variance of the previous frame represent a horizontal orientation of a sound source. 14. The apparatus of claim 8 , wherein the first IPD variance of the current frame and the second IPD variance of the previous frame are spatial perception parameters. 15. A computer program product comprising instructions stored on a non
characterised by the type of extracted parameters · CPC title
Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.