Multi-stream audio coding
US-2019103118-A1 · Apr 4, 2019 · US
US11900950B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11900950-B2 |
| Application number | US-202217976474-A |
| Country | US |
| Kind code | B2 |
| Filing date | Oct 28, 2022 |
| Priority date | Apr 30, 2020 |
| Publication date | Feb 13, 2024 |
| Grant date | Feb 13, 2024 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
The present disclosure provides a bit allocation method and apparatus for an audio signal. The bit allocation method for an audio signal includes: obtaining T audio signals in a current frame, where T is a positive integer; determining a first audio signal set based on the T audio signals, where the first audio signal set includes M audio signals, M is a positive integer, T≥M; determining M priorities of the M audio signals in the first audio signal set; and performing bit allocation on the M audio signals based on the M priorities of the M audio signals.
Opening claim text (preview).
What is claimed is: 1. A method of bit allocation for an audio signal, comprising: obtaining T audio signals in a current frame, wherein T is a positive integer; obtaining S groups of metadata in the current frame, wherein S is a positive integer, T≥S, the S groups of metadata correspond to the T audio signals, and the metadata describes a status of a corresponding audio signal in a spatial scene; determining a first audio signal set based on the T audio signals, wherein the first audio signal set comprises M audio signals, M is a positive integer, and T≥M; determining M priorities of the M audio signals in the first audio signal set; and performing bit allocation on the M audio signals based on the M priorities of the M audio signals; wherein determining the M priorities of the M audio signals in the first audio signal set comprises: obtaining a scene grading parameter of each of the M audio signals; and determining the M priorities of the M audio signals based on the scene grading parameter of each of the M audio signals. 2. The method according to claim 1 , wherein obtaining the scene grading parameter of each of the M audio signals comprises: obtaining one or more of: a movement grading parameter, a loudness grading parameter, a spread grading parameter, a diffuseness grading parameter, a status grading parameter, a priority grading parameter, or a signal grading parameter of a first audio signal, wherein the first audio signal is one of the M audio signals; and obtaining a scene grading parameter of the first audio signal based on the obtained one or more of: the movement grading parameter, the loudness grading parameter, the spread grading parameter, the diffuseness grading parameter, the status grading parameter, the priority grading parameter, or the signal grading parameter, wherein the movement grading parameter describes a movement speed of the first audio signal in a unit time in a spatial scene, the loudness grading parameter describes loudness of the first audio signal in the spatial scene, the spread grading parameter describes a spread range of the first audio signal in the spatial scene, the diffuseness grading parameter describes a diffuseness range of the first audio signal in the spatial scene, the status grading parameter describes sound source divergence of the first audio signal in the spatial scene, the priority grading parameter describes a priority of the first audio signal in the spatial scene, and the signal grading parameter describes energy of the first audio signal in an encoding process. 3. The method according to claim 1 , wherein obtaining the scene grading parameter of each of the M audio signals comprises: obtaining one or more of: a movement grading parameter, a loudness grading parameter, a spread grading parameter, a diffuseness grading parameter, a status grading parameter, a priority grading parameter, or a signal grading parameter of a first audio signal based on metadata corresponding to the first audio signal or based on the first audio signal and the metadata corresponding to the first audio signal, wherein the first audio signal is one of the M audio signals; and obtaining a scene grading parameter of the first audio signal based on the obtained one or more of: the movement grading parameter, the loudness grading parameter, the spread grading parameter, the diffuseness grading parameter, the status grading parameter, the priority grading parameter, or the signal grading parameter, wherein the movement grading parameter describes a movement speed of the first audio signal in a unit time in the spatial scene, the loudness grading parameter describes loudness of the first audio signal in the spatial scene, the spread grading parameter describes a spread range of the first audio signal in the spatial scene, the diffuseness grading parameter describes a diffuseness range of the first audio signal in the spatial scene, the status grading parameter describes sound source divergence of the first audio signal in the spatial scene, the priority grading parameter describes a priority of the first audio signal in the spatial scene, and the signal grading parameter describes energy of the first audio signal in an encoding process. 4. The method according to claim 2 , wherein obtaining the scene grading parameter of the first audio signal comprises: performing weighed averaging on the obtained one or more of: the movement grading parameter, the loudness grading parameter, the spread grading parameter, the diffuseness grading parameter, the status grading parameter, the priority grading parameter, or the signal grading parameter, to obtain the scene grading parameter of the first audio signal; or performing averaging on the obtained one or more of: the movement grading parameter, the loudness grading parameter, the spread grading parameter, the diffuseness grading parameter, the status grading parameter, the priority grading parameter, or the signal grading parameter to obtain the scene grading parameter of the first audio signal; or using the obtained one or more of: the movement grading parameter, the loudness grading parameter, the spread grading parameter, the diffuseness grading parameter, the status grading parameter, the priority grading parameter, or the signal grading parameter as the scene grading parameter of the first audio signal. 5. The method according to claim 1 , wherein determining the M priorities of the M audio signals based on the scene grading parameter of each of the M audio signals comprises: determining a priority corresponding to the scene grading parameter of a first audio signal as a priority of the first audio signal based on a specified first correspondence, wherein the specified first correspondence comprises correspondences between a plurality of scene grading parameters and a plurality of priorities, one or more scene grading parameters correspond to a priority, and the first audio signal is one of the M audio signals; using the scene grading parameter of the first audio signal as a priority of the first audio signal; or determining a range of the scene grading parameter of the first audio signal based on a plurality of specified range thresholds, and determining a priority corresponding to the range of the scene grading parameter of the first audio signal as a priority of the first audio signal. 6. The method according to claim 1 , wherein performing the bit allocation on the M audio signals comprises: performing bit allocation based on a currently available bit quantity and the M priorities of the M audio signals, wherein a higher quantity of bits is allocated to an audio signal with a higher priority. 7. The method according to claim 6 , wherein performing the bit allocation based on the currently available bit quantity and the M priorities of the M audio signals comprises: determining a bit quantity ratio of a first audio signal based on a priority of the first audio signal, wherein the first audio signal is one of the M audio signals; and obtaining a bit quantity of the first audio signal based on a product of the currently available bit quantity and the bit quantity ratio of the first audio signal; or determining a bit quantity of the first audio signal from a specified second correspondence based on the priority of the first audio signal, wherein the specified second correspondence comprises correspondences between a plurality of priorities and a plurality of bit quantities, one or more priorities correspond to a bit quantity, and the first audio signal is one of the M audio signals. 8. The method according to claim 1 , wherein determining the first audio signal set comprises: adding a pre-specified audio signal of the T audio signals to the first audio signal set
Dynamic bit allocation (for perceptual audio coders G10L19/032) · CPC title
Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title
using orthogonal transformation · CPC title
Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.