Bit allocation method and apparatus for audio signal

US11900950B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11900950-B2
Application numberUS-202217976474-A
CountryUS
Kind codeB2
Filing dateOct 28, 2022
Priority dateApr 30, 2020
Publication dateFeb 13, 2024
Grant dateFeb 13, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure provides a bit allocation method and apparatus for an audio signal. The bit allocation method for an audio signal includes: obtaining T audio signals in a current frame, where T is a positive integer; determining a first audio signal set based on the T audio signals, where the first audio signal set includes M audio signals, M is a positive integer, T≥M; determining M priorities of the M audio signals in the first audio signal set; and performing bit allocation on the M audio signals based on the M priorities of the M audio signals.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of bit allocation for an audio signal, comprising: obtaining T audio signals in a current frame, wherein T is a positive integer; obtaining S groups of metadata in the current frame, wherein S is a positive integer, T≥S, the S groups of metadata correspond to the T audio signals, and the metadata describes a status of a corresponding audio signal in a spatial scene; determining a first audio signal set based on the T audio signals, wherein the first audio signal set comprises M audio signals, M is a positive integer, and T≥M; determining M priorities of the M audio signals in the first audio signal set; and performing bit allocation on the M audio signals based on the M priorities of the M audio signals; wherein determining the M priorities of the M audio signals in the first audio signal set comprises: obtaining a scene grading parameter of each of the M audio signals; and determining the M priorities of the M audio signals based on the scene grading parameter of each of the M audio signals. 2. The method according to claim 1 , wherein obtaining the scene grading parameter of each of the M audio signals comprises: obtaining one or more of: a movement grading parameter, a loudness grading parameter, a spread grading parameter, a diffuseness grading parameter, a status grading parameter, a priority grading parameter, or a signal grading parameter of a first audio signal, wherein the first audio signal is one of the M audio signals; and obtaining a scene grading parameter of the first audio signal based on the obtained one or more of: the movement grading parameter, the loudness grading parameter, the spread grading parameter, the diffuseness grading parameter, the status grading parameter, the priority grading parameter, or the signal grading parameter, wherein the movement grading parameter describes a movement speed of the first audio signal in a unit time in a spatial scene, the loudness grading parameter describes loudness of the first audio signal in the spatial scene, the spread grading parameter describes a spread range of the first audio signal in the spatial scene, the diffuseness grading parameter describes a diffuseness range of the first audio signal in the spatial scene, the status grading parameter describes sound source divergence of the first audio signal in the spatial scene, the priority grading parameter describes a priority of the first audio signal in the spatial scene, and the signal grading parameter describes energy of the first audio signal in an encoding process. 3. The method according to claim 1 , wherein obtaining the scene grading parameter of each of the M audio signals comprises: obtaining one or more of: a movement grading parameter, a loudness grading parameter, a spread grading parameter, a diffuseness grading parameter, a status grading parameter, a priority grading parameter, or a signal grading parameter of a first audio signal based on metadata corresponding to the first audio signal or based on the first audio signal and the metadata corresponding to the first audio signal, wherein the first audio signal is one of the M audio signals; and obtaining a scene grading parameter of the first audio signal based on the obtained one or more of: the movement grading parameter, the loudness grading parameter, the spread grading parameter, the diffuseness grading parameter, the status grading parameter, the priority grading parameter, or the signal grading parameter, wherein the movement grading parameter describes a movement speed of the first audio signal in a unit time in the spatial scene, the loudness grading parameter describes loudness of the first audio signal in the spatial scene, the spread grading parameter describes a spread range of the first audio signal in the spatial scene, the diffuseness grading parameter describes a diffuseness range of the first audio signal in the spatial scene, the status grading parameter describes sound source divergence of the first audio signal in the spatial scene, the priority grading parameter describes a priority of the first audio signal in the spatial scene, and the signal grading parameter describes energy of the first audio signal in an encoding process. 4. The method according to claim 2 , wherein obtaining the scene grading parameter of the first audio signal comprises: performing weighed averaging on the obtained one or more of: the movement grading parameter, the loudness grading parameter, the spread grading parameter, the diffuseness grading parameter, the status grading parameter, the priority grading parameter, or the signal grading parameter, to obtain the scene grading parameter of the first audio signal; or performing averaging on the obtained one or more of: the movement grading parameter, the loudness grading parameter, the spread grading parameter, the diffuseness grading parameter, the status grading parameter, the priority grading parameter, or the signal grading parameter to obtain the scene grading parameter of the first audio signal; or using the obtained one or more of: the movement grading parameter, the loudness grading parameter, the spread grading parameter, the diffuseness grading parameter, the status grading parameter, the priority grading parameter, or the signal grading parameter as the scene grading parameter of the first audio signal. 5. The method according to claim 1 , wherein determining the M priorities of the M audio signals based on the scene grading parameter of each of the M audio signals comprises: determining a priority corresponding to the scene grading parameter of a first audio signal as a priority of the first audio signal based on a specified first correspondence, wherein the specified first correspondence comprises correspondences between a plurality of scene grading parameters and a plurality of priorities, one or more scene grading parameters correspond to a priority, and the first audio signal is one of the M audio signals; using the scene grading parameter of the first audio signal as a priority of the first audio signal; or determining a range of the scene grading parameter of the first audio signal based on a plurality of specified range thresholds, and determining a priority corresponding to the range of the scene grading parameter of the first audio signal as a priority of the first audio signal. 6. The method according to claim 1 , wherein performing the bit allocation on the M audio signals comprises: performing bit allocation based on a currently available bit quantity and the M priorities of the M audio signals, wherein a higher quantity of bits is allocated to an audio signal with a higher priority. 7. The method according to claim 6 , wherein performing the bit allocation based on the currently available bit quantity and the M priorities of the M audio signals comprises: determining a bit quantity ratio of a first audio signal based on a priority of the first audio signal, wherein the first audio signal is one of the M audio signals; and obtaining a bit quantity of the first audio signal based on a product of the currently available bit quantity and the bit quantity ratio of the first audio signal; or determining a bit quantity of the first audio signal from a specified second correspondence based on the priority of the first audio signal, wherein the specified second correspondence comprises correspondences between a plurality of priorities and a plurality of bit quantities, one or more priorities correspond to a bit quantity, and the first audio signal is one of the M audio signals. 8. The method according to claim 1 , wherein determining the first audio signal set comprises: adding a pre-specified audio signal of the T audio signals to the first audio signal set

Assignees

Inventors

Classifications

  • G10L19/002Primary

    Dynamic bit allocation (for perceptual audio coders G10L19/032) · CPC title

  • G10L19/008Primary

    Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title

  • using orthogonal transformation · CPC title

  • Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11900950B2 cover?
The present disclosure provides a bit allocation method and apparatus for an audio signal. The bit allocation method for an audio signal includes: obtaining T audio signals in a current frame, where T is a positive integer; determining a first audio signal set based on the T audio signals, where the first audio signal set includes M audio signals, M is a positive integer, T≥M; determining M pri…
Who is the assignee on this patent?
Huawei Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G10L19/002. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 13 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).