Audio processing method and terminal

US2024388866A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2024388866-A1
Application numberUS-202418785171-A
CountryUS
Kind codeA1
Filing dateJul 26, 2024
Priority dateJan 28, 2022
Publication dateNov 21, 2024
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

This application disclose an audio processing method and a terminal. The method includes: decoding an audio bitstream to obtain audio optimization metadata, basic audio metadata, and M pieces of decoded audio data, the audio optimization metadata includes first metadata of a first optimized listening area and a first decoding audio mixing parameter corresponding to the first optimized listening area; rendering M pieces of decoded audio data based on a current location of a user and the basic audio metadata, to obtain M pieces of rendered audio data; when the current location is in the first optimized listening area, performing first audio mixing on the M pieces of rendered audio data based on the first decoding audio mixing parameter, to obtain M pieces of first audio mixing data; and mixing the M pieces of first audio mixing data, to obtain mixed audio data corresponding to the first optimized listening area.

First claim

Opening claim text (preview).

What is claimed is: 1 . An audio processing method, comprising: decoding an audio bitstream to obtain audio optimization metadata, basic audio metadata, and M pieces of decoded audio data, wherein the audio optimization metadata comprises first metadata of a first optimized listening area and a first decoding audio mixing parameter corresponding to the first optimized listening area, and M is a positive integer; rendering M pieces of decoded audio data based on a current location of a user and the basic audio metadata, to obtain M pieces of rendered audio data; when the current location is in the first optimized listening area, performing first audio mixing on the M pieces of rendered audio data based on the first decoding audio mixing parameter, to obtain M pieces of first audio mixing data; and mixing the M pieces of first audio mixing data, to obtain mixed audio data corresponding to the first optimized listening area. 2 . The method according to claim 1 , wherein the audio optimization metadata further comprises a second decoding audio mixing parameter corresponding to the first optimized listening area; and the method further comprises: performing second audio mixing on the mixed audio data based on the second decoding audio mixing parameter, to obtain second audio mixing data corresponding to the first optimized listening area. 3 . The method according to claim 2 , wherein the audio optimization metadata further comprises: N−1 difference parameters of N−1 second decoding audio mixing parameters corresponding to N−1 optimized listening areas other than the first optimized listening area in N optimized listening areas with respect to the second decoding audio mixing parameter corresponding to the first optimized listening area, wherein N is a positive integer. 4 . The method according to claim 1 , wherein the method further comprises: decoding a video image bitstream to obtain decoded video image data and video image metadata, wherein the video image metadata comprises video metadata and image metadata; rendering the decoded video image data based on the video image metadata, to obtain rendered video image data; establishing a virtual scene based on the rendered video image data; and identifying the first optimized listening area in the virtual scene based on the rendered video image data and the audio optimization metadata. 5 . The method according to claim 1 , wherein the audio optimization metadata comprises: N−1 difference parameters of N−1 first decoding audio mixing parameters corresponding to the N−1 optimized listening areas other than the first optimized listening area in the N optimized listening areas with respect to the first decoding audio mixing parameter corresponding to the first optimized listening area, wherein N is a positive integer. 6 . The method according to claim 1 , wherein the audio optimization metadata further comprises: the central location coordinates of the first optimized listening area in the N optimized listening areas, and a location offset of central location coordinates of the N−1 optimized listening areas other than the first optimized listening area in the N optimized listening areas with respect to the central location coordinates of the first optimized listening area, wherein N is a positive integer. 7 . An audio processing method, comprising: receiving audio optimization metadata, basic audio metadata, and M pieces of first audio data, wherein the audio optimization metadata comprises first metadata of a first optimized listening area and a first audio mixing parameter corresponding to the first optimized listening area, and M is a positive integer; performing compression encoding on the audio optimization metadata, the basic audio metadata, and the M pieces of first audio data, to obtain an audio bitstream; and sending the audio bitstream. 8 . The method according to claim 7 , wherein the audio optimization metadata further comprises a second audio mixing parameter change identifier, wherein the second audio mixing parameter change identifier indicates whether a second audio mixing parameter corresponding to first audio data of a current frame changes compared with a second audio mixing parameter corresponding to first audio data of a previous frame. 9 . The method according to claim 7 , wherein the audio optimization metadata further comprises the second audio mixing parameter corresponding to the first optimized listening area in N optimized listening areas, and N−1 difference parameters of N−1 second audio mixing parameters corresponding to N−1 optimized listening areas other than the first optimized listening area in the N optimized listening areas with respect to the second audio mixing parameter corresponding to the first optimized listening area, wherein N is a positive integer. 10 . The method according to claim 7 , wherein the audio optimization metadata further comprises: N−1 difference parameters of N−1 first audio mixing parameters corresponding to the N−1 optimized listening areas other than the first optimized listening area in the N optimized listening areas with respect to the first audio mixing parameter corresponding to the first optimized listening area. 11 . The method according to claim 7 , wherein the audio optimization metadata further comprises: central location coordinates of the first optimized listening area in the N optimized listening areas, and a location offset of central location coordinates of the N−1 optimized listening areas other than the first optimized listening area in the N optimized listening areas with respect to the central location coordinates of the first optimized listening area, wherein N is a positive integer. 12 . The method according to claim 7 , wherein the audio optimization metadata further comprises: an optimized listening area change identifier, and/or a first audio mixing parameter change identifier, wherein the optimized listening area change identifier indicates whether the first optimized listening area changes; and the first audio mixing parameter change identifier indicates whether a first audio mixing parameter corresponding to the first audio data of the current frame changes compared with a first audio mixing parameter corresponding to the first audio data of the previous frame. 13 . An audio processing method, comprising: obtaining basic audio metadata and metadata of N optimized listening areas, wherein N is a positive integer, and the N optimized listening areas comprise a first optimized listening area; rendering M pieces of to-be-processed audio data based on the first optimized listening area and the basic audio metadata, to obtain M pieces of rendered audio data corresponding to the first optimized listening area, wherein M is a positive integer; performing first audio mixing on the M pieces of rendered audio data, to obtain M pieces of first audio mixing data and a first audio mixing parameter corresponding to the first optimized listening area; and generating audio optimization metadata based on first metadata of the first optimized listening area and the first audio mixing parameter, wherein the audio optimization metadata comprises the first metadata and the first audio mixing parameter. 14 . The method according to claim 13 , wherein the method further comprises: mixing the M pieces of first audio mixing data, to obtain mixed audio data corresponding to the first optimized listening area; and performing second audio mixing on the mixed audio data, to obtain second audio mixing data corresponding to the first optimized listening area and a second audio mixing parameter corres

Assignees

Inventors

Classifications

  • Application of parametric coding in stereophonic audio systems · CPC title

  • Positioning of individual sound objects, e.g. moving airplane, within a sound field (H04S2420/13 takes precedence) · CPC title

  • G10L19/008Primary

    Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing · CPC title

  • Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder · CPC title

  • Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1 (H04S2400/01 takes precedence) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2024388866A1 cover?
This application disclose an audio processing method and a terminal. The method includes: decoding an audio bitstream to obtain audio optimization metadata, basic audio metadata, and M pieces of decoded audio data, the audio optimization metadata includes first metadata of a first optimized listening area and a first decoding audio mixing parameter corresponding to the first optimized listening…
Who is the assignee on this patent?
Huawei Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G10L19/008. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Nov 21 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).