Decoding method and apparatus, and computer readable storage medium

US12424230B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12424230-B2
Application numberUS-202218546387-A
CountryUS
Kind codeB2
Filing dateJan 4, 2022
Priority dateMar 2, 2021
Publication dateSep 23, 2025
Grant dateSep 23, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure relates to a decoding method, apparatus and computer-readable storage medium, which relates to the field of computer technology. The method of the present disclosure includes buffering one or more stream segments of a data stream which are received, wherein the data stream comprises an audio stream; parsing the one or more stream segments buffered until header information is obtained through the parsing; storing the header information; and decoding stream segments of the audio stream among various stream segments received according to the header information until the audio stream is completely decoded.

First claim

Opening claim text (preview).

What is claimed is: 1. A decoding method, comprising: buffering one or more stream segments of a data stream which are received, wherein the data stream comprises an audio stream; parsing the one or more stream segments buffered until header information is obtained through the parsing, comprising: determining whether a total data length of all stream segments currently buffered reaches a preset frame length, parsing data of the preset frame length from a starting position in the stream segments currently buffered, in a case where the total data length of all the stream segments currently buffered reaches the preset frame length, determining whether the header information is successfully parsed, updating the preset frame length by increasing the preset frame length by a preset value, in a case where the header information is not successfully parsed, and repeating the above until the header information is parsed, wherein not every stream segment contains the header information; storing the header information; and decoding stream segments of the audio stream among various stream segments received according to the header information until the audio stream is completely decoded. 2. The decoding method according to claim 1 , wherein the parsing the one or more stream segments buffered until header information is obtained through the parsing further comprises: in a case where the total data length of all the stream segments currently buffered does not reach the preset frame length, after receiving and buffering a next stream segment, re-determining whether the total data length of all the stream segments currently buffered reaches the preset frame length. 3. The decoding method according to claim 1 , wherein the decoding stream segments of the audio stream among various stream segments received according to the header information comprises: determining a length of an audio frame according to the header information; and decoding the stream segments of the audio stream among the various stream segments received by distinguishing different audio frames according to the length of the audio frame. 4. The decoding method according to claim 3 , wherein the decoding the stream segments of the audio stream among various stream segments received by distinguishing different audio frames according to the length of the audio frame comprises: dividing a current stream segment of the audio stream according to a sequence defined by a data encapsulation format and according to the audio frame length to obtain one or more complete audio frames; decoding the one or more complete audio frames; determining whether data at an end of the current stream segment of the audio stream belongs to an incomplete audio frame; in a case where the data at the end of the current stream segment of the audio stream belongs to the incomplete audio frame, buffering the incomplete audio frame; after receiving a next stream segment of the current stream segment of the audio frame, splicing the next stream segment with the incomplete audio frame to obtain a spliced stream segment; taking the spliced stream segment as the current stream segment of the audio stream; and repeating the above until a last stream segment of the audio stream is completely decoded. 5. The decoding method according to claim 1 , wherein the decoding stream segments of the audio stream among various stream segments received according to the header information until the audio stream is completely decoded comprises: parsing a current stream segment or parsing the current stream segment and stream segments following the current stream segment in a case where a failure is occurred in the decoding of the current stream segment of the audio stream based on the header information, until new header information is obtained through the parsing; and decoding the stream segments following the current stream segment according to the new header information until the audio stream is completely decoded. 6. The decoding method according to claim 1 , wherein the parsing the one or more stream segments buffered until header information is obtained through the parsing comprises: parsing the one or more stream segments buffered by calling an Open avformat method in FFmpeg until the header information is obtained through the parsing. 7. The decoding method according to claim 1 , wherein the decoding stream segments of the audio stream among various stream segments received according to the header information comprises: determining whether the data stream comprises a data stream other than the audio stream according to the header information; in a case where the data stream comprises the data stream other than the audio stream, separating the data stream other than the audio stream from the audio stream; determining format information of the audio stream according to the header information; transcoding the stream segments of the audio stream into an original audio stream according to the format information of the audio stream; and re-sampling the original audio stream at a preset bit rate. 8. The decoding method according to claim 7 , wherein: the data stream other than the audio stream is separated from the audio stream by calling a Separate stream method in FFmpeg; the format information of the audio stream is determined according to the header information, the stream segments of the audio stream are transcoded into the original audio stream according to the format information of the audio stream, and the original audio stream is re-sampled at the preset bit rate, by calling a Parse format method in FFmpeg. 9. A decoding apparatus, comprising: a processor; and a memory coupled to the processor for storing instructions, which when executed by the processor, cause the processor to: buffer one or more stream segments of a data stream which are received, wherein the data stream comprises an audio stream; parse the one or more stream segments buffered until header information is obtained through the parsing, comprising: determining whether a total data length of all stream segments currently buffered reaches a preset frame length, parsing data of the preset frame length from a starting position in the stream segments currently buffered, in a case where the total data length of all the stream segments currently buffered reaches the preset frame length, determining whether the header information is successfully parsed, updating the preset frame length by increasing the preset frame length by a preset value, in a case where the header information is not successfully parsed, and repeating the above until the header information is parsed, wherein not every stream segment contains the header information; store the header information; and decode stream segments of the audio stream among various stream segments received according to the header information until the audio stream is completely decoded. 10. A non-transitory computer-readable storage medium having a computer program stored thereon, wherein the program when executed by a processor, cause the processer to: buffer one or more stream segments of a data stream which are received, wherein the data stream comprises an audio stream; parse the one or more stream segments buffered until header information is obtained through the parsing, comprising: determining whether a total data length of all stream segments currently buffered reaches a preset frame length, parsing data of the preset frame length from a starting position in the stream segments currently buffered, in a case where the total data length of all the stream segments currently buffered reaches the preset frame length, determining whether the header information is successful

Assignees

Inventors

Classifications

  • Feedback of the input speech · CPC title

  • Segmentation; Word boundary detection · CPC title

  • Procedures used during a speech recognition process, e.g. man-machine dialogue · CPC title

  • G10L19/167Primary

    Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12424230B2 cover?
The present disclosure relates to a decoding method, apparatus and computer-readable storage medium, which relates to the field of computer technology. The method of the present disclosure includes buffering one or more stream segments of a data stream which are received, wherein the data stream comprises an audio stream; parsing the one or more stream segments buffered until header information…
Who is the assignee on this patent?
Beijing Wodong Tianjun Information Technology Co Ltd, Beijing Jingdong Century Trading Co Ltd
What technology area does this patent fall under?
Primary CPC classification G10L19/167. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 23 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).