Method and apparatus for generating and processing media file
US-2024048768-A1 · Feb 8, 2024 · US
US9936266B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9936266-B2 |
| Application number | US-201414274056-A |
| Country | US |
| Kind code | B2 |
| Filing date | May 9, 2014 |
| Priority date | May 17, 2013 |
| Publication date | Apr 3, 2018 |
| Grant date | Apr 3, 2018 |
Official abstract text for this publication.
A video encoding method and apparatus are provided, in which the method comprises the steps of obtaining respective original picture data in sequence; generating respective multi-media audio and video files in sequence according to the obtained original picture data; parsing each multi-media audio and video file, encapsulating the result of parsing according to a predetermined standard to obtain encoded picture data corresponding to each multi-media audio and video file and conforming to the predetermined standard, and outputting the encoded picture data. The solutions in the present disclosure may meet the need of stream-typed video traffic for small granularity encoded picture data in frame level.
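The parsing step in the abstract can be illustrated with a minimal sketch. It assumes each generated file is an MP4 / ISO base media container, which the claims' reference to an "mdat box" and the CPC titles suggest but the abstract does not state. The helper names `iter_boxes`, `find_mdat`, and `make_box` are hypothetical and do not come from the patent.

```python
import struct
from typing import Iterator, Tuple


def iter_boxes(data: bytes) -> Iterator[Tuple[str, bytes]]:
    """Yield (type, payload) for each top-level box in an ISO BMFF byte string."""
    offset = 0
    while offset + 8 <= len(data):
        # Every box starts with a 32-bit big-endian size and a 4-character type.
        (size,) = struct.unpack_from(">I", data, offset)
        box_type = data[offset + 4:offset + 8].decode("latin-1")
        header = 8
        if size == 1:
            # A size of 1 means a 64-bit "largesize" follows the type field.
            if offset + 16 > len(data):
                raise ValueError("truncated largesize field")
            (size,) = struct.unpack_from(">Q", data, offset + 8)
            header = 16
        elif size == 0:
            # A size of 0 means the box runs to the end of the data.
            size = len(data) - offset
        if size < header or offset + size > len(data):
            raise ValueError("malformed box at offset %d" % offset)
        yield box_type, data[offset + header:offset + size]
        offset += size


def find_mdat(data: bytes) -> bytes:
    """Return the payload of the first top-level 'mdat' box."""
    for box_type, payload in iter_boxes(data):
        if box_type == "mdat":
            return payload
    raise ValueError("no mdat box found")


def make_box(box_type: bytes, payload: bytes) -> bytes:
    """Build a simple box with a 32-bit size (used here only to make test data)."""
    return struct.pack(">I", 8 + len(payload)) + box_type + payload
```

Calling `find_mdat` on one generated file returns the raw media payload, which is the input to the re-encapsulation step described in the claims.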
Opening claim text (preview).
What is claimed is:

1. A video encoding method, comprising: obtaining, by a picture obtaining unit using a processor, respective original picture data in sequence, each original picture data including a first number of original picture frames; generating, by a file generating unit using a processor, respective multi-media audio and video files in sequence according to the obtained original picture data, wherein each audio file or each video file is obtained by compressing the first number of original picture frames according to the first encoding mode; parsing, by a file parsing unit using a processor, each multi-media audio and video file according to the first encoding mode to obtain a first number of data segments, encapsulating the first number of data segments according to the second encoding mode to obtain the first number of encoded picture data corresponding to each multi-media audio and video file and conforming to the second encoding mode, wherein the first encoding mode and the second encoding mode are different, wherein the parsing and encapsulating generated multi-media audio and video file are performed by: when the 1st to (Q−1)th multi-media audio and video files have been generated, buffering them rather than parsing and encapsulating them, where Q is a positive integer which is larger than 1, and when the Qth multi-media audio and video file has been generated, buffering it, and starting to parse and encapsulate the buffered Q multi-media audio and video files from the 1st multi-media audio and video file, and outputting the first number of encoded picture data for a video decoding apparatus to reproduce the first number of encoded picture data; wherein the step of parsing each multi-media audio and video file according to the first encoding mode to obtain a first number of data segments, encapsulating the first number of data segments according to the second encoding mode to obtain encoded picture data corresponding to each multi-media audio and video file and conforming to the second encoding mode comprising: parsing each multi-media audio and video file according to the first encoding mode to obtain mdat box data including a first number of data segments; modifying the values of the first four bytes of the data segments in the mdat box data according to the second encoding mode, and taking each modified data segment as each encoded picture data conforming to the second encoding mode.

2. The method of claim 1, further comprising: for each generated multi-media audio and video file, before outputting the first encoded picture data conforming to the second encoding mode corresponding to the multi-media audio and video file, generating sequence parameter set network abstraction layer data and picture parameter set network abstraction layer data corresponding to the multi-media audio and video file respectively, and outputting the generated sequence parameter set network abstraction layer data and picture parameter set network abstraction layer data together with the first encoded picture data conforming to the second encoding mode.

3. The method of claim 2, the step of generating sequence parameter set network abstraction layer data and picture parameter set network abstraction layer data corresponding to the multi-media audio and video file respectively comprising: generating the sequence parameter set network abstraction layer data and picture parameter set network abstraction layer data corresponding to the multi-media audio and video file according to the network abstraction layer data structure defined in the second encoding mode.

4. The method of claim 1, further comprising: initiating N threads when the first frame of original picture data is obtained, where N is a positive integer which is larger than 1; determining whether there is a free thread each time the condition for generating a new multi-media audio and video file is met, and selecting a free thread to generate the multi-media audio and video file if there is, otherwise waiting for a free thread to generate the multi-media audio and video file.

5. The method of claim 1, further comprising: initiating P threads when the first multi-media audio and video file is generated, where P is a positive integer which is larger than 1; determining whether there is a free thread each time a new multi-media audio and video file needs to be parsed and encapsulated, and selecting a free thread to parse and encapsulate the multi-media audio and video file if there is, otherwise waiting for a free thread to parse and encapsulate the multi-media audio and video file.

6. The method of claim 1, further comprising: each time one piece of encoded picture data conforming to the second encoding mode is received, decoding the one piece of encoded picture data and performing picture displaying according to the result of decoding.

7. A non-transitory computer-readable storage medium storing program instructions which, when executed on a computer, cause the computer to execute the respective steps of the video encoding method according to claim 1.

8. A video encoding apparatus, comprising a memory and a processor coupled to the memory, wherein the memory storing instructions when being executed by the processor, cause the processor to perform the following operations: obtaining respective original picture data in sequence, each original picture data including a first number of original picture frames; generating respective multi-media audio and video files in sequence according to the obtained each original picture data, wherein each audio file or each video file is obtained by compressing the first number of original picture frames according to the first encoding mode; and parsing each multi-media audio and video file according to the first encoding mode to obtain a first number of data segments, encapsulating the first number of data segments according to the second encoding mode to obtain encoded picture data corresponding to each multi-media audio and video file and conforming to the second encoding mode, wherein the first encoding mode and the second encoding mode are different, wherein the parsing and encapsulating generated multi-media audio and video file are performed by: when the 1st to (Q−1)th multi-media audio and video files have been generated, buffer them rather than parsing and encapsulating them, where Q is a positive integer that is larger than 1, and when the Qth multi-media audio and video file has been generated, buffer it, and start to parse and encapsulate the buffered Q multi-media audio and video files from the 1st multi-media audio and video file, and outputting the first number of encoded picture data for a video decoding apparatus to reproduce the first number of encoded picture data; wherein the step of parsing each multi-media audio and video file according to the first encoding mode to obtain a first number of data segments, encapsulating the first number of data segments according to the second encoding mode to obtain encoded picture data corresponding to each multi-media audio and video file and conforming to the predetermined standard comprising: parsing each multi-media audio and video file according to the first encoding mode to obtain mdat box data including a first number of data segments; modifying the values of the first four bytes of the data segments in the mdat box data according to the requirement of the second encoding mode, and taking each modified data segment as each encoded picture data conforming to the second encoding mode.

9. The apparatus of claim 8, wherein the instructions when being executed by the processor further perform the following operations: for each generated multi-media audio and
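The "modifying the values of the first four bytes" step in claims 1 and 8 can be sketched as follows. This is an illustration under assumptions the claims do not spell out: that each data segment in the mdat box carries a four-byte big-endian length prefix (the usual layout for AVC samples in MP4), that the second encoding mode is an H.264 byte stream whose units begin with the start code `00 00 00 01`, and that each segment holds exactly one frame. The function name `segments_to_start_code` is hypothetical.

```python
import struct
from typing import List

# Assumed target prefix: the four-byte start code of an H.264 byte stream.
START_CODE = b"\x00\x00\x00\x01"


def segments_to_start_code(mdat_payload: bytes) -> List[bytes]:
    """Split a length-prefixed mdat payload into segments and replace each
    segment's four-byte length field with START_CODE."""
    units = []
    offset = 0
    while offset < len(mdat_payload):
        if offset + 4 > len(mdat_payload):
            raise ValueError("truncated length field")
        # The first four bytes of a segment give the size of the data after them.
        (length,) = struct.unpack_from(">I", mdat_payload, offset)
        start = offset + 4
        end = start + length
        if length == 0 or end > len(mdat_payload):
            raise ValueError("invalid segment length at offset %d" % offset)
        # Same payload bytes, with the length prefix swapped for the start code.
        units.append(START_CODE + mdat_payload[start:end])
        offset = end
    return units
```

Each returned element has the same length as its source segment, because a four-byte prefix is replaced by another four-byte prefix. That is what lets the step be described as modifying bytes in place rather than re-encoding, and each element can be output on its own as frame-level data.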
involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements {(video transcoding H04N19/40; media packet handling at the source H04L65/762)} · CPC title
using Advanced Video Coding [AVC] · CPC title
involving a specific file format, e.g. MP4 format · CPC title