Video Coding With Guided Machine Learning Restoration
US-2024098280-A1 · Mar 21, 2024 · US
US12034944B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12034944-B2 |
| Application number | US-202117469716-A |
| Country | US |
| Kind code | B2 |
| Filing date | Sep 8, 2021 |
| Priority date | Sep 27, 2019 |
| Publication date | Jul 9, 2024 |
| Grant date | Jul 9, 2024 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
This application discloses a video encoding method and apparatus, a video decoding method and apparatus, an electronic device, and a computer-readable storage medium. The method includes: obtaining a current video frame, the current video frame being divided into a plurality of tile-divided Tile regions; determining a resolution corresponding to each Tile region in the plurality of Tile regions to obtain a plurality of resolutions, the plurality of resolutions including at least two different resolutions; and decoding the Tile region based on the resolution corresponding to the Tile region among the plurality of resolutions, to obtain a decoded video frame.
Opening claim text (preview).
What is claimed is: 1. A video decoding method performed by an electronic device, the method comprising: obtaining a current video frame, the current video frame being divided into a plurality of tile-divided Tile regions; determining a resolution corresponding to each Tile region in the plurality of Tile regions to obtain a plurality of resolutions, comprising at least two different resolutions, wherein: a resolution corresponding to each respective Tile region is determined according to a transmission bandwidth of a video stream including the current video frame from a source to the electronic device, a first resolution is chosen for the respective Tile region when the transmission bandwidth is less than a threshold corresponding to the respective Tile region, a second resolution is chosen for the respective Tile region when the transmission bandwidth is greater than the threshold, and the second resolution is higher than the first resolution; and decoding each Tile region based on the resolution corresponding to the Tile region among the plurality of resolutions, to obtain a decoded video frame. 2. The method according to claim 1 , wherein the determining a resolution corresponding to each Tile region in the plurality of Tile regions comprises: obtaining flag bit data corresponding to each Tile region in the plurality of Tile regions from a syntactic element of the current video frame, the flag bit data being used for identifying the resolution corresponding to the Tile region. 3. The method according to claim 1 , wherein the thresholds corresponding to the plurality of Tile regions are different from each other. 4. A video encoding method performed by an electronic device, the method comprising: performing division processing on a current video frame to obtain a plurality of tile- divided Tile regions comprised in the current video frame; determining a resolution corresponding to each Tile region in the plurality of Tile regions to obtain a plurality of resolutions, comprising at least two different resolutions , wherein: a resolution corresponding to each respective Tile region is determined according to a transmission bandwidth of a video stream including the current video frame from a source to the electronic device, a first resolution is chosen for the respective Tile region when the transmission bandwidth is less than a threshold corresponding to the respective Tile region, a second resolution is chosen for the respective Tile region when the transmission bandwidth is greater than the threshold, and the second resolution is higher than the first resolution; and encoding each Tile region based on the resolution corresponding to the Tile region among the plurality of resolutions, to obtain encoded data of the current video frame. 5. The method according to claim 4 , wherein the determining a resolution corresponding to each Tile region in the plurality of Tile regions comprises: performing the following processing on the Tile region in the plurality of Tile regions: determining, for each resolution in a resolution set, a rate distortion cost of encoding the Tile region; and determining a resolution corresponding to a smallest rate distortion cost as the resolution corresponding to the Tile region. 6. The method according to claim 4 , wherein the determining a resolution corresponding to each Tile region in the plurality of Tile regions comprises: obtaining an inputted resolution determining instruction, the resolution determining instruction being used for indicating the resolution corresponding to the Tile region; and determining the resolution indicated by the resolution determining instruction as the resolution corresponding to the Tile region. 7. The method according to claim 4 , wherein the encoding the Tile region based on the resolution corresponding to the Tile region among the plurality of resolutions comprises: encoding the Tile region based on the resolution corresponding to the Tile region among the plurality of resolutions, to obtain encoded data corresponding to the Tile region; determining flag bit data corresponding to the encoded data corresponding to the Tile region, the flag bit data being used for identifying the resolution corresponding to the Tile region; and adding the flag bit data as a syntactic element to the encoded data corresponding to the Tile region. 8. The method according to claim 4 , wherein the thresholds corresponding to the plurality of Tile regions are different from each other. 9. An electronic device, comprising a memory and a processor, the memory storing computer-readable instructions, and the computer-readable instructions, when executed by the processor, causing electronic device to perform a plurality of operations including: obtaining a current video frame, the current video frame being divided into a plurality of tile-divided Tile regions; determining a resolution corresponding to each Tile region in the plurality of Tile regions to obtain a plurality of resolutions, comprising at least two different resolutions, wherein: a resolution corresponding to each respective Tile region is determined according to a transmission bandwidth of a video stream including the current video frame from a source to the electronic device, a first resolution is chosen for the respective Tile region when the transmission bandwidth is less than a threshold corresponding to the respective Tile region, a second resolution is chosen for the respective Tile region when the transmission bandwidth is greater than the threshold, and the second resolution is higher than the first resolution; and decoding each Tile region based on the resolution corresponding to the Tile region among the plurality of resolutions, to obtain a decoded video frame. 10. The electronic device according to claim 9 , wherein the determining a resolution corresponding to each Tile region in the plurality of Tile regions comprises: obtaining flag bit data corresponding to each Tile region in the plurality of Tile regions from a syntactic element of the current video frame, the flag bit data being used for identifying the resolution corresponding to the Tile region. 11. The electronic device according to claim 9 , wherein the plurality of operations further comprise: performing division processing on a second current video frame to obtain a plurality of tile-divided Tile regions comprised in the second current video frame; determining a resolution corresponding to each Tile region in the plurality of Tile regions to obtain a plurality of resolutions, the plurality of resolutions comprising at least two different resolutions; and encoding the Tile region based on the resolution corresponding to the Tile region among the plurality of resolutions, to obtain encoded data of the second current video frame. 12. The electronic device according to claim 11 , wherein the determining a resolution corresponding to each Tile region in the plurality of Tile regions comprises: performing the following processing on the Tile region in the plurality of Tile regions: determining, for each resolution in a resolution set, a rate distortion cost of encoding the Tile region; and determining a resolution corresponding to a smallest rate distortion cost as the resolution corresponding to the Tile region. 13. The electronic device according to claim 11 , wherein the determining a resolution corresponding to each Tile region in the plurality of Tile regions comprises: obtaining an inputted resolution determining instruction, the resolution determining instruction being used for indicating the resolution corresponding to the Tile
characterised by syntax aspects related to video coding, e.g. related to compression standards · CPC title
Embedding additional information in the video signal during the compression process (H04N19/517, H04N19/68, H04N19/70 take precedence) · CPC title
the region being a block, e.g. a macroblock · CPC title
Position within a video image, e.g. region of interest [ROI] · CPC title
the region being a slice, e.g. a line of blocks or a group of blocks · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.