Method and device for processing video image
US-2021235113-A1 · Jul 29, 2021 · US
US12581087B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12581087-B2 |
| Application number | US-202318520922-A |
| Country | US |
| Kind code | B2 |
| Filing date | Nov 28, 2023 |
| Priority date | May 28, 2021 |
| Publication date | Mar 17, 2026 |
| Grant date | Mar 17, 2026 |
Official abstract text for this publication.
An encoding method includes: a first matching block of a current block is determined; motion compensation enhancement is performed on the first matching block to obtain at least one second matching block; motion information of the current block is determined according to the at least one second matching block; and the current block is encoded according to the motion information.
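The first three steps of the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: all function names are hypothetical, a sum of absolute differences (SAD) stands in for the encoder's real matching cost, and the `enhance` callback is a placeholder for whatever motion compensation enhancement the encoder applies. The final encoding step is omitted.

```python
from typing import Callable, Dict, List, Tuple

Block = List[List[float]]
Offset = Tuple[float, float]


def crop(frame: Block, y: int, x: int, size: int) -> Block:
    """Return the size x size block whose top-left corner is (y, x)."""
    return [row[x:x + size] for row in frame[y:y + size]]


def sad(a: Block, b: Block) -> float:
    """Sum of absolute differences (assumed stand-in for the matching cost)."""
    return sum(abs(p - q) for ra, rb in zip(a, b) for p, q in zip(ra, rb))


def find_first_matching_block(
    cur: Block, ref: Block, y0: int, x0: int, size: int, radius: int
) -> Tuple[Tuple[int, int], Block]:
    """Integer-pixel search around (y0, x0); returns ((dy, dx), block)."""
    best = None
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            y, x = y0 + dy, x0 + dx
            # Skip candidates that fall outside the reference frame.
            if y < 0 or x < 0 or y + size > len(ref) or x + size > len(ref[0]):
                continue
            cand = crop(ref, y, x, size)
            cost = sad(cur, cand)
            if best is None or cost < best[0]:
                best = (cost, (dy, dx), cand)
    return best[1], best[2]


def determine_motion_information(
    cur: Block,
    ref: Block,
    y0: int,
    x0: int,
    size: int,
    radius: int,
    enhance: Callable[[Block], Dict[Offset, Block]],
) -> Tuple[Offset, Block]:
    """Steps 1-3 of the abstract: match, enhance, pick motion information."""
    int_mv, first = find_first_matching_block(cur, ref, y0, x0, size, radius)
    # Hypothetical enhancement: maps a fractional offset to a second matching block.
    second_blocks = enhance(first)
    frac, block = min(second_blocks.items(), key=lambda kv: sad(cur, kv[1]))
    return (int_mv[0] + frac[0], int_mv[1] + frac[1]), block
```

In this sketch the motion information is simply the integer displacement plus the fractional offset of the best second matching block; a real encoder would compare rate distortion costs rather than SAD.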
Opening claim text (preview).
The invention claimed is:

1. An encoding method, applied to an encoder, the method comprising: determining a first matching block of a current block; performing motion compensation enhancement on the first matching block to obtain at least one second matching block; determining motion information of the current block according to the at least one second matching block; and encoding the current block according to the motion information, setting a first syntax element identification information, wherein the first syntax element identification information is for indicating whether to use a motion compensation enhancement processing mode for the current block; and encoding a value of the first syntax element identification information, and signalling encoded bits, wherein the value of the first syntax element identification information is set to be a first value to indicate that the motion compensation enhancement processing mode is used for the current block, and the value of the first syntax element identification information is set to be a second value to indicate that the motion compensation enhancement processing mode is not used for the current block.

2. The method of claim 1, wherein performing the motion compensation enhancement on the first matching block to obtain the at least one second matching block comprises: performing super-resolution and quality enhancement processing on the first matching block to obtain a processed block, wherein a resolution of the processed block is higher than a resolution of the current block; and performing first filtering processing on the processed block to obtain the at least one second matching block, wherein a resolution of the second matching block obtained after the first filtering processing is the same as the resolution of the current block.

3. The method of claim 2, wherein performing the motion compensation enhancement on the first matching block further comprises: performing the motion compensation enhancement on the first matching block by using a preset neural network model, wherein the preset neural network model comprises a feature extraction module, a residual projection module group, a sampling module and a reconstruction module, and the feature extraction module, the residual projection module group, the sampling module and the reconstruction module are connected in sequence, wherein performing the super-resolution and quality enhancement processing on the first matching block to obtain the processed block comprises: performing shallow feature extraction on the first matching block by the feature extraction module to obtain first feature information; performing residual feature learning on the first feature information by the residual projection module group to obtain second feature information; performing second filtering processing on the second feature information by the sampling module to obtain third feature information; and performing super-resolution reconstruction on the third feature information by the reconstruction module to obtain the processed block.

4. The method of claim 1, further comprising: determining a training data set, the training data set comprising at least one training picture; preprocessing the training data set to obtain ground truths of the preset neural network model and at least one input picture group, wherein the input picture group comprises at least one input picture; and training the neural network model based on the ground truths by using the at least one input picture group to obtain at least one group of candidate model parameters, wherein the ground truths are used for determining a loss value of a loss function of the neural network model, and the at least one group of candidate model parameters are obtained when the loss value of the loss function converges to a preset threshold.

5. The method of claim 4, further comprising: determining a quantization parameter of the current block; determining, according to the quantization parameter, a model parameter corresponding to the quantization parameter from the at least one group of candidate model parameters; and determining the preset neural network model according to the model parameter, wherein in response to the at least one group being a plurality of groups, the at least one input picture group corresponds to different quantization parameters, and the plurality of groups of candidate model parameters correspond to different quantization parameters.

6. The method of claim 3, wherein determining the first matching block of the current block comprises: performing integer-pixel motion estimation on the current block to determine the first matching block of the current block, wherein the first matching block is a matching block with a minimum rate distortion cost when motion estimation is performed on the current block at integer pixel positions; wherein performing the motion compensation enhancement on the first matching block to obtain the at least one second matching block comprises: performing fractional pixel motion compensation on the first matching block by using the preset neural network model to obtain the at least one second matching block.

7. The method of claim 6, further comprising: after the at least one second matching block is obtained, performing the sub-pixel motion estimation on the current block according to the at least one second matching block to determine a sub-pixel matching block of the current block, the sub-pixel matching block being a matching block with minimum rate distortion cost when motion estimation is performed on the current block at sub-pixel positions; wherein determining the motion information of the current block according to the at least one second matching block comprises: performing pre-encoding processing on the current block by using the first matching block to determine a first rate distortion cost; performing pre-encoding processing on the current block by using the sub-pixel matching block to determine a second rate distortion cost; in response to the first rate distortion cost being greater than the second rate distortion cost, determining to use the motion compensation enhancement processing mode for the current block, and determining the motion information as first motion information, the first motion information being used for pointing to a sub-pixel position; and in response to the first rate distortion cost being less than or equal to the second rate distortion cost, determining not to use the motion compensation enhancement processing mode for the current block, and determining the motion information as second motion information, the second motion information being used for pointing to an integer pixel position.

8. The method of claim 7, further comprising: in response to the first rate distortion cost being greater than the second rate distortion cost, determining the value of first syntax element identification information to be the first value; and in response to the first rate distortion cost being less than or equal to the second rate distortion cost, determining the value of the first syntax element identification information to be the second value.

9. The method of claim 7, wherein encoding the current block according to the motion information comprises: in response to using the motion compensation enhancement processing mode for the current block, determining a first prediction block of the current block according to the first motion information and the sub-pixel matching block, determining a residual block of the current block according to the current block and the first prediction block, and encoding the residual block, and signalling encoded bits; or, i
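Two of the mechanisms in the claims above lend themselves to a short sketch: the resolution bookkeeping of claim 2 and the mode decision of claims 7 and 8. The code below is only an illustration under assumptions that the claims do not state. In particular, plain polyphase subsampling is used here as one possible form of the "first filtering processing", the flag values 1 and 0 are hypothetical choices for the "first value" and "second value", and all function names are invented for this example.

```python
from typing import Dict, List, Tuple

Block = List[List[float]]

FLAG_ENHANCEMENT_ON = 1   # hypothetical "first value" of the syntax element
FLAG_ENHANCEMENT_OFF = 0  # hypothetical "second value" of the syntax element


def split_into_phases(processed: Block, scale: int) -> Dict[Tuple[int, int], Block]:
    """Split a scale-times upsampled block into scale*scale blocks.

    Each output block has the resolution of the original (current) block.
    Key (py, px) is the sub-pixel phase in units of 1/scale pixel.
    Assumption: simple subsampling stands in for the claimed filtering.
    """
    return {
        (py, px): [row[px::scale] for row in processed[py::scale]]
        for py in range(scale)
        for px in range(scale)
    }


def decide_mode(first_rd_cost: float, second_rd_cost: float) -> Tuple[int, str]:
    """Decision rule of claims 7 and 8.

    first_rd_cost:  cost of pre-encoding with the integer-pixel matching block.
    second_rd_cost: cost of pre-encoding with the sub-pixel matching block.
    Enhancement is chosen only when the integer-pixel cost is strictly greater.
    """
    if first_rd_cost > second_rd_cost:
        return FLAG_ENHANCEMENT_ON, "sub-pixel"
    return FLAG_ENHANCEMENT_OFF, "integer-pixel"
```

Note that a tie between the two costs resolves to the integer-pixel mode, matching the "less than or equal to" wording of claims 7 and 8.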
characterised by syntax aspects related to video coding, e.g. related to compression standards · CPC title
by encoding · CPC title
the region being a block, e.g. a macroblock · CPC title
Filters, e.g. for pre-processing or post-processing (sub-band filter banks H04N19/635) · CPC title
characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation (H04N19/635 takes precedence) · CPC title