Methods and Apparatuses for Coding Video Data with Secondary Transform
US-2022201335-A1 · Jun 23, 2022 · US
US11700377B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11700377-B2 |
| Application number | US-202117535095-A |
| Country | US |
| Kind code | B2 |
| Filing date | Nov 24, 2021 |
| Priority date | Jun 6, 2019 |
| Publication date | Jul 11, 2023 |
| Grant date | Jul 11, 2023 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
An image decoding method according to the present document is characterized by including: a step for receiving a bitstream including residual information; a step for deriving transform coefficients for a target block on the basis of the residual information; a step for deriving corrected transform coefficients on the basis of an inverse non-separable transform of the transform coefficients; and a step for deriving residual samples for the target block on the basis of an inverse primary transform of the corrected transform coefficients, wherein the inverse non-separable transform is performed when the size of the target block is equal to or smaller than the size of a prescribed maximum transform application block.
Opening claim text (preview).
What is claimed is: 1. An image decoding method performed by a decoding apparatus, the method comprising: receiving a bitstream including residual information; deriving transform coefficients for a target block based on the residual information; deriving modified transform coefficients based on an inverse non-separable transform for the transform coefficients; deriving residual samples for the target block based on an inverse primary transform for the modified transform coefficients; and generating a reconstructed picture based on the residual samples for the target block, wherein the target block is a coding block, and a transform block is partitioned from the coding block, wherein the inverse non-separable transform is not performed when the size of the coding block is greater than the size of a predetermined maximum transform block, wherein the target block includes a luma coding block and a chroma coding block, and wherein based on the size of the luma coding block being equal to or smaller than the size of the maximum transform block and a color format being 4:2:0, the inverse non-separable transform is applied to the chroma coding block when the chroma coding block is less than or equal to ½ of the size of the maximum transform block. 2. The image decoding method of claim 1 , wherein information on the size of the maximum transform block is further received. 3. The image decoding method of claim 1 , wherein whether the inverse non-separable transform is performed is derived by comparing the larger of the width or height of the target block with the width or height of the maximum transform block. 4. The image decoding method of claim 1 , wherein the maximum transform block is 64×64. 5. The image decoding method of claim 1 , wherein when the size of the target block is larger than the size of the predetermined maximum transform block, Low Frequency Non-Separable Transform (LFNST) index indicating a predetermined transform kernel matrix used for the inverse non-separable transform is not derived. 6. An image encoding method performed by an image encoding apparatus, the method comprising: deriving prediction samples for a target block; deriving residual samples for the target block based on the prediction samples; deriving transform coefficients for the target block based on a primary transform for the residual samples; deriving modified transform coefficients from the transform coefficients based on a predetermined transform kernel matrix for a non-separable transform; and encoding quantized residual information and a Low Frequency Non-Separable Transform (LFNST) index indicating the transform kernel matrix, wherein the target block is a coding block, and a transform block is partitioned from the coding block, wherein the non-separable transform is not performed when the size of the coding block is greater than the size of a predetermined maximum transform block, wherein the target block includes a luma coding block and a chroma coding block, and wherein based on the size of the luma coding block being equal to or smaller than the size of the maximum transform block and a color format being 4:2:0, the non-separable transform is applied to the chroma coding block when the chroma coding block is less than or equal to ½ of the size of the maximum transform block. 7. The image encoding method of claim 6 , wherein information on the size of the maximum transform block is further encoded. 8. The image encoding method of claim 6 , wherein whether the non-separable transform is performed is derived by comparing the larger of the width or height of the target block with the width or height of the maximum transform block. 9. The image encoding method of claim 6 , wherein the maximum transform block is 64×64. 10. The image encoding method of claim 6 , wherein based on the size of the target block being larger than the size of the predetermined maximum transform block, the LFNST index is not encoded. 11. A non-transitory computer-readable digital storage medium that stores instructions that when executed by one or more processors cause the one or more processors to perform operations comprising: deriving prediction samples for a target block; deriving residual samples for the target block based on the prediction samples; deriving transform coefficients for the target block based on a primary transform for the residual samples; deriving modified transform coefficients from the transform coefficients based on a predetermined transform kernel matrix for a non-separable transform; and encoding quantized residual information and a Low Frequency Non-Separable Transform (LFNST) index indicating the transform kernel matrix to generate a bitstream, wherein the target block is a coding block, and a transform block is partitioned from the coding block, wherein the non-separable transform is not performed when the size of the coding block is greater than the size of a predetermined maximum transform applied block, wherein the target block includes a luma coding block and a chroma coding block, and wherein based on the size of the luma coding block being equal to or smaller than the size of the maximum transform block and a color format being 4:2:0, the non-separable transform is applied to the chroma coding block when the chroma coding block is less than or equal to ½ of the size of the maximum transform block. 12. A transmission method of data for an image, the method comprising: obtaining a bitstream for the image; wherein the bitstream is generated based on: deriving prediction samples for a target block, deriving residual samples for the target block based on the prediction samples, deriving transform coefficients for the target block based on a primary transform for the residual samples, deriving modified transform coefficients from the transform coefficients based on a predetermined transform kernel matrix for a non-separable transform, encoding quantized residual information and a Low Frequency Non-Separable Transform (LFNST) index indicating the transform kernel matrix to generate the bitstream, and transmitting the data comprising the bitstream, wherein the target block is a coding block, and a transform block is partitioned from the coding block, wherein the non-separable transform is not performed when the size of the coding block is greater than the size of a predetermined maximum transform applied block, wherein the target block includes a luma coding block and a chroma coding block, and wherein based on the size of the luma coding block being equal to or smaller than the size of the maximum transform block and a color format being 4:2:0, the non-separable transform is applied to the chroma coding block when the chroma coding block is less than or equal to ½ of the size of the maximum transform block.
the unit being a set of transform coefficients · CPC title
the region being a block, e.g. a macroblock · CPC title
Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking · CPC title
the unit being a colour or a chrominance component · CPC title
Motion inside a coding unit, e.g. average field, frame or block difference · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.