Method and device for encoding and decoding videos using a best transformation matrix among multiple transformation matrices

US9723313B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9723313-B2
Application numberUS-201213452198-A
CountryUS
Kind codeB2
Filing dateApr 20, 2012
Priority dateOct 23, 2009
Publication dateAug 1, 2017
Grant dateAug 1, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The embodiments of the present invention provide a method and a device for encoding and decoding videos, and relate to the communication field, and an efficient transformation matrix corresponding to features of each residual block is selected for transformation, which therefore improves encoding efficiency. The solution provided in an embodiment of the present invention is: generating a prediction residual according to input video data; selecting a set of best transformation matrixes among multiple candidate transformation matrixes according to an intra-frame prediction mode and rate-distortion criteria to perform transform-coding on the prediction residual and obtain a transformation result; and generating an encoded stream according to the transformation result and selected transformation matrix index information.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for encoding video data, comprising: generating a prediction residual according to input video data; performing transform-coding on the prediction residual using a plurality of candidate transformation matrices and obtaining a transformation result for each of the plurality of candidate transformation matrices; selecting a set of best transformation matrices among the plurality of candidate transformation matrices according to optimization criteria; and encoding a selected transformation matrix index according to the transformation result corresponding to the set of best transformation matrices and an intra-frame prediction mode to generate an encoded stream. 2. The method for encoding video data according to claim 1 , wherein: the set of best transformation matrices is a non-separate transformation matrix; or the set of best transformation matrices are a pair of transformation matrices that comprise a column transformation matrix and a row transformation matrix. 3. The method for encoding video data according to claim 1 , wherein: the optimization criteria comprise: rate-distortion criteria, sum of absolute difference (SAD), code bits, or distortion. 4. The method for encoding video data according to claim 1 , further comprising: selecting a scan order of a set of coefficients according to the transformation matrix index to scan a transformed coefficient. 5. The method for encoding video data according to claim 1 , further comprising: encoding the prediction residual in various encoding modes, selecting a mode with a minimum optimization criteria cost as the intra-frame prediction mode, and obtaining an encoding result. 6. The method for encoding video data according to claim 1 , wherein: the encoding the selected transformation matrix index according to the transformation result and the intra-frame prediction mode to generate the encoded stream comprises: selecting a method for encoding the transformation matrix index according to a selected intra-frame prediction mode to write the transformation matrix index into the encoded data. 7. The method for encoding video data according to claim 6 , wherein: if the set of best transformation matrices are a pair of transformation matrices, the selecting a method for encoding the transformation matrix index according to the selected intra-frame prediction mode to write the transformation matrix index into the encoded data comprises: encoding index of a pair of transformation matrices jointly, or encoding index of a pair of transformation matrices separately; and selecting a method for encoding the transformation matrix index according to the selected intra-frame prediction mode to write the transformation matrix index into the encoded data. 8. The method for encoding video data according to claim 1 , wherein: the selecting a set of best transformation matrices among the plurality of candidate transformation matrices according to the optimization criteria to perform transform-coding on the prediction residual and obtain a transformation result comprises: traversing all combinations of a column transformation matrix and a row transformation matrix in the plurality of candidate transformation matrices, selecting a transformation combination with a minimum optimization criteria cost after residual transform-coding as a best transformation matrix, and obtaining a transformation result. 9. A video decoding method, comprising: resolving an encoded video stream to obtain a transformation result, and obtaining a transformation matrix index according to an intra-frame prediction mode; and determining a transformation matrix among a plurality of candidate transformation matrices according to the transformation matrix index, using the determined transformation matrix to perform inverse transformation on the transformation result to obtain residual data, and reconstructing video data according to the residual data. 10. The video decoding method according to claim 9 , wherein: the determined transformation matrix is a non-separate transformation matrix; or the determined transformation matrix is a pair of transformation matrices that comprise a column transformation matrix and a row transformation matrix. 11. The video decoding method according to claim 9 , wherein the obtaining the transformation matrix index according to the intra-frame prediction mode comprises: selecting a method for decoding the transformation matrix index according to the intra-frame prediction mode to obtain the transformation matrix index. 12. The video decoding method according to claim 9 , further comprising: selecting a scan order of a set of coefficients according to the transformation matrix index to perform inverse coefficient scanning on a transformed coefficient. 13. The video decoding method according to claim 9 , wherein: the transformation matrix is determined among a set of candidate row transformation matrices and column transformation matrices according to row transformation matrix index information and column transformation matrix index in the transformation matrix index. 14. A video data encoder, comprising: a hardware processor configured to: generate a prediction residual according to input video data; perform transform-coding on the prediction residual using a plurality of candidate transformation matrices and obtain a transformation result for each of the plurality of candidate transformation matrices; select a set of best transformation matrices among the plurality of candidate transformation matrices according to optimization criteria; and encode a selected transformation matrix index according to the transformation result corresponding to the set of best transformation matrices and an intra-frame prediction mode to generate an encoded stream. 15. The video data encoder according to claim 14 , the hardware processor further configured to select a scan order of a set of coefficients according to the transformation matrix index to scan a transformed coefficient. 16. The video data encoder according to claim 14 , the hardware processor further configured to determine a mode with a minimum optimization criteria cost as an intra-frame prediction mode after the prediction residual is encoded in various encoding modes, and obtain an encoding result. 17. The video data encoder according to claim 14 , the hardware processor further configured to select a method for encoding the transformation matrix index according to a selected intra-frame prediction mode to write the transformation matrix index into encoded data. 18. The video data encoder according to claim 14 , the hardware processor further configured to: traverse all combinations of a column transformation matrix and a row transformation matrix in the plurality of candidate transformation matrices, select a transformation combination with a minimum optimization criteria cost after residual transform-coding as a best transformation matrix, and obtain a transformation result. 19. A video decoder, comprising: a hardware processor configured to: resolve a video stream to obtain a transformation result, and obtain a transformation matrix index according to an intra-frame prediction mode; determine a transformation matrix among a plurality of candidate transformation matrices according to the transformation matrix index information; and use the determined transformation matrix to perform inverse transformation on the transformation result to obtain residual data, and reconstruct video data according

Assignees

Inventors

Classifications

  • H04N19/147Primary

    according to rate distortion criteria (rate-distortion as a criterion for motion estimation H04N19/567) · CPC title

  • in combination with predictive coding · CPC title

  • the region being a block, e.g. a macroblock · CPC title

  • Embedding additional information in the video signal during the compression process (H04N19/517, H04N19/68, H04N19/70 take precedence) · CPC title

  • Selection of transform size, e.g. 8x8 or 2x4x8 DCT; Selection of sub-band transforms of varying structure or type · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9723313B2 cover?
The embodiments of the present invention provide a method and a device for encoding and decoding videos, and relate to the communication field, and an efficient transformation matrix corresponding to features of each residual block is selected for transformation, which therefore improves encoding efficiency. The solution provided in an embodiment of the present invention is: generating a predic…
Who is the assignee on this patent?
Yang Mingyuan, Wang Dong, Xiong Lianhuan, and 5 more
What technology area does this patent fall under?
Primary CPC classification H04N19/147. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Aug 01 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).