Methods and apparatuses for video encoding and video decoding

US12464160B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12464160-B2
Application numberUS-201816753763-A
CountryUS
Kind codeB2
Filing dateOct 4, 2018
Priority dateOct 5, 2017
Publication dateNov 4, 2025
Grant dateNov 4, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Implementations are described for determining, for a block being encoded in a picture, at least one predictor candidate, determining for the at least one predictor candidate, one or more corresponding control point generator motion vectors, based on motion information associated to the at least one predictor candidate, determining for the block being encoded, one or more corresponding control point motion vectors, based on the one or more corresponding control point generator motion vectors determined for the at least one predictor candidate, determining, based on the one or more corresponding control point motion vectors determined for the block, a corresponding motion field, and encoding the block based on the corresponding motion field.

First claim

Opening claim text (preview).

The invention claimed is: 1 . A method for video decoding, comprising: determining a predictor candidate for a block being decoded in a picture in an affine motion model, wherein the predictor candidate has a translational motion model and a plurality of sub-blocks comprising at least a top-left sub-block, a top-right sub-block, a bottom-left sub-block, and a bottom-right sub-block; determining for the predictor candidate, at least two control point generator motion vectors of an affine motion model, wherein each control point generator motion vector is associated to a different sub-block of the predictor candidate, provided that the at least two control point generator motion vectors determined for the predictor candidate are for the top-left sub-block and the top-right sub-block, respectively, and wherein motion vectors for the bottom-left sub-block and the bottom-right sub-block are compared to estimated motion vectors for the bottom-left sub-block and the bottom-right sub-block and satisfy a threshold level for respective angle and magnitude; determining corresponding control point motion vectors for the block being decoded based on the at least two control point generator motion vectors determined for the predictor candidate, such that the determined control point motion vectors reflect both motion per sub-block and the translational motion model of the predictor candidate; determining, based on the determined control point motion vectors, a corresponding motion field for the block, wherein the motion field identifies motion vectors used for prediction of sub-blocks of the block being decoded; and decoding the block based on the motion field. 2 . The method of claim 1 , wherein the predictor candidate is comprised in a set of predictor candidates and wherein determining the predictor candidate comprises receiving an index corresponding to the predictor candidate in the set of predictor candidates. 3 . The method of claim 1 , further comprising verifying that the determined control point motion vectors satisfy the affine motion model. 4 . The method of claim 1 , wherein determining the at least two control point generator motion vectors comprises: determining, for at least two distinct sets of at least three sub-blocks of the predictor candidate, corresponding control point motion vectors for the predictor candidate associated respectively to the at least two sets, based on the motion vectors associated respectively to the at least three sub-blocks of each set; and calculating corresponding control point motion vectors associated to the predictor candidate by averaging the determined control point motion vectors associated to each set. 5 . The method of claim 1 , wherein the motion vector is derived from at least one of: a bilateral template matching between two reference blocks in respectively two reference frames; a reference block of a reference frame identified by motion information of a first spatial neighboring block of the predictor candidate; or an average of motion vectors of spatial and temporal neighboring blocks of the predictor candidate. 6 . A non-transitory computer readable storage medium having stored thereon instructions for decoding video data according to the method of claim 1 . 7 . An apparatus for video decoding, comprising a memory and at least one processor configured for: determining a predictor candidate for a block being decoded in a picture in an affine motion model, wherein the predictor candidate has a translational motion model and a plurality of sub-blocks comprising at least a top-left sub-block, a top-right sub-block, a bottom-left sub-block, and a bottom-right sub-block; determining for the predictor candidate, at least two control point generator motion vectors of an affine motion model, wherein each control point generator motion vector is associated to a different sub-block of the predictor candidate, provided that the at least two control point generator motion vectors determined for the predictor candidate are for the top-left sub-block and the top-right sub-block, respectively, and wherein motion vectors for the bottom-left sub-block and the bottom-right sub-block are compared to estimated motion vectors for the bottom-left sub-block and the bottom-right sub-block and satisfy a threshold level for respective angle and magnitude; determining corresponding control point motion vectors for the block being decoded based on the at least two control point generator motion vectors determined for the predictor candidate, such that the determined control point motion vectors reflect both motion per sub-block and the translational motion model of the predictor candidate; determining, based on the determined control point motion vectors, a corresponding motion field for the block, wherein the motion field identifies motion vectors used for prediction of sub-blocks of the block being decoded; and decoding the block based on the motion field. 8 . The apparatus of claim 7 , wherein the predictor candidate is comprised in a set of predictor candidates and wherein determining the predictor candidate comprises receiving an index corresponding to the predictor candidate in the set of predictor candidates. 9 . The apparatus of claim 7 , wherein the at least one processor is further configured for verifying that the determined control point motion vectors satisfy the affine motion model. 10 . The apparatus of claim 7 , wherein determining the at least two control point generator motion vectors comprises: determining, for at least two distinct sets of at least three sub-blocks of the predictor candidate, corresponding control point motion vectors for the predictor candidate associated respectively to the at least two sets, based on the motion vectors associated respectively to the at least three sub-blocks of each set; and calculating corresponding control point motion vectors associated to the predictor candidate by averaging the determined control point motion vectors associated to each set. 11 . The apparatus of claim 7 , wherein the motion vector is derived from at least one of: a bilateral template matching between two reference blocks in respectively two reference frames; a reference block of a reference frame identified by motion information of a first spatial neighboring block of the predictor candidate; or an average of motion vectors of spatial and temporal neighboring blocks of the predictor candidate. 12 . A method for video encoding, comprising: determining a predictor candidate for a block being encoded in a picture in an affine motion model, wherein the predictor candidate has a translational motion model and a plurality of sub-blocks comprising at least a top-left sub-block, a top-right sub-block, a bottom-left sub-block, and a bottom-right sub-block; determining for the predictor candidate, at least two control point generator motion vectors of an affine motion model, wherein each control point generator motion vector is associated to a different sub-block of the predictor candidate, provided that the at least two control point generator motion vectors determined for the predictor candidate are for the top-left sub-block and the top-right sub-block, respectively, and wherein motion vectors for the bottom-left sub-block and the bottom-right sub-block are compared to estimated motion vectors for the bottom-left sub-block and the bottom-right sub-block and satisfy a threshold level for respective angle and magnitude; determining corresponding control point motion vectors for the block being encoded based on the at least two control point generator motion vectors determined for the predictor candidat

Assignees

Inventors

Classifications

  • the region being a block, e.g. a macroblock · CPC title

  • Analysis of motion vectors, e.g. their magnitude, direction, variance or reliability · CPC title

  • Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction · CPC title

  • according to rate distortion criteria (rate-distortion as a criterion for motion estimation H04N19/567) · CPC title

  • characterised by syntax aspects related to video coding, e.g. related to compression standards · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12464160B2 cover?
Implementations are described for determining, for a block being encoded in a picture, at least one predictor candidate, determining for the at least one predictor candidate, one or more corresponding control point generator motion vectors, based on motion information associated to the at least one predictor candidate, determining for the block being encoded, one or more corresponding control p…
Who is the assignee on this patent?
Interdigital Vc Holdings Inc
What technology area does this patent fall under?
Primary CPC classification H04N19/56. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Nov 04 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).