Predictor candidates for motion compensation

US2025317596A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2025317596-A1
Application numberUS-202519242477-A
CountryUS
Kind codeA1
Filing dateJun 18, 2025
Priority dateOct 5, 2017
Publication dateOct 9, 2025
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Different implementations are described, particularly implementations for determining a set of predictor candidates for affine merge coding mode from neighboring blocks for motion compensation of a picture block based on a motion model. The motion model, may be, e.g., an affine model in a merge mode or AMVP mode for a video content encoder or decoder. The motion model, may be, e.g., an affine model based on top-left/top-right control point motion vectors or an affine model based on top-left/bottom-left control point motion vectors. Such affine model may be signaled by a flag. In an embodiment, predictor candidates are sorted in the set based on a criterion such as, e.g., a validity check or a vectors coherence cost. In an embodiment, a predictor candidate is selected from the set based on a motion model for each of the multiple predictor candidates, and may be based on a criterion such as, e.g., a rate distortion cost. The corresponding motion field is determined based on, e.g., one or more corresponding control point motion vectors for the block being encoded or decoded. The corresponding motion field of an embodiment identifies motion vectors used for prediction of sub-blocks of the block being encoded or decoded.

First claim

Opening claim text (preview).

1 . A method for video encoding, comprising: determining, for a block being encoded in affine merge mode, a top-left list of spatial neighboring blocks, the top-left list comprising neighboring blocks of a top-left corner of the block, a top-right list of spatial neighboring blocks of the block, the top-right list comprising neighboring blocks of a top-right corner of the block, a bottom-left list of spatial neighboring blocks of the block, the bottom-left list comprising neighboring blocks of a bottom-left corner of the block; determining, for the block being encoded, a set of predictor candidates for affine merge mode based on the top-left list, top-right list and bottom-left list, wherein the set of predictor candidates for affine merge mode comprises: a first predictor candidate on condition that a reference picture of a first spatial neighboring block of the top-left list is the same as a reference picture of a second spatial neighboring block of the top-right list wherein: a motion vector of the first spatial neighboring block of the top-left list is used as a first control point motion vector of the first predictor candidate; a motion vector of the second spatial neighboring block of the top-right list is used as a second control point motion vector of first predictor candidate; and the reference picture of the first spatial neighboring block of the top-left list is used as reference picture of the first predictor candidate; and a second predictor candidate on condition that a reference picture of a first spatial neighboring block of the top-left list is the same as a reference picture of a second spatial neighboring block of the bottom-left list: a motion vector of the first spatial neighboring block of the top-left list is used as the first control point motion vector of the second predictor candidate; a motion vector of the second spatial neighboring block of the bottom-left list and the motion vector of the first spatial neighboring block of the top-left list are used to derive a second control point motion vector of the second predictor candidate; and the reference picture of the first spatial neighboring block of the top-left list is used as a reference picture of the second predictor candidate; determining, for the block being encoded and for each predictor candidate, a motion field based on a motion model and on the one or more control point motion vectors of the predictor candidate, wherein the motion field identifies motion vectors used for prediction of sub-blocks of the block being encoded; selecting a predictor candidate from the set of predictor candidates based on a rate distortion determination between predictions responsive to the motion field determined for each predictor candidate; and encoding the block based on the motion field for the selected predictor candidate. 2 . The method of claim 1 , wherein motion information associated to at least one of the spatial neighboring blocks comprises translational motion information. 3 . The method of claim 1 , wherein motion information associated to at least one of the spatial neighboring blocks comprises affine motion information. 4 . The method of claim 1 , wherein the set of predictor candidates for affine merge mode comprises: a third predictor candidate on condition that a reference picture of a first spatial neighboring block of the top-left list is the same as a reference picture of a second spatial neighboring block of the top-right list and as a reference picture of a third spatial neighboring block of the bottom-left list wherein: a motion vector of the first spatial neighboring block of the top-left list is used as a first control point motion vector of the third predictor candidate; a motion vector of the second spatial neighboring block of the top-right list is used as a second control point motion vector of third predictor candidate; a motion vector of the third spatial neighboring block of the bottom-left list is used as a third control point motion vector of the third predictor candidate; and the reference picture of the first spatial neighboring block of the top-left list is used as reference picture of the third predictor candidate. 5 . An apparatus for video encoding, comprising: a memory and one or more processors configured for: determining, for a block being encoded in affine merge mode, a top-left list of spatial neighboring blocks, the top-left list comprising neighboring blocks of a top-left corner of the block, a top-right list of spatial neighboring blocks of the block, the top-right list comprising neighboring blocks of a top-right corner of the block, a bottom-left list of spatial neighboring blocks of the block, the bottom-left list comprising neighboring blocks of a bottom-left corner of the block; determining, for the block being encoded, a set of predictor candidates for affine merge mode based on the top-left list, top-right list and bottom-left list, wherein the set of predictor candidates for affine merge mode comprises: a first predictor candidate on condition that a reference picture of a first spatial neighboring block of the top-left list is the same as a reference picture of a second spatial neighboring block of the top-right list wherein: a motion vector of the first spatial neighboring block of the top-left list is used as a first control point motion vector of the first predictor candidate; a motion vector of the second spatial neighboring block of the top-right list is used as a second control point motion vector of first predictor candidate; and the reference picture of the first spatial neighboring block of the top-left list is used as reference picture of the first predictor candidate; and a second predictor candidate on condition that a reference picture of a first spatial neighboring block of the top-left list is the same as a reference picture of a second spatial neighboring block of the bottom-left list: a motion vector of the first spatial neighboring block of the top-left list is used as the first control point motion vector of the second predictor candidate; a motion vector of the second spatial neighboring block of the bottom-left list and the motion vector of the first spatial neighboring block of the top-left list are used to derive a second control point motion vector of the second predictor candidate; and the reference picture of the first spatial neighboring block of the top-left list is used as a reference picture of the second predictor candidate; determining, for the block being encoded and for each predictor candidate, a motion field based on a motion model and on the one or more control point motion vectors of the predictor candidate, wherein the motion field identifies motion vectors used for prediction of sub-blocks of the block being encoded; selecting a predictor candidate from the set of predictor candidates based on a rate distortion determination between predictions responsive to the motion field determined for each predictor candidate; and encoding the block based on the motion field for the selected predictor candidate. 6 . The apparatus of claim 5 , wherein motion information associated to at least one of the spatial neighboring blocks comprises translational motion information. 7 . The apparatus of claim 5 , wherein motion information associated to at least one of the spatial neighboring blocks comprises affine motion information. 8 . The apparatus of claim 5 , wherein the set of predictor candidates for affine merge mode comprises: a third predictor candidate on condition that a reference picture of a first spatial neighboring block of the top-left list is the same as a reference picture of a second spatial neighboring block of the top-right list and as a reference picture

Assignees

Inventors

Classifications

  • Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction · CPC title

  • using feature points or meshes · CPC title

  • Motion estimation based on rate distortion criteria · CPC title

  • Motion estimation with initialisation of the vector search, e.g. estimating a good candidate to initiate a search · CPC title

  • the region being a block, e.g. a macroblock · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2025317596A1 cover?
Different implementations are described, particularly implementations for determining a set of predictor candidates for affine merge coding mode from neighboring blocks for motion compensation of a picture block based on a motion model. The motion model, may be, e.g., an affine model in a merge mode or AMVP mode for a video content encoder or decoder. The motion model, may be, e.g., an affine m…
Who is the assignee on this patent?
Interdigital Vc Holdings Inc
What technology area does this patent fall under?
Primary CPC classification H04N19/52. Mapped technology areas include Electricity.
When was this patent published?
Publication date Thu Oct 09 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).