Deep video coding with block-based motion estimation

US2025386027A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2025386027-A1
Application numberUS-202519302635-A
CountryUS
Kind codeA1
Filing dateAug 18, 2025
Priority dateFeb 22, 2023
Publication dateDec 18, 2025
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An apparatus for determining an encoding of a motion field for a picture of a video sequence comprising a sequence of pictures, such that said picture is decodable using a reference picture, the motion field and the residual, according to an embodiment is provided. The apparatus comprises a trained neural network configured to determine the encoding of the motion field, being associated with said picture, depending on said picture and depending on the reference picture.

First claim

Opening claim text (preview).

1 . An apparatus for determining an encoding of a motion field for a picture of a video sequence comprising a sequence of pictures, such that said picture is decodable using a reference picture, the motion field and the residual, wherein the apparatus comprises a trained neural network configured to determine the encoding of the motion field, being associated with said picture, depending on said picture and depending on the reference picture. 2 . An apparatus for encoding, wherein the apparatus is configured to encode a video sequence comprising a sequence of pictures to acquire encoded video data, wherein the apparatus is configured to generate the encoded video data such that each picture of one or more pictures of the video sequence is encoded by an encoding of a motion field and a residual, such that said picture is decodable using a reference picture, the motion field and the residual; wherein the apparatus comprises a trained neural network configured to determine the encoding of the motion field, being associated with said picture, depending on said picture and depending on the reference picture. 3 . An apparatus according to claim 1 , wherein the apparatus is configured to determine the motion field using a block-based motion search strategy, and wherein the trained neural network is configured to determine the encoding of the motion field. 4 . An apparatus according to claim 1 , wherein the apparatus is configured to determine two or more motion fields using the block-based motion search strategy, wherein the trained neural network is configured to determine the encoding of the motion field depending on the two or more motion fields that have been determined using the block-based motion search strategy. 5 . An apparatus according to claim 4 , wherein the apparatus is configured to determine the encoding of the motion field depending on the two or more motion fields by employing a cost function. 6 . An apparatus according to claim 4 , wherein the two or more motion fields exhibit different block sizes, for example, 8×8, and/or 16×16, and/or 32×32, and/or 64×64. 7 . An apparatus according to claim 3 , wherein the apparatus is configured to determine the motion field or the one or more motion fields using the block-based motion search strategy without using a neural network, and wherein the trained neural network is configured to determine the encoding of the motion field depending on the motion field or depending on the one or more motion fields. 8 . An apparatus according to claim 3 , wherein the block-based motion strategy comprises a block-based diamond search. 9 . An apparatus according to claim 3 , wherein the block-based motion strategy comprises a to determine the motion field depending on a sub-pel search. 10 . An apparatus according to claim 1 , wherein the trained neural network has been trained using a minimization function or optimization function, which depends on a predicted picture and an original picture, wherein the predicted picture is a picture that results from decoding using a reference picture and a motion field which are associated with said predicted picture. 11 . An apparatus according to claim 10 , wherein the neural network has been trained comprising minimizing a mean squared error between a predicted picture and an original picture. 12 . An apparatus according to claim 10 , wherein the neural network has been trained comprising minimizing a rate which depends on the motion field and/or on a residual. 13 . An apparatus according to claim 12 , wherein the neural network has been trained comprising the minimizing of a rate which depends on a rate of a block-based transform coder for the residual. 14 . An apparatus according to claim 12 , wherein the neural network has been trained comprising the minimizing of the rate depending on ∑ k - log 2 ⁢ P z ( z ~ k , ( μ k ^ , σ k ^ ) ) + ∑ l - log 2 ⁢ P y ( y ~ l , ϕ ) , and/or depending on κ ⁢ ∑ 𝔅 j  DCT ( x i + 1

Assignees

Inventors

Classifications

  • characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation (H04N19/635 takes precedence) · CPC title

  • H04N19/176Primary

    the region being a block, e.g. a macroblock · CPC title

  • the region being a picture, frame or field · CPC title

  • Non-supervised learning, e.g. competitive learning · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2025386027A1 cover?
An apparatus for determining an encoding of a motion field for a picture of a video sequence comprising a sequence of pictures, such that said picture is decodable using a reference picture, the motion field and the residual, according to an embodiment is provided. The apparatus comprises a trained neural network configured to determine the encoding of the motion field, being associated with sa…
Who is the assignee on this patent?
Fraunhofer Ges Forschung
What technology area does this patent fall under?
Primary CPC classification H04N19/176. Mapped technology areas include Electricity.
When was this patent published?
Publication date Thu Dec 18 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).