Image processing apparatus using neural network and image processing method using the same

US12598322B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12598322-B2
Application numberUS-202418654604-A
CountryUS
Kind codeB2
Filing dateMay 3, 2024
Priority dateNov 15, 2023
Publication dateApr 7, 2026
Grant dateApr 7, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An image encoding apparatus may include a memory storing one or more instructions; and a processor configured to execute the one or more instructions to: obtain an initial search position from a student neural network that has been trained through a knowledge distillation from a teacher neural network by inputting a current frame and a reference frame to the student neural network; and perform motion estimation based on the initial search position.

First claim

Opening claim text (preview).

What is claimed is: 1 . An image encoding apparatus comprising: a memory storing one or more instructions; and a processor configured to execute the one or more instructions to: obtain an initial search position from a student neural network that has been trained through a knowledge distillation from a teacher neural network by inputting a current frame and a reference frame to the student neural network; and perform motion estimation based on the initial search position, wherein the student neural network is trained by inputting a first frame and a second frame into the teacher neural network to obtain a first initial search position, inputting the first frame and the second frame into the student neural network to obtain a second initial search position, and training the student neural network based on a loss function using a difference between the first initial search position and the second initial search position. 2 . The image encoding apparatus of claim 1 , wherein the loss function comprises a multiplication of the difference between the first initial search position and the second initial search position by a loss weight. 3 . The image encoding apparatus of claim 2 , wherein the loss weight is determined based on bitrate increasing rate information acquired by the teacher neural network. 4 . The image encoding apparatus of claim 3 , wherein the bitrate increasing rate information is obtained by comparing an image compression result with the first initial search position applied and an image compression result without applying the first initial search position. 5 . The image encoding apparatus of claim 1 , wherein the processor is further configured to output a predicted frame by performing motion compensation based on result of the motion estimation. 6 . The image encoding apparatus of claim 1 , wherein the teacher neural network is located within the image encoding apparatus, or an external device located outside the image encoding apparatus. 7 . The image encoding apparatus of claim 1 , wherein there are a plurality of teacher neural networks comprising the teacher neural network, and the student neural network is trained by inputting the first frame and the second frame into each of the plurality of teacher neural networks, obtaining a plurality of initial search positions from the plurality of teacher neural networks, inputting the first frame and the second frame into the student neural network, obtaining the initial search position from the student neural network, and train the student neural network based on the loss function using differences between the plurality of initial search positions from the plurality of teacher neural networks and the initial search position from the student neural network. 8 . The image encoding apparatus of claim 1 , wherein the loss function comprises a computation of a difference between the first initial search position and a ground truth label. 9 . An image encoding method comprising: outputting an initial search position from a student neural network by inputting a current frame and a reference frame of an image into the student neural network that has been trained through a knowledge distillation from a teacher neural network; and performing motion estimation based on the initial search position, wherein the outputting of the initial search position comprises: obtaining a first initial search position from the teacher neural network by inputting a first frame and a second frame, into the teacher neural network; obtaining a second initial search position from the student neural network by inputting the first frame and the second frame into the student neural network; and training the student neural network based on a loss function using a difference between the first initial search position and the second initial search position. 10 . The image encoding method of claim 9 , wherein the loss function comprises a multiplication of the difference between the first initial search position and the second initial search position by a loss weight. 11 . The image encoding method of claim 10 , further comprising: generating the loss weight based on bitrate increasing rate information acquired by the teacher neural network. 12 . The image encoding method of claim 11 , further comprising: obtaining the bitrate increasing rate information by comparing an image compression result of the student neural network with the first initial search position applied and an image compression result of the student neural network without applying the first initial search position. 13 . The image encoding method of claim 9 , further comprising outputting a predicted frame by performing motion compensation based on result of the motion estimation. 14 . The image encoding method of claim 9 , wherein the loss function comprises a computation of a difference between the first initial search position and a ground truth label. 15 . An image decoding apparatus comprising: an entropy decoder configured to receive a bitstream and parse the received bitstream to obtain information necessary for image reconstruction; an inverse quantizer configured to output a transform coefficient by inversely quantizing a quantized transform coefficient contained in the information; an inverse transformer configured to obtain a residual block by inversely transforming the transform coefficient; a predictor configured to generate a predicted block on the basis of information on prediction obtained from the entropy decoder; an adder configured to generate a reconstructed block by adding the residual block and the predicted block; and a filter configured to apply filtering to the reconstructed block, wherein the bitstream is an encoded image signal based on a motion vector obtained using an initial search position output by inputting a current frame and a reference frame into a student neural network trained through a knowledge distillation technique using a teacher neural network, and wherein the student neural network is trained by inputting a first frame and a second frame into the teacher neural network to obtain a first initial search position, inputting the first frame and the second frame into the student neural network to obtain a second initial search position, and training the student neural network based on a loss function using a difference between the first initial search position and the second initial search position. 16 . An image decoding method comprising: receiving a bitstream and parsing the received bitstream to obtain information necessary for image reconstruction; outputting a transform coefficient by inversely quantizing a quantized transform coefficient contained in the information; obtaining a residual block by inversely transforming the transform coefficient; generating a predicted block on the basis of information on prediction obtained from an entropy decoder; generating a reconstructed block by adding the residual block and the predicted block; and applying filtering to the reconstructed block, wherein the bitstream is an encoded image signal based on a motion vector obtained using an initial search position output by inputting a current frame and a reference frame into a student neural network trained through a knowledge distillation technique using a teacher neural network, and wherein the student neural network is trained by inputting a first frame and a second frame into the teacher neural network to obtain a first initial search position, inputting the first frame and the second frame into the student neur

Assignees

Inventors

Classifications

  • Quantisation · CPC title

  • the unit being a set of transform coefficients · CPC title

  • the region being a block, e.g. a macroblock · CPC title

  • Entropy coding, e.g. variable length coding [VLC] or arithmetic coding · CPC title

  • Filters, e.g. for pre-processing or post-processing (sub-band filter banks H04N19/635) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12598322B2 cover?
An image encoding apparatus may include a memory storing one or more instructions; and a processor configured to execute the one or more instructions to: obtain an initial search position from a student neural network that has been trained through a knowledge distillation from a teacher neural network by inputting a current frame and a reference frame to the student neural network; and perform …
Who is the assignee on this patent?
Samsung Electronics Co Ltd
What technology area does this patent fall under?
Primary CPC classification H04N19/513. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Apr 07 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 7 related publications on this page (citations in our corpus or others sharing the same primary CPC).