Fixed rate intraframe compression and decompression of video based on visual quality

US12506887B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12506887-B2
Application numberUS-202418940335-A
CountryUS
Kind codeB2
Filing dateNov 7, 2024
Priority dateApr 12, 2024
Publication dateDec 23, 2025
Grant dateDec 23, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

DCT-based methods for fixed bit rate, intraframe compression of video are based on the visual quality of the resulting decompressed imagery. In general, this involves assigning an initial bit budget per macroblock of a given video frame, resulting in degrees of compression, i.e., quantization scale factors, that vary among the macroblocks according to their complexity. The scale factors are then adjusted while maintaining the overall frame bit budget to reduce the visibility of artifacts in the decompressed frame. The adjustments may include increasing scale factors for simple macroblocks and reducing scale factors for complex blocks. As compared to traditional DCT-based compression schemes, the described methods may reduce the visibility of compression-related artifacts both in complex and in simple portions of the frame.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method for fixed bit rate, intraframe compression of video, including a sequence of images, comprising, for each image: segmenting the image into n portions, wherein compression parameters are individually adjustable for each of the n portions; transforming each of the portions of the image to generate frequency domain coefficients for each of the portions; selecting a stored quantization matrix for the image based on a desired bit rate for the compressed video; determining an initial bit budget per transformed portion of the image that is equal to a bit budget for the image within the compressed video divided by n; for each transformed portion, determining an initial scale factor that is the smallest scale factor that, when applied to the frequency domain coefficients together with the quantization matrix, results in a bit size of the transformed portion that is equal to or less than the initial bit budget per transformed portion; and generating a bitstream representing the compressed image by: for each transformed portion, scaling the portion's frequency domain coefficients by applying to the portion the initial scale factor determined for that portion together with the quantization matrix; and arranging the scaled frequency domain coefficients in a predetermined sequence and applying variable length coding to the sequence of scaled frequency domain coefficients. 2 . The method of claim 1 , further comprising: ranking the transformed portions in order of their initial scale factors and determining an average of the initial scale factors; proceeding in order of increasing initial scale factor, adjusting downward the initial scale factors of a plurality of transformed portions having initial scale factors greater than the average initial scale factor, wherein the amount of adjustment is a diminishing fraction of a difference between a given transformed portion's initial scale factor and the average initial scale factor; and terminating the adjustment of initial scale factors when adjusting an additional transformed portion would result in a sum of bit sizes of the scaled transformed portions of the image exceeding a bit budget for the image within the compressed video. 3 . The method of claim 2 , further comprising: prior to adjusting downward the initial scale factors of the plurality of transformed portions having initial scale factors greater than the average initial scale factor, for each portion having an initial scale factor below a threshold scale factor, adjusting upward the initial scale factor of the portion, wherein the threshold scale factor is based on the average initial scale factor and a standard deviation of the initial scale factors. 4 . A computer program product comprising: a non-transitory computer-readable medium with computer-readable instructions encoded thereon, wherein the computer-readable instructions, when processed by a processing device instruct the processing device to perform a method for fixed bit rate, intraframe compression of video, including a sequence of images, the method comprising, for each image: segmenting the image into n portions, wherein compression parameters are individually adjustable for each of the n portions; transforming each of the portions of the image to generate frequency domain coefficients for each of the portions; selecting a stored quantization matrix for the image based on a desired bit rate for the compressed video; determining an initial bit budget per transformed portion of the image that is equal to a bit budget for the image within the compressed video divided by n; for each transformed portion, determining an initial scale factor that is the smallest scale factor that, when applied to the frequency domain coefficients together with the quantization matrix, results in a bit size of the transformed portion that is equal to or less than the initial bit budget per transformed portion; and generating a bitstream representing the compressed image by: for each transformed portion, scaling the portion's frequency domain coefficients by applying to the portion the initial scale factor determined for that portion together with the quantization matrix; and arranging the scaled frequency domain coefficients in a predetermined sequence and applying variable length coding to the sequence of scaled frequency domain coefficients. 5 . A system comprising: a memory for storing computer-readable instructions; and a processor connected to the memory, wherein the processor, when executing the computer-readable instructions, causes the system to perform a method for fixed bit rate, intraframe compression of video, including a sequence of images, the method comprising, for each image: segmenting the image into n portions, wherein compression parameters are individually adjustable for each of the n portions; transforming each of the portions of the image to generate frequency domain coefficients for each of the portions; selecting a stored quantization matrix for the image based on a desired bit rate for the compressed video; determining an initial bit budget per transformed portion of the image that is equal to a bit budget for the image within the compressed video divided by n; for each transformed portion, determining an initial scale factor that is the smallest scale factor that, when applied to the frequency domain coefficients together with the quantization matrix, results in a bit size of the transformed portion that is equal to or less than the initial bit budget per transformed portion; and generating a bitstream representing the compressed image by: for each transformed portion, scaling the portion's frequency domain coefficients by applying to the portion the initial scale factor determined for that portion together with the quantization matrix; and arranging the scaled frequency domain coefficients in a predetermined sequence and applying variable length coding to the sequence of scaled frequency domain coefficients.

Assignees

Inventors

Classifications

  • using discrete cosine transform [DCT] · CPC title

  • using compressed domain processing techniques other than decoding, e.g. modification of transform coefficients, variable length coding [VLC] data or run-length data (motion estimation in a transform domain H04N19/547; processing of decoded motion vectors H04N19/513) · CPC title

  • the unit being bits, e.g. of the compressed video stream · CPC title

  • Incoming video signal characteristics or properties · CPC title

  • H04N19/124Primary

    Quantisation · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12506887B2 cover?
DCT-based methods for fixed bit rate, intraframe compression of video are based on the visual quality of the resulting decompressed imagery. In general, this involves assigning an initial bit budget per macroblock of a given video frame, resulting in degrees of compression, i.e., quantization scale factors, that vary among the macroblocks according to their complexity. The scale factors are the…
Who is the assignee on this patent?
Avid Tech Inc
What technology area does this patent fall under?
Primary CPC classification H04N19/124. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Dec 23 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).