Optimization of downsampling occurring before encoding images during compression

US11265563B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11265563-B2
Application numberUS-201916599780-A
CountryUS
Kind codeB2
Filing dateOct 11, 2019
Priority dateOct 12, 2018
Publication dateMar 1, 2022
Grant dateMar 1, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The disclosure lies in the field of coding image data, scalable in image resolution, such that one or more successive downsamplings are performed from a source signal of maximal image resolution to a minimal image resolution, in order to encode a base layer (L 0 ) corresponding to this minimal resolution, the base layer (L 0 ) serving as a reference for one or more enhancement layers (L 1 , L 2 ) to be used for decoding at a terminal having a screen of higher image resolution than the minimal image resolution. From said successive downsamplings, the base layer (L 0 ) and the one or more enhancement layers (L 1 , L 2 ) are constructed, and then an encoded bit stream is prepared in order to be transmitted, comprising data of the base layer and of the enhancement layer or layers. In particular, the downsampling step comprises the application of an adaptive low-pass filter to the image data from the downsampling.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method for coding image data, scalable in image resolution, comprising: performing, based on a source signal of maximal image resolution, one or more successive downsamplings to a minimal image resolution, in order to encode a base layer corresponding to said minimal resolution, wherein the base layer is usable as a reference for one or more enhancement layers to be used for decoding at a terminal having a screen of higher image resolution than the minimal image resolution, constructing the base layer and the one or more enhancement layers based on the one or more successive downsamplings, preparing an encoded bitstream to be transmitted, wherein the encoded bitstream comprises data of the base layer and data of the one or more enhancement layers, wherein at least one of the one or more successive downsamplings comprises: performing an image pre-analysis to determine at least an importance criterion to be given to one or more regions of interest in image data of the source signal, the importance criterion being a texture complexity score of a region of interest measured by: a determination of a depth map in the current image, in order to identify a plurality of superimposed planes, and in each of the identified planes, a detection and a quantification of blur, assigning of a score to a region according to a depth of a plane in which this region is located, and a sharpness of an image in this plane, selecting at least one of a plurality of predetermined types of adaptive filters based at least in part upon the determined importance criterion, applying an adaptive low-pass filter of the selected type to the image data of the source signal to reduce a spectral folding effect of the at least one of the one or more successive downsamplings. 2. The method according to claim 1 , wherein the adaptive filter is of a finite impulse response type. 3. The method according to claim 2 , wherein the finite impulse response filter is of a type chosen among at least: a Kaiser-Bessel window filter, a bicubic filter, a bilinear filter, an interpolation filter. 4. The method according to claim 1 , wherein the type of adaptive filter is selected for an image or an image region according to at least one criterion among: an image resolution before downsampling, a frame rate of the source signal, an image coding rate. 5. The method according to claim 1 , wherein a bicubic filter is selected for sharp foreground regions, while a bilinear filter is used for sharp background regions and/or fuzzy foreground regions, and otherwise an interpolation filter is used. 6. The method according to claim 1 , wherein a range of configurations defining coefficients of a Kaiser-Bessel window filter is chosen based on a criterion among an image resolution before downsampling and a frame rate of the source signal. 7. The method according to claim 6 , wherein the coefficients of the Kaiser-Bessel window filter are chosen according to a configuration selected within said range, based on other criterion among the image resolution before downsampling and the frame rate of the source signal. 8. The method according to claim 1 , wherein a bicubic filter is selected if a coding rate of the base layer is greater than a threshold, and otherwise a bilinear filter is selected. 9. The method according to claim 8 , wherein the threshold is 2 Mbits/sec. 10. An image data coding device, scalable in image resolution, comprising a processing circuit for implementing the method according to claim 1 . 11. A computer program comprising instructions stored on a non-transitory computer-readable medium for implementing the method according to claim 1 , when said instructions are executed by a processor of the processing circuit. 12. A method for coding image data, scalable in image resolution, comprising: performing, based on a source signal of maximal image resolution, one or more successive downsamplings to a minimal image resolution, in order to encode a base layer corresponding to said minimal resolution, wherein the base layer is usable as a reference for one or more enhancement layers to be used for decoding at a terminal having a screen of higher image resolution than the minimal image resolution, constructing the base layer and the one or more enhancement layers based on the one or more successive downsamplings, preparing an encoded bitstream to be transmitted, wherein the encoded bitstream comprises data of the base layer and data of the one or more enhancement layers, wherein at least one of the one or more successive downsamplings comprises: performing an image pre-analysis to determine at least an importance criterion to be given to one or more regions of interest in image data of the source signal, the image pre-analysis identifying areas in the image data having a contrast or texture complexity above or below a predetermined threshold, the importance criterion being a texture complexity score of a region of interest measured by: a determination of a depth map in the current image, in order to identify a plurality of superimposed planes, and in each of the identified planes, a detection and a quantification of blur, assigning of a score to a region according to a depth of a plane in which this region is located, and a sharpness of an image in this plane, selecting at least one of a plurality of predetermined types of adaptive filters based at least in part upon the determined importance criterion, applying an adaptive low-pass filter of the selected type to the image data of the source signal to reduce a spectral folding effect of the at least one of the one or more successive downsamplings.

Assignees

Inventors

Classifications

  • Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation (H04N19/635, H04N19/86 take precedence) · CPC title

  • the unit being an image region, e.g. an object · CPC title

  • H04N19/33Primary

    in the spatial domain · CPC title

  • Position within a video image, e.g. region of interest [ROI] · CPC title

  • the unit being a scalable video layer · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11265563B2 cover?
The disclosure lies in the field of coding image data, scalable in image resolution, such that one or more successive downsamplings are performed from a source signal of maximal image resolution to a minimal image resolution, in order to encode a base layer (L 0 ) corresponding to this minimal resolution, the base layer (L 0 ) serving as a reference for one or more enhancement layers (L 1 , L 2…
Who is the assignee on this patent?
Ateme
What technology area does this patent fall under?
Primary CPC classification H04N19/33. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Mar 01 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).