Intra predictions using linear or affine transforms with neighbouring sample reduction

US12075091B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12075091-B2
Application numberUS-202217962391-A
CountryUS
Kind codeB2
Filing dateOct 7, 2022
Priority dateDec 20, 2018
Publication dateAug 27, 2024
Grant dateAug 27, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques for encoding/decoding of video signals are described, e.g. implemented in decoders, encoder, methods and non-transitory storage units storing instructions to perform the methods. A decoder or encoder may be configured to predict a predetermined block of the picture using a plurality of neighbouring samples by reducing the plurality of neighbouring samples to obtain a reduced set of samples values lower, in number of samples, than compared to the plurality of neighbouring samples, subjecting the reduced set of sample values to a linear or affine linear transformation to obtain predicted values for predetermined samples of the predetermined block.

First claim

Opening claim text (preview).

What is claimed is: 1. A decoder for decoding a picture from a data stream, configured to predict a predetermined block of the picture using a plurality of neighbouring samples, the decoder comprising: a non-transitory computer-readable medium; and at least one processor communicatively coupled to the non-transitory computer-readable medium, wherein the at least one processor is configured to read instructions from the non-transitory computer-readable medium to perform operations comprising: subdividing the picture into a plurality of blocks of different block sizes, the plurality of blocks comprising the predetermined block; reducing the plurality of neighbouring samples to obtain a reduced set of sample values lower, in number of samples, than compared to the plurality of neighbouring samples; obtaining predicted values by applying a matrix containing weighting factors to the reduced set of sample values; selecting a linear or affine linear transformation depending on a width W and height H of the predetermined block such that the linear or affine linear transformation selected for the predetermined block is selected out of a first set of linear or affine linear transformations as long as the width W and height H of the predetermined block are within a first set of width/height pairs and a second set of linear or affine linear transformations as long as the width W and height H of the predetermined block are within a second set of width/height pairs which is disjoint to the first set of width/height pairs; and deriving, by interpolation, prediction values for further samples of the predetermined block based on the predicted values for the plurality of neighbouring samples. 2. The decoder of claim 1 , the operations further comprising performing the reducing by averaging, wherein the averaging includes bit-shifting. 3. The decoder of claim 1 , wherein the plurality of neighbouring samples extend one-dimensionally along two sides of the predetermined block, predetermined samples are arranged in rows and columns and, along at least one of the rows and columns, the predetermined samples are positioned at every n th position from a sample of the predetermined samples adjoining the two sides of the predetermined block and the operations further comprise, based on the plurality of neighbouring samples, determining for each of the at least one of the rows and the columns, a support value for one of a plurality of neighbouring positions. 4. The decoder of claim 3 , wherein the predetermined samples are positioned at every n th position from the sample of the predetermined samples which adjoins the two sides of the predetermined block along the rows and the predetermined samples are positioned at every m th position from the sample of the predetermined samples which adjoins the two sides of the predetermined block along the columns, wherein n, m>1. 5. The decoder of claim 4 , wherein n=m. 6. The decoder of claim 3 , configured to, along at least one of the rows and columns, perform the determining of the support value by downsampling or averaging, for each support value, a group of neighbouring samples within the plurality of neighbouring samples. 7. The decoder of claim 1 , wherein the operations further comprise: reading for the predetermined block a set index from the data stream; and selecting the linear or affine linear transformation depending on a set index (k) out of a predetermined set of linear or affine linear transformations. 8. A decoding method for predicting a predetermined block of a picture using a plurality of neighbouring samples, the decoding method comprising: subdividing the picture into a plurality of blocks of different block sizes, the plurality of blocks comprising the predetermined block; reducing, by downsampling or averaging, the plurality of neighbouring samples to obtain a reduced set of sample values lower, in number of samples, than compared to the plurality of neighbouring samples; obtaining predicted values by applying a matrix containing weighting factors to the reduced set of sample values; selecting a linear or affine linear transformation depending on a width W and height H of the predetermined block such that the linear or affine linear transformation selected for the predetermined block is selected out of a first set of linear or affine linear transformations as long as the width W and height H of the predetermined block are within a first set of width/height pairs and a second set of linear or affine linear transformations as long as the width W and height H of the predetermined block are within a second set of width/height pairs which is disjoint to the first set of width/height pairs; and deriving, by interpolation, prediction values for further samples of the predetermined block based on the predicted values for the plurality of neighbouring samples. 9. An encoder for encoding a picture into a data stream, the encoder configured to predict a predetermined block of the picture using a plurality of neighbouring samples, the encoder comprising: a non-transitory computer-readable medium; and at least one processor communicatively coupled to the non-transitory computer-readable medium, wherein the at least one processor is configured to read instructions from the non-transitory computer-readable medium to perform operations comprising: subdividing the picture into a plurality of blocks of different block sizes, the plurality of blocks comprising the predetermined block; reducing the plurality of neighbouring samples to obtain a reduced set of sample values lower, in number of samples, than compared to the plurality of neighbouring samples; obtaining predicted values by applying a matrix containing weighting factors to the reduced set of sample values; selecting a linear or affine linear transformation depending on a width W and height H of the predetermined block such that the linear or affine linear transformation selected for the predetermined block is selected out of a first set of linear or affine linear transformations as long as the width W and height H of the predetermined block are within a first set of width/height pairs and a second set of linear or affine linear transformations as long as the width W and height H of the predetermined block are within a second set of width/height pairs which is disjoint to the first set of width/height pairs; and deriving, by interpolation, prediction values for further samples of the predetermined block based on the predicted values for the plurality of neighbouring samples. 10. The encoder of claim 9 , the operations further performing the reducing by averaging, wherein averaging includes bit-shifting. 11. The encoder of claim 9 , wherein the plurality of neighbouring samples extend one-dimensionally along two sides of the predetermined block, predetermined samples are arranged in rows and columns and, along at least one of the rows and columns, the predetermined samples are positioned at every n th position from a sample of the predetermined sample adjoining the two sides of the predetermined block and the operations further comprise, based on the plurality of neighbouring samples, determining for each of the at least one of the rows and the columns, a support value for one of a plurality of neighbouring positions. 12. The encoder of claim 11 , wherein the predetermined samples are positioned at every n th position from the sample of the predetermined samples which adjoins the two sides of the predetermined block along the rows and the predetermined samples are positioned at every m th position from the sample of the predetermined samples which adjoins the two sides of

Assignees

Inventors

Classifications

  • H04N19/61Primary

    in combination with predictive coding · CPC title

  • involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution · CPC title

  • Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder · CPC title

  • the unit being bits, e.g. of the compressed video stream · CPC title

  • the unit being a pixel · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12075091B2 cover?
Techniques for encoding/decoding of video signals are described, e.g. implemented in decoders, encoder, methods and non-transitory storage units storing instructions to perform the methods. A decoder or encoder may be configured to predict a predetermined block of the picture using a plurality of neighbouring samples by reducing the plurality of neighbouring samples to obtain a reduced se…
Who is the assignee on this patent?
Fraunhofer Ges Forschung
What technology area does this patent fall under?
Primary CPC classification H04N19/61. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Aug 27 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).