Use-case driven context model selection for hybrid video coding tools

US12418654B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12418654-B2
Application numberUS-202418647776-A
CountryUS
Kind codeB2
Filing dateApr 26, 2024
Priority dateMar 5, 2019
Publication dateSep 16, 2025
Grant dateSep 16, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An apparatus includes an encoder, which receives image or video data, encodes the data and provides a bitstream. The encoder includes a CABAC encoder, which receives a binary valued syntax element associated with a block of the data to be encoded, and encodes, using a selected context model, the syntax element into coded bits for the bitstream. The syntax element includes a tool flag indicating whether or not certain coding tool is employed when encoding a block of the data. For a block of the data which exhibits an aspect ratio larger than 2 and for a block of the data which exhibits an aspect ratio less than or equal to 2, and to which blocks the certain coding tool is applicable, first and second context models for encoding the tool flag are selected from a group of one or more first and second context models, respectively.

First claim

Opening claim text (preview).

The invention claimed is: 1. A decoder for decoding a picture, the decoder comprising at least one processor configured to: receive a bitstream comprising encoded data representing the picture; for a block of the picture, determine whether a first dimension of the block is larger than two times a second dimension of the block; select a context based on the determination, wherein: the context is a first context when the first dimension of the block is larger than two times the second dimension of the block, or the context is a second context, different than the first context, when the first dimension of the block is not larger than two times the second dimension of the block; using the selected context, decode a flag associated with the block from the bitstream, the flag indicating whether to use a linear weighted intra prediction (LWIP) coding tool to decode the block; and decode the block based in part on the flag. 2. The decoder of claim 1 , wherein: the first dimension of the block is a height of the block and the second dimension is a width of the block; or the first dimension of the block is the width of the block and the second dimension is the height of the block. 3. The decoder of claim 1 , wherein the at least one processor is further configured to: in response to the flag indicating a first value, determine to use the LWIP coding tool to decode the block; and in response to the flag indicating a second value, different than the first value, determine to not use the LWIP coding tool to decode the block. 4. The decoder of claim 3 , wherein: the first value is one; and the second value is zero. 5. The decoder of claim 3 , wherein the at least one processor is further configured to: after the determination to use the LWIP coding tool to decode the block, decode a mode index for a set of available LWIP modes. 6. The decoder of claim 1 , wherein: the LWIP coding tool corresponds to matrix based intra-prediction. 7. The decoder of claim 1 , wherein: the first dimension of the block is larger than two times the second dimension of the block indicates that an aspect ratio of the block is larger than 2. 8. A method for decoding a picture, the method comprising: receiving a bitstream comprising encoded data representing the picture; for a block of the picture, determining whether a first dimension of the block is larger than two times a second dimension of the block; selecting a context based on the determination, wherein: the context is a first context when the first dimension of the block is larger than two times the second dimension of the block, or the context is a second context, different than the first context, when the first dimension of the block is not larger than two times the second dimension of the block; using the selected context, decoding a flag associated with the block from the bitstream, the flag indicating whether to use a linear weighted intra prediction (LWIP) coding tool to decode the block; and decoding the block based in part on the flag. 9. The method of claim 8 , wherein: the first dimension of the block is a height of the block and the second dimension is a width of the block; or the first dimension of the block is the width of the block and the second dimension is the height of the block. 10. The method of claim 8 , further comprising: in response to the flag indicating a first value, determining to use the LWIP coding tool to decode the block; and in response to the flag indicating a second value, different than the first value, determining to not use the LWIP coding tool to decode the block. 11. The method of claim 10 , wherein: the first value is one; and the second value is zero. 12. The method of claim 10 , further comprising: after the determination to use the LWIP coding tool to decode the block, decoding a mode index for a set of available LWIP modes. 13. The method of claim 8 , wherein: the LWIP coding tool corresponds to matrix based intra-prediction. 14. The method of claim 8 , wherein: the first dimension of the block is larger than two times the second dimension of the block indicates that an aspect ratio of the block is larger than 2. 15. A non-transitory computer readable medium containing instructions that when executed cause at least one processor to: receive a bitstream comprising encoded data representing a picture; for a block of the picture, determine whether a first dimension of the block is larger than two times a second dimension of the block; select a context based on the determination, wherein: the context is a first context when the first dimension of the block is larger than two times the second dimension of the block, or the context is a second context, different than the first context, when the first dimension of the block is not larger than two times the second dimension of the block; use the selected context, decode a flag associated with the block from the bitstream, the flag indicating whether to use a linear weighted intra prediction (LWIP) coding tool to decode the block; and decode the block based in part on the flag. 16. The non-transitory computer readable medium of claim 15 , wherein: the first dimension of the block is a height of the block and the second dimension is a width of the block; or the first dimension of the block is the width of the block and the second dimension is the height of the block. 17. The non-transitory computer readable medium of claim 15 , wherein: in response to the flag indicating a first value, the instructions that when executed cause the at least one processor determine to use the LWIP coding tool to decode the block; and in response to the flag indicating a second value, the instructions that when executed cause the at least one processor different than the first value, determine to not use the LWIP coding tool to decode the block. 18. The non-transitory computer readable medium of claim 17 , wherein: the first value is one; and the second value is zero. 19. The non-transitory computer readable medium of claim 17 , wherein: after the determination to use the LWIP coding tool to decode the block, the instructions that when executed cause the at least one processor decode a mode index for a set of available LWIP modes. 20. The non-transitory computer readable medium of claim 15 , wherein: the LWIP coding tool corresponds to matrix based intra-prediction, and the first dimension of the block is larger than two times the second dimension of the block indicates that an aspect ratio of the block is larger than 2. 21. An encoder for encoding a picture into a bitstream, the encoder comprising at least one processor configured to: for a block of the picture, determine whether a first dimension of the block is larger than two times a second dimension of the block; select a context based on the determination, wherein: the context is a first context when the first dimension of the block is larger than two times the second dimension of the block, or the context is a second context, different than the first context, when the first dimension of the block is not larger than two times the second dimension of the block; using the selected context, encode a flag associated with the block to the bitstream, the flag indicating whether to use a linear weighted intra prediction (LWIP) coding tool to decode the block; and encode the block based in part on the flag. 22. A method for encoding a picture t

Assignees

Inventors

Classifications

  • characterised by syntax aspects related to video coding, e.g. related to compression standards · CPC title

  • Embedding additional information in the video signal during the compression process (H04N19/517, H04N19/68, H04N19/70 take precedence) · CPC title

  • the region being a block, e.g. a macroblock · CPC title

  • the unit being bits, e.g. of the compressed video stream · CPC title

  • Tree coding, e.g. quad-tree coding · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12418654B2 cover?
An apparatus includes an encoder, which receives image or video data, encodes the data and provides a bitstream. The encoder includes a CABAC encoder, which receives a binary valued syntax element associated with a block of the data to be encoded, and encodes, using a selected context model, the syntax element into coded bits for the bitstream. The syntax element includes a tool flag indicating…
Who is the assignee on this patent?
Fraunhofer Ges Forschung, Fraunhofer Ges Zur Foerderung Derangewandten Forschung E V
What technology area does this patent fall under?
Primary CPC classification H04N19/13. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Sep 16 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).