Content-aware, machine-learning-based rate control

US12335486B2 · US · B2

Patent metadata
Publication number: US-12335486-B2
Application number: US-202318096428-A
Country: US
Kind code: B2
Filing date: Jan 12, 2023
Priority date: Jan 12, 2023
Publication date: Jun 17, 2025
Grant date: Jun 17, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system includes a processing device to receive video content, metadata related to the video content, and a target bit rate for encoding the video content. The processing device further detects a content type of the video content based on the metadata. The system further includes encoding hardware to perform frame encoding on the video content and a controller coupled between the processing device and the encoding hardware. The controller is programmed with machine instructions to generate first quantization parameter (QP) values on a per-frame basis using a frame machine learning model with a first plurality of weights. The first plurality of weights depends at least in part on the content type and the target bit rate. The controller further provides the first QP values to the encoding hardware for rate control of the frame encoding.
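As a rough illustration of the flow the abstract describes, the sketch below picks a weight set from the content type and target bit rate, then maps per-frame features to a QP value. Everything named here (`WEIGHT_TABLE`, `select_weights`, `frame_qp`, the two-element feature vector, the linear model, and all numeric weights) is hypothetical: this page does not disclose a model architecture or trained weights, and the 0 to 51 clamp assumes an H.264/HEVC-style QP range.

```python
from typing import Dict, List, Tuple

# Hypothetical weight sets keyed by (content type, target bit rate in kbps).
# Each entry is (bias, per-feature weights); the numbers are made up.
WEIGHT_TABLE: Dict[Tuple[str, int], Tuple[float, List[float]]] = {
    ("gaming", 4000): (30.0, [8.0, 4.0]),
    ("gaming", 8000): (26.0, [6.0, 3.0]),
    ("screen", 4000): (24.0, [5.0, 2.0]),
}

QP_MIN, QP_MAX = 0, 51  # assumption: H.264/HEVC-style QP range


def select_weights(content_type: str, target_kbps: int) -> Tuple[float, List[float]]:
    """Return the weight set for this content type at the nearest listed bit rate."""
    candidates = [key for key in WEIGHT_TABLE if key[0] == content_type]
    if not candidates:
        raise KeyError(f"no weights for content type {content_type!r}")
    nearest = min(candidates, key=lambda key: abs(key[1] - target_kbps))
    return WEIGHT_TABLE[nearest]


def frame_qp(features: List[float], weights: Tuple[float, List[float]]) -> int:
    """Toy linear model: bias plus weighted features, clamped to the QP range."""
    bias, per_feature = weights
    raw = bias + sum(w * f for w, f in zip(per_feature, features))
    return max(QP_MIN, min(QP_MAX, round(raw)))


# Usage: a "gaming" stream at 5000 kbps falls back to the 4000 kbps weights.
weights = select_weights("gaming", 5000)
qp = frame_qp([0.5, 0.25], weights)
```

A lookup table keyed by content type and bit rate is only one way to make the weights "depend on" those inputs; a real controller could equally feed them to the model as extra inputs.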

First claim

Opening claim text (preview).

What is claimed is:

1. A system comprising: a processing device to: receive video content, metadata related to the video content, and a target bit rate for encoding the video content; and detect a content type of the video content based on one or more tags within the metadata, wherein the one or more tags are indicative of the content type received from a particular video streaming source device; encoding hardware to perform frame encoding on the video content and to generate frame statistics based on one or more encoded frames of the video content corresponding to a current frame; and a controller coupled between the processing device and the encoding hardware, the controller programmed with machine instructions to: receive the frame statistics from the encoding hardware; generate a first quantization parameter (QP) value of the current frame using a frame machine learning model with a first plurality of weights, wherein the first plurality of weights depends at least in part on the content type, the target bit rate, and the frame statistics; and provide the first QP value directly to the encoding hardware for rate control of the frame encoding.

2. The system of claim 1, wherein the statistics include one or more of block-related metadata, frame-related metadata, bit budget information, or complexity motion information.

3. The system of claim 1, wherein corresponding to the current frame comprises one of being adjacent, neighboring frames, frames within a same block, or frames within a same sub-block of the video content.

4. The system of claim 1, wherein the processing device is further to: retrieve a plurality of parameters related to the detected content type; and provide the plurality of parameters to the controller; and wherein the machine instructions are further to select the first plurality of weights corresponding to the plurality of parameters.

5. The system of claim 1, wherein the machine learning model is a frame reinforcement learning model that is instantiated in a neural network, wherein the neural network uses the first plurality of weights to maximize a reward function of the neural network while encoding a plurality of frames of the video content.

6. The system of claim 1, wherein the encoding hardware is further to perform sub-frame encoding and generate sub-frame statistics, and the machine instructions are further to: generate a second QP value of a current sub-frame using a sub-frame machine learning model with a second plurality of weights, wherein the second plurality of weights depends at least in part on the content type, the target bit rate, and the sub-frame statistics; and provide the second QP value to the encoding hardware for rate control of the sub-frame encoding.

7. The system of claim 6, wherein the encoding hardware is further to encode each respective sub-frame of a plurality of sub-frames using a respective one of a plurality of second QP values.

8. An integrated circuit comprising: encoding hardware to perform frame encoding on video content and to generate frame statistics based on one or more encoded frames of the video content corresponding to a current frame; and a processing device coupled to encoding hardware, wherein the processing device is to implement, using program code, a frame machine learning rate controller that is to: receive the video content, metadata related to the video content, a target bit rate, and the frame statistics for encoding the video content; detect a content type of the video content based on one or more tags within the metadata, wherein the one or more tags are indicative of the content type received from a particular video streaming source device; generate a first quantization parameter (QP) value of the current frame using a frame machine learning model with a first plurality of weights, wherein the first plurality of weights depends at least in part on the content type, the target bit rate, and the frame statistics; and provide the first QP value directly to the encoding hardware for rate control of the frame encoding.

9. The integrated circuit of claim 8, wherein the frame statistics include one or more of block-related metadata, frame-related metadata, bit budget information, or complexity motion information.

10. The integrated circuit of claim 8, wherein corresponding to the current frame comprises one of being adjacent, neighboring frames, frames within a same block, or frames within a same sub-block of the video content.

11. The integrated circuit of claim 8, wherein the processing device is further to: retrieve a plurality of parameters related to the detected content type; and select the first plurality of weights corresponding to the plurality of parameters.

12. The integrated circuit of claim 8, wherein the machine learning model is a frame reinforcement learning model that is instantiated in a neural network, wherein the neural network uses the first plurality of weights to maximize a reward function of the neural network while encoding a plurality of frames of the video content.

13. The integrated circuit of claim 8, wherein the encoding hardware is further to perform sub-frame encoding and generate sub-frame statistics, further comprising a controller coupled to the processing device and to execute machine instructions to: generate a second QP value of a current sub-frame using a sub-frame machine learning model with a second plurality of weights, wherein the second plurality of weights depends at least in part on the content type, the target bit rate, and the sub-frame statistics; and provide the second QP value to the encoding hardware for rate control of the sub-frame encoding.

14. The integrated circuit of claim 8, wherein the encoding hardware is further to encode each respective sub-frame of a plurality of sub-frames using a respective one of a plurality of second QP values.

15. A method comprising: receiving video content, metadata related to the video content, and a target bit rate for encoding the video content; receiving, by a processing device, from encoding hardware that performs frame encoding, frame statistics based on one or more encoded frames of the video content corresponding to a current frame; detecting a content type of the video content based on one or more tags within the metadata, wherein the one or more tags are indicative of the content type received from a particular video streaming source device; generating, by the processing device, a first quantization parameter (QP) value of the current frame using a frame machine learning model with a first plurality of weights, wherein the first plurality of weights depends at least in part on the content type, the target bit rate, and the frame statistics; and providing, by the processing device, the first QP value directly to encoding hardware for rate control of frame encoding by the encoding hardware.

16. The method of claim 15, wherein the frame statistics include one or more of block-related metadata, frame-related metadata, bit budget information, or complexity motion information.

17. The method of claim 15, further comprising: retrieving a plurality of parameters related to the detected content type; and selecting the first plurality of weights corresponding to the plurality of parameters.

18. The method of claim 15, wherein the machine learning model is a frame reinforcement learning model that is instantiated in a neural network, wherein the neural network uses the first plurality of weights to maximize a reward function of the neural ne
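The independent claims describe a feedback loop: the encoder reports statistics for frames it has already encoded, and the controller uses them, together with the content type and target bit rate, to choose the QP for the current frame. The sketch below mimics only that loop. It is not the claimed machine learning model: `next_qp` is a hand-written proportional rule standing in for it, `ToyEncoder` is an invented stand-in for encoding hardware (its bit count halves every 6 QP steps, loosely modeled on how the quantizer step size doubles every 6 QP in H.264/HEVC), and the tag names, gains, and statistics fields are all assumptions.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class FrameStats:
    """Hypothetical per-frame statistics reported by the encoder."""
    bits_used: int      # size of the most recently encoded frame
    complexity: float   # made-up complexity/motion score in [0, 1]


def detect_content_type(metadata: Dict[str, List[str]]) -> str:
    """Pick a content type from tags in the metadata (tag names are invented)."""
    known = ("gaming", "screen", "camera")
    for tag in metadata.get("tags", []):
        if tag in known:
            return tag
    return "generic"


def next_qp(prev_qp: int, stats: FrameStats, frame_budget_bits: int, gain: float) -> int:
    """Stand-in for the frame model: raise QP on overshoot, lower it on undershoot."""
    overshoot = (stats.bits_used - frame_budget_bits) / frame_budget_bits
    step = round(gain * overshoot + 2.0 * (stats.complexity - 0.5))
    return max(0, min(51, prev_qp + step))


class ToyEncoder:
    """Toy stand-in for encoding hardware: higher QP yields fewer bits."""

    def encode(self, complexity: float, qp: int) -> FrameStats:
        bits = int(400_000 * complexity * 2 ** (-(qp - 20) / 6))
        return FrameStats(bits_used=bits, complexity=complexity)


def run(metadata: Dict[str, List[str]], target_bps: int, fps: int,
        complexities: List[float]) -> List[int]:
    """Encode one frame per complexity value and return the QP used for each."""
    content_type = detect_content_type(metadata)
    # Hypothetical per-content-type gain, standing in for selected weights.
    gain = {"gaming": 6.0, "screen": 3.0}.get(content_type, 4.0)
    budget = target_bps // fps
    encoder, qp, qps = ToyEncoder(), 30, []
    for complexity in complexities:
        qps.append(qp)
        stats = encoder.encode(complexity, qp)   # encoder reports statistics
        qp = next_qp(qp, stats, budget, gain)    # controller picks the next QP
    return qps
```

Calling `run({"tags": ["live", "gaming"]}, 6_000_000, 60, [0.3, 0.9, 0.9, 0.4])` returns one QP per frame, each chosen from the statistics of the frame before it.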

Assignees

Mellanox Technologies Ltd

Inventors

Classifications

  • H04N19/172 · Primary

    the region being a picture, frame or field · CPC title

  • Quantisation · CPC title

  • H04N19/147 · Primary

    according to rate distortion criteria (rate-distortion as a criterion for motion estimation H04N19/567) · CPC title

  • Measured or subjectively estimated visual quality after decoding, e.g. measurement of distortion (use of rate-distortion criteria H04N19/147) · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12335486B2 cover?
A system includes a processing device to receive video content, metadata related to the video content, and a target bit rate for encoding the video content. The processing device further detects a content type of the video content based on the metadata. The system further includes encoding hardware to perform frame encoding on the video content and a controller coupled between the processing dev…
Who is the assignee on this patent?
Mellanox Technologies Ltd
What technology area does this patent fall under?
Primary CPC classification H04N19/172. Mapped technology areas include Electricity.
When was this patent published?
Publication date Jun 17, 2025 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).