Combined upscaler and lcevc encoder
US-2024283955-A1 · Aug 22, 2024 · US
US2025080751A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2025080751-A1 |
| Application number | US-202418816444-A |
| Country | US |
| Kind code | A1 |
| Filing date | Aug 27, 2024 |
| Priority date | Aug 30, 2023 |
| Publication date | Mar 6, 2025 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Methods and systems for rate control include determining an encoding parameter value to use for an input set of video frames based on a current channel capacity, using a machine learning model that accepts the input set of video frames and the current channel capacity as inputs. The input set of video frames are encoded using the encoding parameter to generate encoded video that has a bitrate below the current channel capacity. The encoded video is transmitted.
Opening claim text (preview).
What is claimed is: 1 . A computer-implemented method for rate control, comprising: determining an encoding parameter value to use for an input set of video frames based on a current channel capacity, using a machine learning model that accepts the input set of video frames and the current channel capacity as inputs; encoding the input set of video frames using the encoding parameter to generate encoded video that has a bitrate below the current channel capacity; and transmitting the encoded video. 2 . The method of claim 1 , further comprising determining the current channel capacity based on channel quality information received from user equipment. 3 . The method of claim 1 , wherein the machine learning model includes a prediction head model that is trained to generate a parameter value that, when used to encode the input set of video frames, results in the encoded video being at or below the current channel capacity. 4 . The method of claim 3 , wherein the prediction head model is a deep neural network model that includes conditional group normalization using the current channel capacity as a condition. 5 . The method of claim 4 , wherein the prediction head model includes a plurality of convolutional layers, each followed by a respective conditional group normalization. 6 . The method of claim 1 , wherein the encoding parameter is a quantization parameter. 7 . The method of claim 1 , further comprising altering the determined encoding parameter value to decrease video quality before encoding the video. 8 . The method of claim 1 , wherein determining the encoding parameter value includes extracting features from the input set of video frames and processing the features with the current channel capacity in a prediction head model. 9 . The method of claim 1 , wherein the encoded video is transmitted to a medical professional to aid in medical decision making. 10 . The method of claim 1 , further comprising performing a treatment action responsive to the encoded video, including automatically altering a patient's treatment in response to a patient activity shown in the encoded video. 11 . The method of claim 1 , wherein determining the encoding parameter value includes maximization of average video quality of a live video feed, subject to the current channel capacity available for the set of video frames and minimization of a probability of packet drop and video artifacts in transmission of the encoded video. 12 . A system for rate control, comprising: a hardware processor; and a memory that stores a computer program which, when executed by the hardware processor, causes the hardware processor to: determine an encoding parameter value to use for an input set of video frames based on a current channel capacity, using a machine learning model that accepts the input set of video frames and the current channel capacity as inputs; encode the input set of video frames using the encoding parameter to generate encoded video that has a bitrate below the current channel capacity; and transmit the encoded video. 13 . The system of claim 12 , wherein the computer program further causes the hardware processor to determine the current channel capacity based on channel quality information received from user equipment. 14 . The system of claim 12 , wherein the machine learning model includes a prediction head model that is trained to generate a parameter value that, when used to encode the input set of video frames, results in the encoded video being at or below the current channel capacity. 15 . The system of claim 14 , wherein the prediction head model is a deep neural network model that includes conditional group normalization using the current channel capacity as a condition. 16 . The system of claim 15 , wherein the prediction head model includes a plurality of convolutional layers, each followed by a respective conditional group normalization. 17 . The system of claim 12 , wherein the computer program further causes the hardware processor to alter the determined encoding parameter value to decrease video quality before encoding the video. 18 . The system of claim 12 , wherein the computer program further causes the hardware processor to extract features from the input set of video frames and to process the features with the current channel capacity in a prediction head model. 19 . The system of claim 12 , wherein the encoded video is transmitted to a medical professional to aid in medical decision making. 20 . The system of claim 12 , wherein the computer program further causes the hardware processor to perform a treatment action responsive to the encoded video, including automatically alteration of a patient's treatment in response to a patient activity shown in the encoded video.
Quantisation · CPC title
Selection of the code volume for a coding unit prior to coding · CPC title
by estimating the code amount by means of a model, e.g. mathematical model or statistical model · CPC title
the region being a picture, frame or field · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.