Differential adaptive bitrate streaming based on scene complexity
US-2020351504-A1 · Nov 5, 2020 · US
US11350104B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11350104-B2 |
| Application number | US-202016913329-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jun 26, 2020 |
| Priority date | Jun 26, 2019 |
| Publication date | May 31, 2022 |
| Grant date | May 31, 2022 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Disclosed is a method, implemented by computer, for processing a video sequence including a set of images, which method includes: obtaining information indicating at least one image in the set of images to be encoded using a spatial correlation-based predictive coding mode, determining consecutive subsets of images in the set of images, and encoding the video sequence on the basis of the determined consecutive subsets of images, wherein the respective sizes of at least some of the subsets of images are dependent on the at least one image to be encoded using the spatial correlation-based predictive coding mode.
Opening claim text (preview).
The invention claimed is: 1. A method implemented by computer means, for processing a video sequence comprising a set of images, the method comprising: obtaining information indicating at least one image in the set of images to be encoded using a spatial correlation-based predictive coding mode; determining consecutive subsets of images in the set of images; encoding the video sequence on the basis of the determined consecutive subsets of images; wherein the respective sizes, in number of images or in duration, of at least some of the subsets of images are dependent on the at least one image to be encoded using the spatial correlation-based predictive coding mode. 2. The method according to claim 1 , wherein the at least one image to be encoded using a spatial correlation-based predictive coding mode corresponds to a scene change in the video sequence or to a sub-chunk creation request. 3. The method according to claim 1 , wherein determining at least one subset uses a criterion relating to the size of the at least one subset. 4. The method according to claim 3 , wherein the criterion is a minimum size criterion and/or a maximum size criterion. 5. The method according to claim 1 , wherein determining at least one subset comprises: determining a starting image of the subset in the set of images; determining, from among the images positioned in the sequence starting from the starting image of the subset, a first candidate image in the plurality of images to be encoded using a spatial correlation-based predictive coding mode; determining a distance between the starting image of the subset and the first candidate image; comparing the distance with a threshold corresponding to a minimum size criterion or to a maximum size criterion; determining, in the set of images, an end image of the subset on the basis of the comparison with the threshold. 6. The method according to claim 5 , comprising, when the threshold corresponds to a minimum size: when the distance is less than the threshold, determining, from among the images positioned in the sequence starting from the first candidate image, a second candidate image in the plurality of images to be encoded using a spatial correlation-based predictive coding mode; determining the subset using the second candidate image. 7. The method according to claim 5 , comprising, when the threshold corresponds to a maximum size: when the distance is greater than the threshold, determining, from among the images positioned in the sequence starting from the starting image of the subset, an end image of the subset on the basis of a predetermined sub-chunk size. 8. The method according to claim 1 , further comprising: determining, in the set of images, a plurality of reference images whose positions in the sequence are separated in pairs by a predetermined distance; determining, in the set of images, for at least one subset, a starting image of the subset; determining a reference image positioned in the sequence starting from the starting image of the subset; determining, in the plurality of images to be encoded using a spatial correlation-based predictive coding mode, from among the images positioned in the sequence around the reference image and at a distance less than a predetermined variation threshold, a starting image of the subset immediately following the subset in a string of consecutive subsets of images; determining, from among the images positioned in the sequence starting from the starting image of the subset, an end image of the subset on the basis of the starting image of the following subset. 9. A video sequence encoding device, comprising an input interface configured so as to receive a set of images of the video sequence; and an image encoding unit operationally coupled to the input interface and configured so as to process the video sequence using a method for processing a video sequence comprising a set of images, the method comprising: obtaining information indicating at least one image in the set images to be encoded using a spatial correlation-based predictive coding mode; determining consecutive subsets of images in the set of images; encoding the video sequence on the basis of the determined consecutive subsets of images; wherein the respective sizes, in number of images or in duration, of at least some of the subsets of images are dependent on the at least one image to be encoded using the spatial correlation-based predictive coding mode. 10. The encoding device according to claim 9 , wherein the determining at least one subset uses a criterion relating to the size of the at least one subset. 11. The encoding device according to claim 10 , wherein the criterion is a minimum size criterion and/or a maximum size criterion. 12. The encoding device according to claim 9 , wherein determining at least one subset comprises: determining a starting image of the subset in the set of images; determining, from among the images positioned in the sequence starting from the starting image of the subset, a first candidate image in the plurality of images to be encoded using a spatial correlation-based predictive coding mode; determining a distance between the starting image of the subset and the first candidate image; comparing the distance with a threshold corresponding to a minimum size criterion or to a maximum size criterion; determining, in the set of images, an end image of the subset on the basis of the comparison with the threshold. 13. The encoding device according to claim 12 , comprising, when the threshold corresponds to a minimum size: when the distance is less than the threshold, determining, from among the images positioned in the sequence starting from the first candidate image, a second candidate image in the plurality of images to be encoded using a spatial correlation-based predictive coding mode; determining the subset using the second candidate image. 14. The encoding device according to claim 12 , comprising, when the threshold corresponds to a maximum size: when the distance is greater than the threshold, determining, from among the images positioned in the sequence starting from the starting image of the subset, an end image of the subset on the basis of a predetermined sub-chunk size. 15. A non-transitory computer-readable storage medium for storing a program able to be executed by a computer, comprising a dataset representing one or more programs, said one or more programs comprising instructions for, when said one or more programs are executed by a computer comprising a processing unit operationally coupled to memory means and to an input/output interface module, causing the computer to perform a method for processing a video sequence comprising a set of images, the method comprising: obtaining information indicating at least one image in the set of images to be encoded using a spatial correlation-based predictive coding mode; determining consecutive subsets of images in the set of images; encoding the video sequence on the basis of the determined consecutive subsets of images; wherein the respective sizes, in number of images or in duration, of at least some of the subsets of images are dependent on the at least one image to be encoded using the spatial correlation-based predictive coding mode. 16. The non-transitory computer-readable storage medium according to claim 15 , wherein determining at least one subset uses a criterion relating to the size of the at least one subset. 17. The non-transitory computer-readable storage medium according to claim 16 , wherein the crit
involving spatial prediction techniques · CPC title
Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction · CPC title
User input · CPC title
Adapting the group of pictures [GOP] structure, e.g. number of B-frames between two anchor frames (H04N19/107 takes precedence) · CPC title
the region being a picture, frame or field · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.