Optimizing encoding operations when generating encoded versions of a media title

US11818375B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11818375-B2
Application numberUS-202117504412-A
CountryUS
Kind codeB2
Filing dateOct 18, 2021
Priority dateFeb 23, 2017
Publication dateNov 14, 2023
Grant dateNov 14, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In various embodiments, a sequence-based encoding application partitions a set of shot sequences associated with a media title into multiple clusters based on at least one feature that characterizes media content and/or encoded media content associated with the media title. The clusters include at least a first cluster and a second cluster. The sequence-based encoding application encodes a first shot sequence using a first operating point to generate a first encoded shot sequence. The first shot sequence and the first operating point are associated with the first cluster. By contrast, the sequence-based encoding application encodes a second shot sequence using a second operating point to generate a second encoded shot sequence. The second shot sequence and the second operating point are associated with the second cluster. Subsequently, the sequence-based encoding application generates an encoded media sequence based on the first encoded shot sequence and the second encoded shot sequence.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for streaming a media title, the method comprising: selecting a first encoded media sequence from a plurality of encoded media sequences, wherein the plurality of encoded media sequences is associated with a plurality of encoding points, wherein the plurality of encoding points is associated with a plurality of clusters, and wherein the plurality of clusters is based on a partitioning of shot sequences associated with the media title according to at least one feature that characterizes at least one of media content or encoded media content associated with the media title; and requesting at least a portion of the first encoded media sequence from a server machine for playback. 2. The computer-implemented method of claim 1 , wherein the first encoded media sequence is selected based on at least one of a bitrate or a resolution. 3. The computer-implemented method of claim 1 , wherein the first encoded media sequence comprises a plurality of encoded shot sequences generated using the plurality of encoding points. 4. The computer-implemented method of claim 1 , wherein the at least one feature comprises at least one of a brightness, a colorfulness, a texture detail, a degree of motion uniformity, or a number of edges. 5. The computer-implemented method of claim 1 , wherein the shot sequences are partitioned by: for each shot sequence, extracting a feature vector that includes a set of feature values associated with the at least one feature; and performing one or more clustering operations on the shot sequences based on the feature vectors to generate the plurality of clusters. 6. The computer-implemented method of claim 5 , wherein the at least one feature comprises at least one of a brightness, a colorfulness, a texture detail, a degree of motion uniformity, or a number of edges. 7. The computer-implemented method of claim 5 , wherein the one or more clustering operations are associated with a K-Means algorithm or a trained neural network. 8. The computer-implemented method of claim 1 , wherein each encoding point included in the plurality of encoding points is generated based on a global convex hull representing encoding tradeoffs between distortion level and bitrate. 9. The computer-implemented method of claim 1 , wherein a number of encoding points included in the plurality of encoding points is less than a number of shot sequences included in the plurality of shot sequences. 10. The computer-implemented method of claim 1 , wherein the media title comprises at least one of video content or audio content. 11. One or more computer-readable storage media storing instructions that, when executed by one or more processors, cause the one or more processors to perform the steps of: selecting a first encoded media sequence from a plurality of encoded media sequences, wherein the plurality of encoded media sequences is associated with a plurality of encoding points, wherein the plurality of encoding points is associated with a plurality of clusters, and wherein the plurality of clusters is based on a partitioning of shot sequences associated with the media title; and requesting at least a portion of the first encoded media sequence from a server machine for playback. 12. The one or more non-transitory computer-readable media of claim 11 , wherein the first encoded media sequence is selected based on at least one of a bitrate or a resolution. 13. The one or more non-transitory computer-readable media of claim 11 , wherein the first encoded media sequence comprises a plurality of encoded shot sequences generated using the plurality of encoding points. 14. The one or more non-transitory computer-readable media of claim 11 , wherein the shot sequences associated with the media title are partitioned according to at least one feature that characterizes at least one of media content or encoded media content associated with the media title, and the at least one feature comprises at least one of a brightness, a colorfulness, a texture detail, a degree of motion uniformity, or a number of edges. 15. The one or more non-transitory computer-readable media of claim 11 , wherein the shot sequences are partitioned by: for each shot sequence, extracting a feature vector that includes a set of feature values associated with at least one feature that characterizes at least one of media content or encoded media content associated with the media title; and performing one or more clustering operations on the shot sequences based on the feature vectors to generate the plurality of clusters. 16. The one or more non-transitory computer-readable media of claim 15 , wherein the at least one feature comprises at least one of a brightness, a colorfulness, a texture detail, a degree of motion uniformity, or a number of edges. 17. The one or more non-transitory computer-readable media of claim 15 , wherein the one or more clustering operations are associated with a K-Means algorithm or a trained neural network. 18. The one or more non-transitory computer-readable media of claim 11 , wherein each encoding point included in the plurality of encoding points is generated based on a global convex hull representing encoding tradeoffs between distortion level and bitrate. 19. The one or more non-transitory computer-readable media of claim 11 , wherein a number of encoding points included in the plurality of encoding points is less than a number of shot sequences included in the plurality of shot sequences. 20. The one or more non-transitory computer-readable media of claim 11 , wherein the media title comprises at least one of video content or audio content. 21. An endpoint device, comprising: one or more memories storing instructions; and one or more processors that are coupled to the one or more memories and, when executing the instructions, are configured to perform the steps of: selecting a first encoded media sequence from a plurality of encoded media sequences, wherein the plurality of encoded media sequences is associated with a plurality of encoding points, wherein the plurality of encoding points is associated with a plurality of clusters associated with the media title; and requesting at least a portion of the first encoded media sequence from a server machine for playback.

Assignees

Inventors

Classifications

  • H04N19/42Primary

    characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation (H04N19/635 takes precedence) · CPC title

  • Selection of coding mode or of prediction mode · CPC title

  • Detection of scene cut or scene change · CPC title

  • the unit being a group of pictures [GOP] · CPC title

  • H04N19/179Primary

    the unit being a scene or a shot · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11818375B2 cover?
In various embodiments, a sequence-based encoding application partitions a set of shot sequences associated with a media title into multiple clusters based on at least one feature that characterizes media content and/or encoded media content associated with the media title. The clusters include at least a first cluster and a second cluster. The sequence-based encoding application encodes a firs…
Who is the assignee on this patent?
Netflix Inc
What technology area does this patent fall under?
Primary CPC classification H04N19/42. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Nov 14 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).