Systems and methods implementing a machine learning architecture for video processing

US12283084B2 · US · B2

Patent metadata
Publication number: US-12283084-B2
Application number: US-202418948428-A
Country: US
Kind code: B2
Filing date: Nov 14, 2024
Priority date: Jul 26, 2017
Publication date: Apr 22, 2025
Grant date: Apr 22, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure describes a method comprising receiving a video; segmenting the video into a plurality of segments, each of the plurality of segments comprising a plurality of images; executing one or more machine learning models using the plurality of segments to generate a segment score for each of the plurality of segments, the segment score for a segment indicating a likelihood that a user will interact with the segment; generating a video performance score for the video as a function of the segment scores for the plurality of segments; and generating a record comprising the video performance score for the video and an identification of the video.
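The flow in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `Segment`, `segment_frames`, `score_video`, and `mean_brightness` are hypothetical, frames are reduced to one float each, the scorer is a trivial stand-in for the machine learning models, and a plain mean is only one possible "function of the segment scores".

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence


@dataclass
class Segment:
    start: int  # index of the first frame (inclusive)
    end: int    # index one past the last frame (exclusive)


def segment_frames(num_frames: int, length: int, overlap: int) -> List[Segment]:
    """Split frame indices into fixed-length windows overlapping by `overlap` frames."""
    if length <= 0 or not 0 <= overlap < length:
        raise ValueError("need length > 0 and 0 <= overlap < length")
    step = length - overlap
    segments: List[Segment] = []
    start = 0
    while start < num_frames:
        end = min(start + length, num_frames)
        segments.append(Segment(start, end))
        if end == num_frames:  # the last window reached the end of the video
            break
        start += step
    return segments


def mean_brightness(frames: Sequence[float]) -> float:
    """Hypothetical stand-in for a learned segment scorer (assumption, not from the patent)."""
    return sum(frames) / len(frames)


def score_video(
    video_id: str,
    frames: Sequence[float],
    score_segment: Callable[[Sequence[float]], float],
    length: int = 4,
    overlap: int = 2,
) -> Dict[str, object]:
    """Segment the video, score each segment, and build a record for the video."""
    segments = segment_frames(len(frames), length, overlap)
    scores = [score_segment(frames[s.start:s.end]) for s in segments]
    # Assumption: the video score is the unweighted mean of the segment scores.
    video_score = sum(scores) / len(scores) if scores else 0.0
    return {
        "video_id": video_id,
        "video_performance_score": video_score,
        "segment_scores": scores,
    }


record = score_video("demo-video", [0.0, 1.0] * 5, mean_brightness)
```

In a real system the scorer would be replaced by a trained model operating on image data; the windowing and record-building steps would stay structurally similar.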

First claim

Opening claim text (preview).

What is claimed is:

1. A method, comprising: receiving, by one or more processors, a video; segmenting, by the one or more processors, the video into a plurality of segments, each of the plurality of segments comprising a plurality of images; executing, by the one or more processors, one or more machine learning models using the plurality of segments to generate a segment score for each of the plurality of segments, the segment score for a segment indicating a likelihood that a user will interact with the segment; generating, by the one or more processors, a video performance score for the video as a function of the segment scores for the plurality of segments; and generating, by the one or more processors, a record comprising the video performance score for the video and an identification of the video.

2. The method of claim 1, wherein segmenting the video into a plurality of segments comprises segmenting, by the one or more processors, the video into the plurality of segments each having a defined length and having a defined overlap between pairs of sequential segments of the plurality of segments.

3. The method of claim 1, further comprising: identifying, by the one or more processors, edit points in the videos; and segmenting, by the one or more processors, the plurality of segments based on the identified edit points.

4. The method of claim 1, wherein executing the one or more machine learning model to generate the segment score for each of the plurality of segments comprises: iteratively executing, by the one or more processors, a feature extraction machine learning model using the plurality of segments to generate a segment embedding for each of the plurality of segments; and iteratively executing, by the one or more processors, a content scoring machine learning model based on the plurality of segments to generate the segment score for each of the plurality of segments.

5. The method of claim 1, wherein generating the video performance score for the video comprises: aggregating, by the one or more processors, the segment scores of the plurality of segments to generate the video performance score.

6. The method of claim 5, wherein aggregating the segments scores comprises: assigning, by the one or more processors, weights to the segment scores according to lengths of the segments corresponding to the segment scores; and aggregating, by the one or more processors, the segment scores according to the assigned weights.

7. The method of claim 5, wherein generating the video performance score for the video comprises: assigning, by the one or more processors, weights to the segment scores according to distances of the segments corresponding to the segment scores from a beginning of the video; and aggregating, by the one or more processors, the segment scores according to the assigned weights.

8. The method of claim 1, further comprising: ranking, by the one or more processors, the plurality of segments according to the segment performance scores of the plurality of segments; and presenting, by the one or more processors, images from the plurality of segments on a user interface in order according to the rankings of the plurality of segments from which the images respectively originated.

9. The method of claim 1, further comprising: identifying, by the one or more processors, a defined number of segments with the lowest segment performance scores of the plurality of segments; and removing, by the one or more processors, the defined number of segments with the lowest segment performance scores of the plurality of segments from the video.

10. The method of claim 1, further comprising: identifying, by the one or more processors, a highest scoring segment of the plurality of segments based on the segments scores for the plurality of segments; extracting, by the one or more processors, one or more images from the highest scoring segment; executing, by the one or more processors, at least one machine learning model to generate an image performance score for each of the one or more images extracted from the highest scoring segment; identifying, by the one or more processors, a highest scoring image of the one or more images based on the generated image performance scores; and generating, by the one or more processors, a record identifying the highest scoring image.

11. The method of claim 1, further comprising: identifying, by the one or more processors, a defined number of segments with the highest segment performance scores of the plurality of segments; concatenating, by the one or more processors, the defined number of segments into a concatenated video; and storing, by the one or more processors, the concatenated video in memory.

12. A system, comprising one or more processors coupled with memory and configured to: receive a video; segment the video into a plurality of segments, each of the plurality of segments comprising a plurality of images; execute one or more machine learning models using the plurality of segments to generate a segment score for each of the plurality of segments; generate a video performance score for the video as a function of the segment scores for the plurality of segments; and generate a record comprising the video performance score for the video and an identification of the video.

13. The system of claim 12, wherein the one or more processors are configured to segment the video into a plurality of segments by segmenting the video into the plurality of segments each having a defined length and having a defined overlap between pairs of sequential segments of the plurality of segments.

14. The system of claim 12, wherein the one or more processors are further configured to: determine segment content for each of the plurality of segments; and segment the plurality of segments based on a change in segment content between pairs of sequential segments of the plurality of segments.

15. The system of claim 12, wherein the one or more processors are further configured to execute the one or more machine learning model to generate the segment score for each of the plurality of segments by: iteratively executing a feature extraction machine learning model using the plurality of segments to generate a segment embedding for each of the plurality of segments; and iteratively executing a content scoring machine learning model using the plurality of segments to generate the segment score for each of the plurality of segments.

16. The system of claim 12, wherein the one or more processors are configured to generate the video performance score for the video by: aggregating the segment scores of the plurality of segments to generate the video performance score.

17. The system of claim 16, wherein the one or more processors are configured to aggregate the segments scores by: assigning weights to the segment scores according to lengths of the segments corresponding to the segment scores; and aggregating the segment scores according to the assigned weights.

18. The system of claim 16, wherein the one or more processors are configured to generate the video performance score for the video by: assigning weights to the segment scores according to distances of the segments corresponding to the segment scores from a beginning of the video; and aggregating the segment scores according to the assigned weights.

19. Non-transitory computer-readable media comprising instructions that, when executed by one or more processors, cause the one or more processors to: rec
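
The weighted aggregation described in claims 6, 7, 17, and 18 and the segment selection in claims 9 and 11 can be sketched as follows. This is an illustration under stated assumptions, not the claimed implementation: all function names are hypothetical, segments are `(start, end)` frame-index pairs, the position weighting `1 / (1 + decay * start)` is an invented example (the claims only say weights depend on distance from the beginning of the video), and keeping selected segments in their original order is likewise an assumption.

```python
from typing import List, Sequence, Tuple

Span = Tuple[int, int]  # (start, end) frame indices, end exclusive


def weighted_score(scores: Sequence[float], weights: Sequence[float]) -> float:
    """Weighted average of segment scores."""
    total = sum(weights)
    if len(scores) != len(weights) or total <= 0:
        raise ValueError("scores and weights must match and weights must sum to > 0")
    return sum(s * w for s, w in zip(scores, weights)) / total


def length_weights(segments: Sequence[Span]) -> List[float]:
    """Weight each segment by its length in frames."""
    return [float(end - start) for start, end in segments]


def position_weights(segments: Sequence[Span], decay: float = 0.5) -> List[float]:
    """Assumed scheme: segments nearer the start of the video get larger weights."""
    return [1.0 / (1.0 + decay * start) for start, _ in segments]


def top_segments(segments: Sequence[Span], scores: Sequence[float], k: int) -> List[Span]:
    """Pick the k highest-scoring segments, returned in original order."""
    best = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    return [segments[i] for i in sorted(best)]


def drop_lowest(segments: Sequence[Span], scores: Sequence[float], k: int) -> List[Span]:
    """Remove the k lowest-scoring segments, keeping the rest in original order."""
    return top_segments(segments, scores, max(len(scores) - k, 0))


segments = [(0, 4), (4, 6), (6, 12)]
scores = [0.2, 0.8, 0.5]
by_length = weighted_score(scores, length_weights(segments))
by_position = weighted_score(scores, position_weights(segments))
highlights = top_segments(segments, scores, 2)
```

The concatenated video of claim 11 would then be assembled from the frames covered by `highlights`; the actual decoding and encoding steps are outside the scope of this sketch.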

Assignees

Vizit Labs Inc

Inventors

Classifications

  • Combinations of networks

  • Extraction of image or video features

  • using neural networks

  • Presentation of query results

  • Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12283084B2 cover?
The present disclosure describes a method comprising receiving a video; segmenting the video into a plurality of segments, each of the plurality of segments comprising a plurality of images; executing one or more machine learning models using the plurality of segments to generate a segment score for each of the plurality of segments, the segment score for a segment indicating a likelihood that …
Who is the assignee on this patent?
Vizit Labs Inc
What technology area does this patent fall under?
Primary CPC classification G06V10/761. Mapped technology areas include Physics.
When was this patent published?
Publication date Apr 22, 2025 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).