Generating video segments based on video metadata
US-11120490-B1 · Sep 14, 2021 · US
US12033669B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12033669-B2 |
| Application number | US-202117330702-A |
| Country | US |
| Kind code | B2 |
| Filing date | May 26, 2021 |
| Priority date | Sep 10, 2020 |
| Publication date | Jul 9, 2024 |
| Grant date | Jul 9, 2024 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Embodiments are directed to a snap point segmentation that defines the locations of selection snap points for a selection of video segments. Candidate snap points are determined from boundaries of feature ranges of the video indicating when instances of detected features are present in the video. In some embodiments, candidate snap point separations are penalized for being separated by less than a minimum duration corresponding to a minimum pixel separation between consecutive snap points on a video timeline. The snap point segmentation is computed by solving a shortest path problem through a graph that models different snap point locations and separations. When a user clicks or taps on the video timeline and drags, a selection snaps to the snap points defined by the snap point segmentation. In some embodiments, the snap points are displayed during a drag operation and disappear when the drag operation is released.
Opening claim text (preview).
What is claimed is: 1. One or more computer storage media storing computer-useable instructions that, when used by one or more computing devices, cause the one or more computing devices to perform operations comprising: causing a presentation of a first segmentation of a video timeline of a video; generating a representation of a second segmentation of the first segmentation using one or more feature tracks that represent instances of detected features in the video and feature ranges indicating when the instances are present in the video; in response to initiation of a click-or-tap and drag operation along the video timeline, causing display of snap points defined by the second segmentation on the video timeline; snapping, in response to the click-or-tap and drag operation along the video timeline, selection boundaries of a marquee selection of a duration of the video to the snap points; and removing the display of the snap points in response to the click-or-tap and drag operation being released. 2. The one or more computer storage media of claim 1 , wherein the first segmentation is a default segmentation, wherein the presentation of the video timeline represents video segments of the default segmentation, wherein the second segmentation defines locations of the snap points within the video segments of the default segmentation. 3. The one or more computer storage media of claim 1 , the operations further comprising generating candidate snap points for the second segmentation by snapping regularly spaced candidate snap points to a corresponding closest boundary, of the feature ranges, that is located within a snapping threshold. 4. The one or more computer storage media of claim 1 , the operations further comprising selecting candidate snap points for the second segmentation from boundaries of detected transcript features. 5. The one or more computer storage media of claim 1 , the operations further comprising selecting candidate snap points for the second segmentation by adding regularly spaced candidate snap points into a gap between consecutive detected transcript feature boundaries and snapping the regularly spaced candidate snap points to a corresponding closest boundary, of the feature ranges of non-transcript features, that is located within a snapping threshold. 6. The one or more computer storage media of claim 1 , the second segmentation determined by penalizing candidate snap point separations less than a minimum separation apart, the minimum separation corresponding to a minimum pixel separation between consecutive snap points on the video timeline. 7. The one or more computer storage media of claim 1 , the second segmentation determined by penalizing candidate snap point separations less than a minimum separation apart, the minimum separation corresponding to a minimum pixel separation between consecutive snap points on the video timeline, the minimum pixel separation being dependent on a zoom level of the video timeline. 8. The one or more computer storage media of claim 1 , the second segmentation determined by using a graph that models different segmentation options as paths through nodes and edges with edge weights, the nodes representing candidate snap points, the edges representing candidate separations between snap points, and the edge weights representing cut costs for the candidate snap points. 9. A computerized method comprising: causing a presentation of a video timeline of a video; detecting a click-or-tap and drag operation along the video timeline; and snapping, in response to the click-or-tap and drag operation, a selection boundary of a selection of a portion of the video to snap points defined by a snap point segmentation of the video, the snap points located at boundaries of feature ranges of times when instances of detected features of the video are present in the video, and the snap points determined based on a set of candidate snap points selected from boundaries of detected transcript features. 10. The computerized method of claim 9 , wherein the presentation of the video timeline represents video segments of a default segmentation of the video, wherein the snap point segmentation defines locations of the snap points within the video segments of the default segmentation. 11. The computerized method of claim 9 , further comprising causing display of the snap points on the video timeline in response to initiation of the click-or-tap and drag operation, and removing the display of the snap points in response to the click-or-tap and drag operation being released. 12. The computerized method of claim 9 , further comprising generating a second set of candidate snap points for the snap point segmentation by snapping regularly spaced candidate snap points to a corresponding closest boundary, of the feature ranges, that is located within a snapping threshold. 13. The computerized method of claim 9 , further comprising selecting a second set of candidate snap points for the snap point segmentation by adding regularly spaced candidate snap points into a gap between consecutive detected transcript feature boundaries and snapping the regularly spaced candidate snap points to a corresponding closest boundary, of the feature ranges of non-transcript features, that is located within a snapping threshold. 14. The computerized method of claim 9 , the snap point segmentation determined by penalizing candidate snap point separations less than a minimum separation apart, the minimum separation corresponding to a minimum pixel separation between consecutive snap points on the video timeline. 15. The computerized method of claim 9 , the snap point segmentation determined by penalizing candidate snap point separations less than a minimum separation apart, the minimum separation corresponding to a minimum pixel separation between consecutive snap points on the video timeline, the minimum pixel separation being dependent on a zoom level of the video timeline. 16. A computer system comprising: one or more hardware processors and memory configured to provide computer program instructions to the one or more hardware processors; a video interaction engine configured to use the one or more hardware processors to perform operations comprising: causing a presentation of a first segmentation of a video timeline of a video; and during a click-or-tap and drag operation on the video timeline, snapping a selection boundary of a selection of a portion of the video to snap points defined by a second segmentation of the first segmentation, the snap points located at boundaries of feature ranges of times when instances of detected features of the video are present in the video, the second segmentation determined by penalizing candidate snap point separations less than a minimum separation apart. 17. The computer system of claim 16 , the operations further comprising: causing display of the snap points on the video timeline in response to the click-or-tap and drag operation, and removing the display of the snap points in response to the click-or-tap and drag operation being released. 18. The computer system of claim 16 , the minimum separation corresponding to a minimum pixel separation between consecutive snap points on the video timeline.
Drag-and-drop · CPC title
Interaction with lists of selectable items, e.g. menus · CPC title
Interaction techniques to control parameter settings, e.g. interaction with sliders or dials · CPC title
using a touch-screen or digitiser, e.g. input of commands through traced gestures · CPC title
based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.