Scene and activity identification in video summary generation

US10776629B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10776629-B2
Application numberUS-201816124607-A
CountryUS
Kind codeB2
Filing dateSep 7, 2018
Priority dateJul 23, 2014
Publication dateSep 15, 2020
Grant dateSep 15, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Video and corresponding metadata is accessed. Events of interest within the video are identified based on the corresponding metadata, and best scenes are identified based on the identified events of interest. A video summary can be generated including one or more of the identified best scenes. The video summary can be generated using a video summary template with slots corresponding to video clips selected from among sets of candidate video clips. Best scenes can also be identified by receiving an indication of an event of interest within video from a user during the capture of the video. Metadata patterns representing activities identified within video clips can be identified within other videos, which can subsequently be associated with the identified activities.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for identifying video scenes, the method comprising: accessing a video of an activity, the activity including an event at a moment within the video; accessing metadata associated with the video, the metadata associated with the video enabling identification of a type of the activity and a type of the event; identifying the type of the activity based on the metadata associated with the video including patterns associated with the type of the activity; identifying the type of the event based on the metadata associated with the video including patterns associated with the type of the event; identifying a scene of the video for the event, the scene including a length of the video that encompasses the moment and extends both before and after the moment within the video, wherein the length of the video is determined based on the type of the activity and the type of the event such that: the length is a first length based on the type of the activity being of a first activity type and the type of the event being of a first event type; the length is a second length based on the type of the activity being of the first activity type and the type of the event being of a second event type; the length is a third length based on the type of the activity being of a second activity type and the type of the event being of the first event type; and the length is a fourth length based on the type of the activity being of the second activity type and the type of the event being of the second event type, wherein: the first length is different from the second length, the third length, and the fourth length; the second length is different from the third length and the fourth length; and the third length is different from the fourth length; and outputting the scene of the video for playback. 2. The method of claim 1 , wherein outputting the scene of the video for playback includes generating a video summary comprising the scene. 3. The method of claim 1 , wherein the metadata is generated by a camera during the capture of the video. 4. The method of claim 3 , wherein the metadata comprises telemetry data describing a motion of the camera during the capture of the video. 5. The method of claim 3 , wherein the metadata comprises location data describing a location of the camera during the capture of the video. 6. The method of claim 3 , wherein the metadata comprises biometric data describing characteristics of a user of the camera during the capture of the video. 7. The method of claim 1 , wherein the metadata is accessed from an external entity after the capture of the video. 8. The method of claim 7 , wherein the metadata comprises environment data describing characteristics of an environment in which the video was captured. 9. The method of claim 1 , wherein position of the moment within the length is determined based on the type of the activity and the type of the event. 10. A system that identifies video scenes, the system comprising: a non-transitory computer-readable storage medium storing instructions configured, when executed, to cause the system to perform: accessing a video of an activity, the activity including an event at a moment within the video; accessing metadata associated with the video, the metadata associated with the video enabling identification of a type of the activity and a type of the event; identifying the type of the activity based on the metadata associated with the video including patterns associated with the type of the activity; identifying the type of the event based on the metadata associated with the video including patterns associated with the type of the event; identifying a scene of the video for the event, the scene including a length of the video that encompasses the moment and extends both before and after the moment within the video, wherein the length of the video is determined based on the type of the activity and the type of the event such that the length is a first length based on the type of the activity being of a first activity type and the type of the event being of a first event type; the length is a second length based on the type of the activity being of the first activity type and the type of the event being of a second event type; the length is a third length based on the type of the activity being of a second activity type and the type of the event being of the first event type; and the length is a fourth length based on the type of the activity being of the second activity type and the type of the event being of the second event type, wherein: the first length is different from the second length, the third length, and the fourth length; the second length is different from the third length and the fourth length; and the third length is different from the fourth length; and outputting the scene of the video for playback; and a processor configured to execute the instructions. 11. The system of claim 10 , wherein outputting the scene of the video for playback includes generating a video summary comprising the scene. 12. The system of claim 10 , wherein the metadata is generated by a camera during the capture of the video. 13. The system of claim 12 , wherein the metadata comprises telemetry data describing a motion of the camera during the capture of the video. 14. The system of claim 12 , wherein the metadata comprises location data describing a location of the camera during the capture of the video. 15. The system of claim 12 , wherein the metadata comprises biometric data describing characteristics of a user of the camera during the capture of the video. 16. The system of claim 10 , wherein the metadata is accessed from an external entity after the capture of the video. 17. The system of claim 16 , wherein the metadata comprises environment data describing characteristics of an environment in which the video was captured. 18. The system of claim 10 , wherein position of the moment within the length is determined based on the type of the activity and the type of the event.

Assignees

Inventors

Classifications

  • Control of parameters via user interfaces · CPC title

  • Control of cameras or camera modules · CPC title

  • H04N5/77Primary

    between a recording apparatus and a television camera · CPC title

  • G06V20/47Primary

    Detecting features for summarising video content · CPC title

  • metadata assisted face recognition · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10776629B2 cover?
Video and corresponding metadata is accessed. Events of interest within the video are identified based on the corresponding metadata, and best scenes are identified based on the identified events of interest. A video summary can be generated including one or more of the identified best scenes. The video summary can be generated using a video summary template with slots corresponding to video cl…
Who is the assignee on this patent?
Gopro Inc
What technology area does this patent fall under?
Primary CPC classification H04N5/77. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Sep 15 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).