Method, Apparatus And Computer Program Product For Generating Semantic Information From Video Content

US2016112727A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2016112727-A1
Application numberUS-201414519492-A
CountryUS
Kind codeA1
Filing dateOct 21, 2014
Priority dateOct 21, 2014
Publication dateApr 21, 2016
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method, apparatus and computer program product are provided for generating semantic information from video content. Objects and regions of interest within video content may be identified and monitored for characteristics relating to object detection, motion content, and motion trajectory. Salient events relating to the regions may be detected based on the monitoring. Temporal segments may be identified and used to create summary video content, or highlights. An example embodiment relates to processing video footage of sports. Goals, scored points, unsuccessful scoring attempts, as well as other events may be detected in the video content. Efficiency is gained by monitoring only a relatively small portion of the frame, and by limiting the dependency on tracking moving objects.

First claim

Opening claim text (preview).

1 . An apparatus comprising at least one processor and at least one memory including computer program code, the at least one memory and the computer program code configured to, with the processor, cause the apparatus to perform at least: receiving an indication of an object of interest in video content; identifying at least one region of interest based on (a) a position of the at least one region of interest relative to a position of the object of interest and (b) a viewing angle from which the video content is captured; monitoring, with the processor, at least one characteristic in the at least one region of interest in the video content; and in response to the monitoring of the video content, generating semantic information relating to the video content and causing the generated semantic information to be stored in the at least one memory. 2 . The apparatus according to claim 1 , wherein the at least one memory and the computer program code are further configured to, with the processor, cause the apparatus to perform at least: determining that a salient event relating to the object of interest has occurred; identifying temporal segments relating to the salient event; and generating summary video content comprising the identified temporal segments. 3 . The apparatus according to claim 2 , wherein the at least one memory and the computer program code are further configured to, with the processor, cause the apparatus to perform at least: generating metadata describing the salient event; storing the metadata in association with the video content; and providing the metadata and video content such that the summary video content is recreated for playback based on the metadata and video content. 4 . The apparatus according to claim 1 , wherein the at least one characteristic comprises at least one of motion detection or object tracking. 5 . The apparatus according to claim 1 , wherein the at least one characteristic comprises at least one of object detection, object recognition or color variation. 6 . The apparatus according to claim 1 , wherein the at least one memory and the computer program code are further configured to, with the processor, cause the apparatus to perform at least: receiving an indication of a user input identifying the object of interest. 7 . The apparatus according to claim 1 , wherein the at least one memory and the computer program code are further configured to, with the processor, cause the apparatus to perform at least: in an instance the perspective of the video content changes, tracking the object of interest and the at least one region of interest. 8 . The apparatus according to claim 1 , wherein at least the object of interest or region of interest is identified based on a context of the video content. 9 . A computer program product comprising at least one non-transitory computer-readable storage medium having computer-executable program code instructions stored therein, the computer-executable program code instructions comprising program code instructions for: receiving an indication of an object of interest in video content; identifying at least one region of interest based on (a) a position of the at least one region of interest relative to a position of the object of interest and (b) a viewing angle from which the video content is captured; monitoring at least one characteristic in the at least one region of interest; and in response to the monitoring, generating semantic information relating to the video content and causing the generated semantic information to be stored in the at least one non-transitory computer-readable storage medium. 10 . The computer program product according to claim 9 , wherein the computer-executable program code instructions further comprise program code instructions for: determining that a salient event relating to the object of interest has occurred; identifying temporal segments relating to the salient event; and generating summary video content comprising the identified temporal segments. 11 . The computer program product according to claim 10 , wherein the computer-executable program code instructions further comprise program code instructions for: generating metadata describing the salient event; storing the metadata in association with the video content; and providing the metadata and video content such that the summary video content is recreated for playback based on the metadata and video content. 12 . The computer program product according to claim 9 , wherein the at least one characteristic comprises at least one of motion detection or object tracking. 13 . The computer program product according to claim 9 , wherein the at least one characteristics comprise s at least one of object detection, object recognition or color variation. 14 . The computer program product according to claim 9 , wherein the computer-executable program code instructions further comprise program code instructions for: receiving an indication of a user input identifying the object of interest. 15 . The computer program product according to claim 9 , wherein the computer-executable program code instructions further comprise program code instructions for: in an instance the perspective of the video content changes, tracking the object of interest and the at least one region of interest. 16 . The computer program product according to claim 9 , wherein at least the object of interest or region of interest is identified based on a context of the video content. 17 . A method comprising: receiving an indication of an object of interest in video content; identifying at least one region of interest based on (a) a position of the at least one region of interest relative to a position of the object of interest and (b) a viewing angle from which the video content is captured; monitoring at least one characteristic in the at least one region of interest; and in response to the monitoring, generating semantic information relating to the video content, and causing the generated semantic information to be stored in a memory device. 18 . The method according to claim 17 , further comprising: determining that a salient event relating to the object of interest has occurred; identifying temporal segments relating to the salient event; and generating summary video content comprising the identified temporal segments. 19 . The method according to claim 17 , further comprising: generating metadata describing the salient event; storing the metadata in association with the video content; and providing the metadata and video content such that the summary video content is recreated for playback based on the metadata and video content. 20 . (canceled)

Assignees

Inventors

Classifications

  • specifically adapted to content descriptors, e.g. coding, compressing or processing of metadata · CPC title

  • Live feed · CPC title

  • involving operations for analysing video streams, e.g. detecting features or characteristics (television picture signal circuitry for scene change detection H04N5/147; filtering for image enhancement G06T5/00; methods or arrangements for recognising scenes G06V20/00; arrangements characterised by components specially adapted for monitoring, identification or recognition of video in broadcast systems H04H60/59) · CPC title

  • G06F16/739Primary

    in form of a video summary, e.g. the video summary being a video sequence, a composite still image or having synthesized frames · CPC title

  • Graphical querying, e.g. query-by-region, query-by-sketch, query-by-trajectory, GUIs for designating a person/face/object as a query predicate (end-user interface involving hot spots associated with the video H04N21/4725; end-user interface for selecting a Region of Interest H04N21/4728) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2016112727A1 cover?
A method, apparatus and computer program product are provided for generating semantic information from video content. Objects and regions of interest within video content may be identified and monitored for characteristics relating to object detection, motion content, and motion trajectory. Salient events relating to the regions may be detected based on the monitoring. Temporal segments may be …
Who is the assignee on this patent?
Nokia Technologies Oy
What technology area does this patent fall under?
Primary CPC classification H04N21/2353. Mapped technology areas include Electricity.
When was this patent published?
Publication date Thu Apr 21 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).