Method, system and computer program product for automatically altering a video stream

US2017337692A1 · US · A1

Patent metadata
Publication number: US-2017337692-A1
Application number: US-201715660252-A
Country: US
Kind code: A1
Filing date: Jul 26, 2017
Priority date: Jan 27, 2015
Publication date: Nov 23, 2017
Grant date: not listed

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

There is provided a method for automatically altering a digital video stream including multiple video input frames, to automatically obtain output frames with a target composition, in which frame metadata relating to objects in one or more of the video input frames is analyzed on a frame-by-frame basis and used by a processor to automatically alter one or more output frames to be more similar to, or to match, the target composition, wherein cropping is performed in 3D. A related system and a related computer program product are also provided.
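The frame-by-frame idea in the abstract can be illustrated with a minimal sketch. It assumes each frame arrives with a list of per-object metadata records holding a 2D rectangle and a depth value; the `DetectedObject` record, its field names, the `max_depth` cutoff, and the choice to reuse the previous crop when no target is visible are all hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator, List, Optional, Set, Tuple

Box = Tuple[float, float, float, float]  # (left, top, right, bottom) in pixels


@dataclass(frozen=True)
class DetectedObject:
    """Hypothetical per-object frame metadata; field names are assumptions."""
    identity: str
    x: float       # left edge in pixels
    y: float       # top edge in pixels
    width: float
    height: float
    depth: float   # distance from the camera, e.g. from a depth sensor


def bounding_crop(objects: Iterable[DetectedObject],
                  wanted: Set[str],
                  max_depth: float) -> Optional[Box]:
    """Return the smallest box containing every wanted object within max_depth."""
    selected = [o for o in objects
                if o.identity in wanted and o.depth <= max_depth]
    if not selected:
        return None
    left = min(o.x for o in selected)
    top = min(o.y for o in selected)
    right = max(o.x + o.width for o in selected)
    bottom = max(o.y + o.height for o in selected)
    return (left, top, right, bottom)


def crop_stream(frames_metadata: Iterable[List[DetectedObject]],
                wanted: Set[str],
                max_depth: float) -> Iterator[Optional[Box]]:
    """Yield one crop box per frame, analysing the metadata frame by frame.

    Assumption: when no target is visible, the previous crop is reused.
    """
    previous: Optional[Box] = None
    for objects in frames_metadata:
        box = bounding_crop(objects, wanted, max_depth)
        if box is not None:
            previous = box
        yield previous
```

Here the target composition is reduced to "which identities to include" plus a depth limit, so a large background object is ignored once it lies beyond `max_depth`.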

First claim

Opening claim text (preview).

1. A method for automatically cropping input video frames of a digital video stream to obtain output video frames, wherein obtaining the output video frames comprises: analyzing on a frame-by-frame basis frame metadata relating to objects in one or more of the input video frames; and using of the frame metadata by a processor to automatically crop one or more of the input video frames based on a target frame composition, wherein the target frame composition defines at least one of a number, identity and/or position of objects for inclusion in at least one output video frame, wherein the cropping is based on the detection of objects which have associated metrics, the metrics having three-dimensional space parameters.

2. The method according to claim 1, wherein at least one of the three-dimensional space parameters relates to depth information corresponding to the objects in the one or more of the input video frames.

3. The method according to claim 2, wherein the depth information is depth sensor data.

4. The method according to claim 1, wherein the cropping is based on detection of a single target object.

5. The method according to claim 1, wherein the cropping is based on detection of a plurality of target objects.

6. The method according to claim 1, wherein the target frame composition is based on a determined association between at least two objects for inclusion in the at least one output video frame.

7. The method according to claim 1, wherein the cropping is based on a scene type metric, the scene type metric identifying a given input video frame as depicting a predefined category of scene.

8. The method according to claim 7, wherein the scene type metric is based on a predefined rule describing speed of at least one object in the given input video frame.

9. The method according to claim 1, comprising determining a bounding surface area in the one or more input video frames, wherein: the bounding area of a given input video frame comprises all of the objects for inclusion in the at least one output frame; and the cropping is based on the bounding area.

10. The method according to claim 9, in which the bounding area is a regular polyhedron in three-dimensional space.

11. The method according to claim 1, further comprising: receiving, from a user and via a user interface, a selection of a given object in one or more of the input video frames, and: responsive to the selection, setting the target frame composition to define said given object for inclusion in at least one output video frame.

12. A system comprising a processor and storage, configured to implement a method for automatically cropping input video frames of a digital video stream to obtain output video frames, wherein obtaining the output video frames comprises: analyzing on a frame-by-frame basis frame metadata relating to objects in one or more of the input video frames; and using of the frame metadata by a processor to automatically crop one or more of the input video frames based on a target frame composition, wherein the target frame composition defines at least one of a number, identity and/or position of objects for inclusion in at least one output video frame, wherein the cropping is based on the detection of objects which have associated metrics, the metrics having three-dimensional space parameters.

13. A method for automatically cropping input video frames of a digital video stream, to obtain output video frames, wherein obtaining the output video frames comprises: analyzing on a frame-by-frame basis frame metadata relating to objects in one or more of the input video input frames; and using of the frame metadata by a processor to automatically crop one or more output frames based on a target composition, wherein the target composition is defined by: a first set of rules that are functional rules, such as maximally filling a frame with an object, or placing an object in the center of a given output frame; and a second set of rules that are aesthetic rules defining a desired aesthetic appearance of a given output frame.

14. The method according to claim 13, wherein metadata relating to a given object describes proportions of that object.

15. The method according to claim 13, wherein the metadata relating to a given object describes motion of that object.

16. The method according to claim 13, wherein metadata relating to a given object describes a position of that object within an input video frame.

17. The method according to claim 13, wherein metadata relating to a given object defines proportions of that object based on proportions of a rectangle fitted around that object.

18. The method according to claim 13, wherein the method is performed in real time.

19. The method according to claim 13, wherein the method is performed as post-processing.
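The two functional rules named in claim 13 (maximally filling a frame with an object, and placing an object in the center of the output frame) can be sketched as a single crop computation. This is an illustration only: the function name `centered_fill_crop`, the `(left, top, right, bottom)` box layout, and the way the window is shifted or shrunk to stay inside the input frame are assumptions, and the aesthetic rules of the claim are not modelled.

```python
from typing import Tuple

Box = Tuple[float, float, float, float]  # (left, top, right, bottom) in pixels


def centered_fill_crop(obj_box: Box,
                       frame_size: Tuple[float, float],
                       aspect: float) -> Box:
    """Crop window of the given aspect ratio (width / height) around an object.

    "Maximally filling": the window is the smallest one of that aspect ratio
    that still contains the rectangle fitted around the object.
    "Centering": the window is centred on the object, then shifted only as
    far as needed to stay inside the input frame.
    """
    left, top, right, bottom = obj_box
    frame_w, frame_h = frame_size
    obj_w, obj_h = right - left, bottom - top

    # Smallest window with the requested aspect ratio that contains the object.
    crop_h = max(obj_h, obj_w / aspect)
    crop_w = crop_h * aspect

    # If that window is larger than the input frame, shrink it to fit.
    # In that case the object can no longer be fully contained.
    scale = min(1.0, frame_w / crop_w, frame_h / crop_h)
    crop_w *= scale
    crop_h *= scale

    # Centre on the object, then clamp the top-left corner to the frame.
    cx, cy = (left + right) / 2, (top + bottom) / 2
    x0 = min(max(cx - crop_w / 2, 0.0), frame_w - crop_w)
    y0 = min(max(cy - crop_h / 2, 0.0), frame_h - crop_h)
    return (x0, y0, x0 + crop_w, y0 + crop_h)
```

For example, an object fitted by the rectangle `(800, 400, 1000, 700)` in a 1920 by 1080 frame, cropped at 16:9, gets a window exactly as tall as the object (300 pixels) and centred on it horizontally.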

Assignees

Apical Ltd

Inventors

Classifications

  • G06T11/00 (Primary)

    Two-dimensional [2D] image generation · CPC title

  • G06T7/11 (Primary)

    Region-based segmentation · CPC title

  • Range image; Depth image; 3D point clouds · CPC title

  • Physics · mapped topic

  • Stereoscopic video; Stereoscopic image sequence · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2017337692A1 cover?
There is provided a method for automatically altering a digital video stream including multiple video input frames, to automatically obtain output frames with a target composition, in which frame metadata relating to objects in one or more of the video input frames is analyzed on a frame-by-frame basis and used by a processor to automatically alter one or more output frames to be more similar t…
Who is the assignee on this patent?
Apical Ltd
What technology area does this patent fall under?
Primary CPC classification G06T11/00. Mapped technology areas include Physics.
When was this patent published?
Publication date: Nov 23, 2017 (kind code A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).