What technology area does this patent fall under?

Primary CPC classification G06V20/49. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Jan 29 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Graph-based framework for video object segmentation and extraction in feature space

US10192117B2 · US · B2

Patent metadata
Field	Value
Publication number	US-10192117-B2
Application number	US-201615167327-A
Country	US
Kind code	B2
Filing date	May 27, 2016
Priority date	Jun 25, 2015
Publication date	Jan 29, 2019
Grant date	Jan 29, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for graph-based spatiotemporal video segmentation and automatic target object extraction in high-dimensional feature space includes using a processor to automatically analyze an entire volumetric video sequence; using the processor to construct a high-dimensional feature space that includes color, motion, time, and location information so that pixels in the entire volumetric video sequence are reorganized according to their unique and distinguishable feature vectors; using the processor to create a graph model that fuses the appearance, spatial, and temporal information of all pixels of the video sequence in the high-dimensional feature space; and using the processor to group pixels in the graph model that are inherently similar and assign the same labels to them to form semantic spatiotemporal key segments.

First claim

Opening claim text (preview).

We claim: 1. A method for graph-based spatiotemporal video segmentation and automatic target object extraction in high-dimensional feature space, comprising: a) automatically analyzing an entire volumetric video sequence; b) constructing a high-dimensional feature space that includes color, motion, time, and location information so that pixels in the entire volumetric video sequence are reorganized according to their unique and distinguishable feature vectors; c) creating a graph model that fuses appearance, spatial, and temporal information of all pixels of the video sequence in the high-dimensional feature space, wherein the graph model represents each pixel as a graph node, and two pixels are connected by an edge based on similarity criteria; d) grouping pixels in the graph model that are inherently similar and assign the same labels to them to form semantic spatiotemporal key segments; and e) using the semantic spatiotemporal key segments as an input to an initial background/foreground model combined with a graph cut algorithm to label at least one target object. 2. The method of claim 1 wherein the graph cut algorithm is used to analyze region level volumetric segments for each segment obtained from the previous video segmentation stage to create nodes and using edges between nodes to reflect their mutual similarity considering both spatial and temporal coherence. 3. The method of claim 1 wherein intra-cluster connectivity is used to correct spatial and temporal inconsistency due to sudden motion changes or occlusion. 4. The method of claim 1 wherein the high-dimensional feature space is a seven dimension feature space. 5. The method of claim 1 wherein a k-nearest neighbor search is used in step d) to group pixels that are inherently similar and assign the same labels to them.

Assignees

Kodak Alaris Inc

Inventors

Classifications

G06V10/7635
based on graphs, e.g. graph cuts or spectral clustering · CPC title
G06V20/49Primary
Segmenting video sequences, i.e. computational techniques such as parsing or cutting the sequence, low-level clustering or determining units such as shots or scenes · CPC title
G06V20/46
Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames · CPC title
G06F18/2323
based on graph theory, e.g. minimum spanning trees [MST] or graph cuts · CPC title
G06K9/00765
Physics · mapped topic

Patent family

Related publications grouped by family.

View patent family 57601075

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10192117B2 cover?: A method for graph-based spatiotemporal video segmentation and automatic target object extraction in high-dimensional feature space includes using a processor to automatically analyze an entire volumetric video sequence; using the processor to construct a high-dimensional feature space that includes color, motion, time, and location information so that pixels in the entire volumetric video sequ…
Who is the assignee on this patent?: Kodak Alaris Inc
What technology area does this patent fall under?: Primary CPC classification G06V20/49. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Jan 29 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).