System and method for coarse-to-fine video object segmentation and re-composition

US2019164006A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2019164006-A1
Application numberUS-201916263961-A
CountryUS
Kind codeA1
Filing dateJan 31, 2019
Priority dateFeb 24, 2016
Publication dateMay 30, 2019
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments of the present disclosure include a computer-implemented method that receives a digital image input, the digital image input containing one or more dynamic salient objects arranged over a background. The method also includes performing a tracking operation, the tracking operation identifying the dynamic salient object over one or more frames of the digital image input as the dynamic salient object moves over the background. The method further includes performing a clustering operation, in parallel with the tracking operation, on the digital image input, the clustering operation identifying boundary conditions of the dynamic salient object. Additionally, the method includes combining a first output from the tracking operation and a second output from the clustering operation to generate a third output. The method further includes performing a segmentation operation on the third output, the segmentation operation extracting the dynamic salient object from the digital image input.

First claim

Opening claim text (preview).

1 . A computer-implemented method, comprising: receiving a digital video image input, the digital video image input comprising a dynamic salient object arranged over a background; determining a spatial sampling ratio and a temporal sampling ratio; adjusting a spatial resolution and a frame rate according to the determined spatial sampling ratio and the temporal sampling ratio; performing a tracking operation, wherein the tracking operation identifies the dynamic salient object over one or more frames of the digital video image input as the dynamic salient object moves over the background; performing a clustering operation in parallel with the tracking operation on the digital video image input, wherein the clustering operation identifies boundary conditions of the dynamic salient object; combining a first output from the tracking operation and a second output from the clustering operation to generate a third output; and performing a segmentation operation on the third output, wherein the segmentation operation extracts the dynamic salient object from the digital video image input. 2 . The computer-implemented method of claim 1 , wherein determining the spatial sampling ratio and the temporal sampling ratio is based on a percentage of original resolution and frame rate of the digital video image input. 3 . The computer-implemented method of claim 2 , wherein the percentage of original resolution and frame rate of the digital video image input is based on processor computing ability. 4 . The computer-implemented method of claim 1 , wherein the clustering operation comprises supervoxel clustering. 5 . The computer-implemented method of claim 4 , wherein the supervoxel clustering is performed utilizing Simple Linear Interactive Clustering (SLIC). 6 . The computer-implemented method of claim 1 , further comprising applying a bilateral filter to each frame of the digital video image input. 7 . The computer-implemented method of claim 1 , wherein performing the tracking operation comprises point tracking the digital video image input via an algorithm selected from the group consisting of: mean shift tracking, Kanade-Lucas-Tomasi (KLT), tracking learning-detection (TLD), scale-invariant feature transform (SIFT), and Harris corner detection. 8 . The computer-implemented method of claim 1 , further comprising positioning the extracted dynamic salient object onto a different video sequence from the digital video image input to simulate interaction between animated objects and real life objects. 9 . The computer-implemented method of claim 1 , further comprising re-compositioning the extracted dynamic salient object onto a digital image, the digital image being different than frames in the digital video image input. 10 . The computer-implemented method of claim 9 , further comprising applying, to the extracted dynamic salient object, a transformation selected from the group consisting of: a color change, a contrast change, a brightness change, a tone change, a transparency change, and a filter change. 11 . The computer-implemented method of claim 9 , further comprising arranging the extracted dynamic salient object in multiple positions on the digital image to create a moving effect, wherein the digital image is a still photograph. 12 . The computer-implemented method of claim 1 , comprising receiving the digital video image input from a device selected from the group consisting of: a camera, a personal electronic device (PED), a smart phone, a wearable device, and a computer. 13 . The computer-implemented method of claim 12 , wherein the personal electronic device (PED) comprises a processor, a memory, a touchscreen display, and input device, and a network device.

Assignees

Inventors

Classifications

  • G06T7/215Primary

    Motion-based segmentation · CPC title

  • using clustering, e.g. of similar faces in social networks · CPC title

  • Salient features, e.g. scale invariant feature transforms [SIFT] · CPC title

  • Clustering techniques · CPC title

  • Creating or editing images; Combining images with text · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2019164006A1 cover?
Embodiments of the present disclosure include a computer-implemented method that receives a digital image input, the digital image input containing one or more dynamic salient objects arranged over a background. The method also includes performing a tracking operation, the tracking operation identifying the dynamic salient object over one or more frames of the digital image input as the dynamic…
Who is the assignee on this patent?
Kodak Alaris Inc
What technology area does this patent fall under?
Primary CPC classification G06T7/215. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu May 30 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).