System and method for coarse-to-fine video object segmentation and re-composition

US10229340B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10229340-B2
Application numberUS-201715441978-A
CountryUS
Kind codeB2
Filing dateFeb 24, 2017
Priority dateFeb 24, 2016
Publication dateMar 12, 2019
Grant dateMar 12, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments of the present disclosure include a computer-implemented method that receives a digital image input, the digital image input containing one or more dynamic salient objects arranged over a background. The method also includes performing a tracking operation, the tracking operation identifying the dynamic salient object over one or more frames of the digital image input as the dynamic salient object moves over the background. The method further includes performing a clustering operation, in parallel with the tracking operation, on the digital image input, the clustering operation identifying boundary conditions of the dynamic salient object. Additionally, the method includes combining a first output from the tracking operation and a second output from the clustering operation to generate a third output. The method further includes performing a segmentation operation on the third output, the segmentation operation extracting the dynamic salient object from the digital image input.

First claim

Opening claim text (preview).

The invention claimed is: 1. A computer-implemented method, comprising: receiving a digital image input, the digital image input containing one or more dynamic salient objects arranged over a background; performing a tracking operation, the tracking operation identifying the dynamic salient object over one or more frames of the digital image input as the dynamic salient object moves over the background; performing a clustering operation, in parallel with the tracking operation, on the digital image input, the clustering operation identifying boundary conditions of the dynamic salient object; combining a first output from the tracking operation and a second output from the clustering operation to generate a third output; and performing a segmentation operation on the third output, the segmentation operation extracting the dynamic salient object from the digital image input. 2. The computer-implemented method of claim 1 , further comprising creating an effect using an extracted dynamic salient object from the digital image input. 3. The computer-implemented method of claim 1 , wherein the tracking operation comprises a point tracking algorithm and a motion clustering algorithm, performed in series. 4. The computer-implemented method of claim 3 , wherein the point tracking algorithm is performed before the motion clustering algorithm. 5. The computer-implemented method of claim 3 , wherein the point tracking algorithm is a Kanade-Lucas-Tomasi point tracking algorithm and the motion clustering algorithm is a sparse subspace clustering algorithm. 6. The computer-implemented method of claim 1 , wherein the clustering operation comprises supervoxel clustering. 7. The computer-implemented method of claim 1 , further comprising re-compositioning an extracted dynamic salient object onto a digital image, the digital image being different than the digital image input. 8. The computer-implemented method of claim 1 , wherein the segmentation operation comprises a graph-based segmentation, including a coarse segmentation and a fine segmentation, the coarse segmentation being performed before the fine segmentation. 9. A system, comprising: one or more processors; and memory including instructions that, when executed by the one or more processors, cause the system to: receive a digital image input, the digital image input containing one or more dynamic salient objects arranged over a background; perform a tracking operation, the tracking operation identifying the dynamic salient object over one or more frames of the digital image input as the dynamic salient object moves over the background; perform a clustering operation, in parallel with the tracking operation, on the digital image input, the clustering operation identifying boundary conditions of the dynamic salient object; combine a first output from the tracking operation and a second output from the clustering operation to generate a third output; and perform a segmentation operation on the third output, the segmentation operation extracting the dynamic salient object from the digital image input. 10. The system of claim 9 , where the memory further includes instructions that, when executed by the one or more processors, cause the system to create an effect using an extracted dynamic salient object from the digital image input. 11. The system of claim 9 , wherein the tracking operation comprises a point tracking algorithm and a motion clustering algorithm, performed in series. 12. The system of claim 9 , wherein the clustering operation comprises supervoxel clustering. 13. The system of claim 9 , wherein the segmentation operation comprises a graph-based segmentation including a coarse segmentation and a fine segmentation. 14. The system of claim 9 , wherein the memory further includes instructions that, when executed by the one or more processors, cause the system to re-composition an extracted dynamic salient object onto a digital image, the digital image being different than the digital image input. 15. A non-transitory computer-readable storage medium storing instructions that, when executed by one or more processors, cause a computing system to: receive a digital image input, the digital image input containing one or more dynamic salient objects arranged over a background; perform a tracking operation, the tracking operation identifying the dynamic salient object over one or more frames of the digital image input as the dynamic salient object moves over the background; perform a clustering operation, in parallel with the tracking operation, on the digital image input, the clustering operation identifying boundary conditions of the dynamic salient object; combine a first output from the tracking operation and a second output from the clustering operation to generate a third output; and perform a segmentation operation on the third output, the segmentation operation extracting the dynamic salient object from the digital image input. 16. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that, when executed by the one or more processors, cause the computing system to create an effect using an extracted dynamic salient object from the digital image input. 17. The non-transitory computer-readable storage medium of claim 15 , wherein the tracking operation comprises a point tracking algorithm and a motion clustering algorithm, performed in series. 18. The non-transitory computer-readable storage medium of claim 15 , wherein the clustering operation comprises supervoxel clustering. 19. The non-transitory computer-readable storage medium of claim 15 , wherein the segmentation operation comprises a graph-based segmentation. 20. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that, when executed by the one or more processors, cause the computing system to re-composition an extracted dynamic salient object onto a digital image, the digital image being different than the digital image input.

Assignees

Inventors

Classifications

  • G06T7/215Primary

    Motion-based segmentation · CPC title

  • using clustering, e.g. of similar faces in social networks · CPC title

  • Salient features, e.g. scale invariant feature transforms [SIFT] · CPC title

  • Clustering techniques · CPC title

  • Creating or editing images; Combining images with text · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10229340B2 cover?
Embodiments of the present disclosure include a computer-implemented method that receives a digital image input, the digital image input containing one or more dynamic salient objects arranged over a background. The method also includes performing a tracking operation, the tracking operation identifying the dynamic salient object over one or more frames of the digital image input as the dynamic…
Who is the assignee on this patent?
Kodak Alaris Inc
What technology area does this patent fall under?
Primary CPC classification G06T7/215. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 12 2019 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).