Fast robust method for compensating ego-translations and detecting independent moving objects in video captured with a moving camera

US10078902B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-10078902-B1
Application numberUS-201615250665-A
CountryUS
Kind codeB1
Filing dateAug 29, 2016
Priority dateAug 27, 2015
Publication dateSep 18, 2018
Grant dateSep 18, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Described is a system for compensating ego-translations in video captured with a moving camera. Translative ego-motion is estimated on a sequence of image frames captured by a moving camera by minimizing a cost function that is based on at least one image frame difference between consecutive image frames. An alternating one directional search is performed to minimize the cost function to find an optimal translation. The optimal translation is applied to the sequence of image frames, resulting in a sequence of image frames with ego-translations.

First claim

Opening claim text (preview).

What is claimed is: 1. A system for compensating ego-translations in video captured with a moving camera, the system comprising: a video camera, wherein the video camera comprises: one or more processors and a memory, the memory being a non-transitory computer-readable medium having executable instructions encoded thereon, such that upon execution of the instructions, the one or more processors perform operations of: estimating translative ego-motion on a sequence of image frames captured by a moving camera by minimizing a cost function that is based on at least one image frame difference, wherein I and I′ represent consecutive image frames in the sequence of image frames, and wherein h and v represent shift translations, wherein an optimal translation (h, v) to compensate camera ego-motion is determined according to the following: min h , v ⁢ ∑ i ⁢ ⁢ j ⁢  I ′ ⁡ ( i + h , j + v ) - I ⁡ ( i , j )  , where i and j represent coordinates of a position in an image frame; wherein an alternating one directional search is performed to minimize the cost function to find the optimal translations; and applying the optimal translations to the sequence of image frames, resulting in a sequence of image frames with ego-translations. 2. The system as set forth in claim 1 , wherein the one or more processors further perform operations of: initiating a search for the optimal translation from an initial point; searching as far as possible in one of a horizontal or a vertical direction until reaching a point representing a local minimum for that direction; starting at the point representing the local minimum, searching as far as possible in the other of the horizontal or vertical direction; and alternating searching in the horizontal and vertical directions until a global minimum is reached, wherein the optimal translation is the global minimum. 3. The system as set forth in claim 2 , wherein (h 0 , v 0 ) is the initial point, and wherein the one or more processors perform operations of: (a) finding h 1 such that min h f(h,v 0 ) at h=h 1 ; (b) finding v 1 such that min v f(h 1 ,v) at v=v 1 ; (c) let Δh=|h 1 −h 0 |, Δv=|v 1 −v 0 |, then resetting h 0 :=h 1 and v 0 :=v 1 , where:=denotes “is defined as”; (d) repeating operations (a)-(c) until Δh and Δv are predetermined values. 4. The system as set forth in claim 2 , wherein the optimal translation found in a current image frame is used as the initial point for a next image frame in the sequence of image frames to increase a search speed in finding the optimal translation in the next image frame. 5. The system as set forth in claim 1 , wherein the one or more processors further perform an operation of detecting and highlighting independent moving objects in the sequence of image frames with ego-translations. 6. The system as set forth in claim 5 , wherein the independent moving objects are detected by multiplying two consecutive image frame differences d 1 and d 2 , wherein I 1 , I 2 , I 3 are three consecutive image frames in the sequence of image frames, and (h 1 , v 1 ) and (h 2 , v 2 ) are corresponding optimal translations to compensate the camera ego-motion, wherein d 1( i,j )=| I 1( i,j )− I 2( i+h 1, j+v 1)| and d 2( i,j )=| I 2( i,j )− I 3( i+h 2, j+v 2)|, wherein pixels for independent-moving objects are determined according to the following: m ( i,j )= d 1( i,j ) d 2( i+h 1, j+v 1), where m is an image map of detected object pixels, and (i, j) is a position in the image map. 7. The system as set forth in claim 5 , wherein independent moving objects are detected by multiplying image frame differences from non-adjacent image frames. 8. A computer implemented method for compensating ego-translations in video captured with a moving camera, the method comprising an act of: causing one or more processers to execute instructions encoded on a non-transitory computer-readable medium, such that upon execution, the one or more processors perform operations of: estimating translative ego-motion on a sequence of image frames captured by a moving camera by minimizing a cost function that is based on at least one image frame difference, wherein I and I′ represent consecutive image frames in the sequence of image frames, and wherein h and v represent shift translations, wherein an optimal translation (h, v) to compensate camera ego-motion is determined according to the following: min h , v ⁢ ∑ i ⁢ ⁢ j ⁢  I ′ ⁡ ( i + h , j + v ) - I

Assignees

Inventors

Classifications

  • taken from planes or by drones · CPC title

  • G06V20/13Primary

    Satellite images · CPC title

  • Physics · mapped topic

  • G06T7/20Primary

    Analysis of motion (motion estimation for coding, decoding, compressing or decompressing digital video signals H04N19/43, H04N19/51) · CPC title

  • Video; Image sequence · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10078902B1 cover?
Described is a system for compensating ego-translations in video captured with a moving camera. Translative ego-motion is estimated on a sequence of image frames captured by a moving camera by minimizing a cost function that is based on at least one image frame difference between consecutive image frames. An alternating one directional search is performed to minimize the cost function to find a…
Who is the assignee on this patent?
Hrl Lab Llc
What technology area does this patent fall under?
Primary CPC classification G06V20/13. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 18 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).