Optical flow estimation using 4-dimensional cost volume processing

US10467768B2 · US · B2

Patent metadata
Publication number: US-10467768-B2
Application number: US-201715482270-A
Country: US
Kind code: B2
Filing date: Apr 7, 2017
Priority date: Apr 7, 2017
Publication date: Nov 5, 2019
Grant date: Nov 5, 2019

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques are provided for estimation of optical flow between images using 4-dimensional cost volume processing. A methodology implementing the techniques according to an embodiment includes extracting a first set of feature vectors from a first image and extracting a second set of feature vectors from a second image. Each feature vector of the first set is associated with a pixel of the first image and each feature vector of the second set is associated with a pixel of the second image. The method further includes constructing a 4-dimensional (4D) cost volume to store a distance metric between each feature vector of the first set of feature vectors and a selected subset of feature vectors of the second set of feature vectors. The method further includes performing a flow-semi-global matching (Flow-SGM) on the 4D cost volume to estimate an optical flow vector for pixels of the first image.
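The cost volume described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation, and it makes several assumptions: feature maps are plain nested lists, the "selected subset" of second-image features is taken to be a square search window of hypothetical size `radius` around each pixel, and the distance is the Euclidean distance between unit-length features, computed from a dot product. The function names (`normalize`, `build_cost_volume`, `winner_take_all`) are illustrative only. The final per-pixel minimum is a simple stand-in and does not implement Flow-SGM.

```python
import math

def normalize(v):
    """Scale a feature vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)

def build_cost_volume(feat1, feat2, radius):
    """Return cost[y][x][dv][du]: the distance between the feature at (x, y) in
    image 1 and the feature at (x + du - radius, y + dv - radius) in image 2.
    Candidates that fall outside image 2 keep a cost of math.inf."""
    h, w = len(feat1), len(feat1[0])
    f1 = [[normalize(v) for v in row] for row in feat1]
    f2 = [[normalize(v) for v in row] for row in feat2]
    side = 2 * radius + 1
    cost = [[[[math.inf] * side for _ in range(side)]
             for _ in range(w)] for _ in range(h)]
    for y in range(h):
        for x in range(w):
            for dv in range(-radius, radius + 1):
                for du in range(-radius, radius + 1):
                    y2, x2 = y + dv, x + du
                    if 0 <= y2 < h and 0 <= x2 < w:
                        dot = sum(a * b for a, b in zip(f1[y][x], f2[y2][x2]))
                        # For unit vectors, ||a - b||^2 = 2 - 2 * (a . b).
                        dist = math.sqrt(max(0.0, 2.0 - 2.0 * dot))
                        cost[y][x][dv + radius][du + radius] = dist
    return cost

def winner_take_all(cost, radius):
    """Pick the lowest-cost (du, dv) per pixel. A stand-in, NOT Flow-SGM."""
    flow = []
    for row in cost:
        flow_row = []
        for cell in row:
            best = min(
                ((cell[j][i], i - radius, j - radius)
                 for j in range(len(cell)) for i in range(len(cell))),
                key=lambda t: t[0],
            )
            flow_row.append((best[1], best[2]))
        flow.append(flow_row)
    return flow

# Toy input: a 1 x 4 feature map whose content shifts one pixel to the right.
feat1 = [[[1, 0], [0, 1], [-1, 0], [0, -1]]]
feat2 = [[[5, 5], [1, 0], [0, 1], [-1, 0]]]
volume = build_cost_volume(feat1, feat2, radius=1)
flow = winner_take_all(volume, radius=1)
```

The volume is four-dimensional because each of the two pixel coordinates is paired with a two-dimensional displacement. Restricting the displacement to a window keeps the storage at height × width × (2·radius + 1)² entries instead of one entry per pair of pixels.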

First claim

Opening claim text (preview).

What is claimed is:

1. A processor-implemented method for optical flow estimation, the method comprising: extracting, by a processor-based system, a first set of feature vectors from a first image, each feature vector of the first set associated with a pixel of the first image; extracting, by the processor-based system, a second set of feature vectors from a second image, each feature vector of the second set associated with a pixel of the second image; constructing, by the processor-based system, a 4-dimensional (4D) cost volume to store distance metrics between one or more feature vectors of the first set of feature vectors and one or more feature vectors of the second set of feature vectors; and performing, by the processor-based system, a flow-semi-global matching (Flow-SGM) on the 4D cost volume to estimate an optical flow vector for pixels of the first image.

2. The method of claim 1, further comprising normalizing the extracted feature vectors to unity length, and calculating each of the distance metrics as a Euclidean distance using a vector dot product operation applied to the normalized extracted feature vectors.

3. The method of claim 1, further comprising rescaling and binning the distance metrics of the 4D cost volume to a selected integer range.

4. The method of claim 1, wherein the feature vector extraction is performed by a trained convolutional neural network (CNN), the training based on training data comprising pairs of training images and associated ground truth optical flow vectors.

5. The method of claim 4, wherein the training further comprises performing a stochastic gradient descent operation on the training data.

6. The method of claim 1, further comprising: down-sampling the first image and the second image, from an original resolution to a selected lower resolution; generating an estimated optical flow field comprising the estimated optical flow vector; and up-sampling the estimated optical flow field to the original resolution using interpolation.

7. The method of claim 6, further comprising post processing of the up-sampled estimated optical flow field to in-fill occluded regions, the in-filling based on extrapolation performed within homography fitted segments of the up-sampled estimated optical flow field.

8. The method of claim 1, further comprising providing the estimated optical flow vector to at least one of a video segmentation application, a motion detection application, an object tracking application, an action recognition application, an autonomous driving system, a computer navigation application, and a computer vision application.

9. The method of claim 1, wherein the constructing comprises constructing a 4-dimensional (4D) cost volume to store a distance metric between each feature vector of the first set of feature vectors and a selected subset of feature vectors of the second set of feature vectors.

10. A system for optical flow estimation, the system comprising: a feature extraction circuit to extract a first set of feature vectors from a first image, each feature vector of the first set associated with a pixel of the first image; and to extract a second set of feature vectors from a second image, each feature vector of the second set associated with a pixel of the second image; a cost volume construction circuit to construct a 4-dimensional (4D) cost volume to store a distance metric between each feature vector of the first set of feature vectors and a selected subset of feature vectors of the second set of feature vectors; and a cost volume processing circuit to perform a flow-semi-global matching (Flow-SGM) on the 4D cost volume to estimate an optical flow vector for pixels of the first image and to generate an estimated optical flow field comprising the estimated optical flow vector.

11. The system of claim 10, wherein the cost volume construction circuit is further to normalize the extracted feature vectors to unity length, and calculate the distance metric as a Euclidean distance using a vector dot product operation applied to the normalized extracted feature vectors.

12. The system of claim 10, wherein the cost volume construction circuit is further to rescale and bin the distance metrics of the 4D cost volume to a selected integer range.

13. The system of claim 10, wherein the feature extraction circuit further comprises a trained convolutional neural network (CNN) to extract the feature vectors, the training based on training data comprising pairs of training images and associated ground truth optical flow vectors.

14. The system of claim 13, further comprising a training system to train the CNN based on application of a stochastic gradient descent to the training data.

15. The system of claim 10, further comprising: an image down-sampling circuit to down-sample the first image and the second image, from an original resolution to a selected lower resolution; and an up-sampling circuit to up-sample the estimated optical flow field to the original resolution using interpolation.

16. The system of claim 15, further comprising a post-processing circuit to in-fill occluded regions of the up-sampled estimated optical flow field, the in-filling based on extrapolation performed within homography fitted segments of the up-sampled estimated optical flow field.

17. The system of claim 10, wherein the post-processing circuit is further to provide the estimated optical flow vector to at least one of a video segmentation application, a motion detection application, an object tracking application, an action recognition application, an autonomous driving system, a computer navigation application, and a computer vision application.

18. At least one non-transitory computer readable storage medium having instructions encoded thereon that, when executed by one or more processors, result in the following operations for optical flow estimation, the operations comprising: extracting a first set of feature vectors from a first image, each feature vector of the first set associated with a pixel of the first image; extracting a second set of feature vectors from a second image, each feature vector of the second set associated with a pixel of the second image; constructing a 4-dimensional (4D) cost volume to store a distance metric between each feature vector of the first set of feature vectors and a selected subset of feature vectors of the second set of feature vectors; and performing a flow-semi-global matching (Flow-SGM) on the 4D cost volume to estimate an optical flow vector for pixels of the first image or the second image or both of the first image and second image.

19. The computer readable storage medium of claim 18, further comprising the operations of normalizing the extracted feature vectors to unity length, and calculating the distance metric as a Euclidean distance using a vector dot product operation applied to the normalized extracted feature vectors.

20. The computer readable storage medium of claim 18, further comprising the operations of rescaling and binning the distance metrics of the 4D cost volume to a selected integer range.

21. The computer readable storage medium of claim 18, wherein the feature vector extraction is performed by a trained convolutional neural network (CNN), the training based on training data comprising pairs of training images and associated ground truth optical flow vectors.

22. The computer readable storage medium of claim 21, wherein the training further comprises the operati
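Claims 3, 12, and 20 recite rescaling and binning the distance metrics to a selected integer range. One possible reading is sketched below under stated assumptions: the costs are Euclidean distances between unit-length vectors (so they lie in [0, 2]), the integer range is 0 to 255, and missing candidates are mapped to the largest bin. The function name `quantize_costs` and its parameters `levels` and `max_cost` are hypothetical and do not come from the patent.

```python
import math

def quantize_costs(costs, levels=256, max_cost=2.0):
    """Map float costs in [0, max_cost] onto integer bins 0 .. levels - 1.
    Non-finite costs (e.g. out-of-image candidates) get the largest bin.
    The bin count and the cost ceiling are assumptions for illustration."""
    binned = []
    for c in costs:
        if not math.isfinite(c):
            binned.append(levels - 1)
            continue
        clipped = min(max(c, 0.0), max_cost)          # rescale step: clamp to range
        index = int(clipped / max_cost * levels)      # binning step: uniform bins
        binned.append(min(levels - 1, index))         # fold the top edge into the last bin
    return binned

example = quantize_costs([0.0, 1.0, 2.0, math.inf])
```

Storing one small integer per entry instead of a floating-point value reduces the memory needed for a volume that grows with both the image size and the search window.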

Assignees

Intel Corp

Inventors

Classifications

  • G06V10/82 · Primary

    using neural networks · CPC title

  • using classification, e.g. of video objects · CPC title

  • G06T7/60 · Primary

    Analysis of geometric attributes · CPC title

  • based on distances to training or reference patterns · CPC title

  • Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN] · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10467768B2 cover?
Techniques are provided for estimation of optical flow between images using 4-dimensional cost volume processing. A methodology implementing the techniques according to an embodiment includes extracting a first set of feature vectors from a first image and extracting a second set of feature vectors from a second image. Each feature vector of the first set is associated with a pixel of the first…
Who is the assignee on this patent?
Intel Corp
What technology area does this patent fall under?
Primary CPC classification G06V10/82. Mapped technology areas include Physics.
When was this patent published?
Publication date: Nov 5, 2019 (kind code B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).