Real-time monocular structure from motion

US10706582B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10706582-B2
Application numberUS-201815965480-A
CountryUS
Kind codeB2
Filing dateApr 27, 2018
Priority dateSep 17, 2012
Publication dateJul 7, 2020
Grant dateJul 7, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods are described for multithreaded navigation assistance by acquired with a single camera on-board a vehicle, using 2D-3D correspondences for continuous pose estimation, and combining the pose estimation with 2D-2D epipolar search to replenish 3D points.

First claim

Opening claim text (preview).

What is claimed is: 1. A system for autonomous vehicular navigation using a multithreaded monocular structure from motion (SFM) architecture, comprising: a single camera positioned on-board a vehicle; and a multithreaded processor coupled to the single camera and a non-transitory computer-readable storage medium, the multithreaded processor being included in the multithreaded SFM architecture, and being configured for: acquiring images using the single camera positioned on-board the vehicle; estimating camera motion using monocular SFM by performing continuous camera pose estimation of 2D-3D correspondences; detecting a local planarity of a road and correcting for scale drift using the monocular SFM based on the camera pose estimation; determining visual odometry correspondences and replenishing 3D points of the images by combining the pose estimation with a 2D-2D multi-threaded per-frame epipolar search, the per-frame epipolar search continuously generating, for each frame, redundantly validated 3D points persistent across comparatively long tracks, the determining visual odometry further comprising: parallelizing the epipolar search across a plurality of threads; validating the 3D points among intermediate points, performing a local bundle adjustment, and retaining the validated 3D points for insertion of new 3D points at a keyframe in a main thread to frontload the validated 3D points for the pose estimation; performing, in real-time, the pose estimation at each of a plurality of frames using only the validated 3D points to reduce processing requirements and increase processing speed of the pose estimation; executing a real-time global bundle adjustment in a thread-safe architecture in parallel with the real-time pose estimation; determining an optimized planar homography mapping between a road in two frames based on one or more determined ground estimation cues, the cues including triangulated sparse 3D points, inter-frame dense stereo matching, and a displacement computed based on a distance from a vanishing point; and estimating per-frame relative importances of the cues by computing observation covariances for each of the cues and performing ground plane estimation by combining the triangulated sparse 3D points and cues using a Kalman filter; and autonomously controlling driving functions of the vehicle based on the determined visual odometry and the local planarity of the road. 2. The system of claim 1 , wherein the processor is further configured for providing fast 3D-2D correspondences using pose-guided matching. 3. The system of claim 1 , wherein the processor is further configured for performing epipolar constrained search to produce per-frame 2D-2D correspondences. 4. The system of claim 1 , wherein the processor is further configured for performing vanishing point detection to hypothesize a feature match search window along one or more radial lines from the VP for pruning mismatches due to repeated features. 5. The system of claim 1 , wherein the processor is further configured for validating each 3D point against all frames in real-time, refining cameras and 3D points by the performing the local bundle adjustment, and collecting and refining 3D points from an epipolar thread. 6. The system of claim 1 , wherein the processor is further configured for collecting and refinding to allow bundle adjustment using long tracks. 7. The system of claim 1 , wherein the processor is further configured for performing real-time scale correcting by combining scale estimates from 3D points and planar homography mappings. 8. The system of claim 1 , wherein the validating the 3D points provides increased accuracy and timing of the pose estimation based on the epipolar constrained search, triangulation and backprojection. 9. The system of claim 1 , wherein the data-driven covariance learning comprises adapting the observation covariance on a per-frame basis by combining the cues triangulated sparse 3D points and homography-guided dense inter-frame stereo cues using a Kalman filter, where relative importance of cues is estimated on a per-frame basis learned through observation covariances. 10. A method for vehicular navigation using a multithreaded monocular structure from motion (SFM) architecture, comprising: autonomously navigating a vehicle using a multithreaded processor coupled to a non-transitory computer-readable storage medium, the multithreaded processor being included in the multithreaded SFM architecture, and being configured for: acquiring images using a single camera positioned on-board a vehicle; estimating camera motion using monocular SFM by performing continuous camera pose estimation of 2D-3D correspondences; detecting a local planarity of a road and correcting for scale drift using the monocular SFM based on the camera pose estimation; determining visual odometry correspondences and replenishing 3D points of the images by combining the pose estimation with a 2D-2D multi-threaded per-frame epipolar search, the per-frame epipolar search continuously generating, for each frame, redundantly validated 3D points persistent across comparatively long tracks, the determining visual odometry further comprising: parallelizing the epipolar search across a plurality of threads; validating the 3D points among intermediate points, performing a local bundle adjustment, and retaining the validated 3D points for insertion of new 3D points at a keyframe in a main thread to frontload the validated 3D points for the pose estimation; performing, in real-time, the pose estimation at each of a plurality of frames using only the validated 3D points to reduce processing requirements and increase processing speed of the pose estimation; executing a real-time global bundle adjustment in a thread-safe architecture in parallel with the real-time pose estimation; determining an optimized planar homography mapping between a road in two frames based on one or more determined ground estimation cues, the cues including triangulated sparse 3D points, inter-frame dense stereo matching, and a displacement computed based on a distance from a vanishing point; and estimating per-frame relative importances of the cues by computing observation covariances for each of the cues and performing ground plane estimation by combining the triangulated sparse 3D points and cues using a Kalman filter; and autonomously controlling driving functions of the vehicle based on the determined visual odometry and the local planarity of the road. 11. The method of claim 10 , further comprising providing fast 3D-2D correspondences using pose-guided matching. 12. The method of claim 10 , further comprising performing epipolar constrained search to produce per-frame 2D-2D correspondences. 13. The method of claim 10 , further comprising performing vanishing point detection to hypothesize a feature match search window along one or more radial lines from the VP for pruning mismatches due to repeated features. 14. The method of claim 10 , further comprising validating each 3D point against all frames in real-time, refining cameras and 3D points by the performing the local bundle adjustment, and collecting and refining 3D points from an epipolar thread. 15. The method of claim 10 , further comprising collecting and refinding to allow bundle adjustment using long tracks. 16. The method of claim 10 , further comprising performing real-time scale correcting by combining scale estimates from 3D points and planar homography mappings. 17. The method of claim 10 , wherein the validating the 3D points provides inc

Assignees

Inventors

Classifications

  • G06T7/73Primary

    using feature-based methods · CPC title

  • exterior to a vehicle by using sensors mounted on the vehicle · CPC title

  • Terrestrial scenes (scenes under surveillance with static cameras G06V20/52; scenes perceived from the exterior of a vehicle G06V20/56; scenes perceived from the interior of a vehicle G06V20/59) · CPC title

  • Recognition of the road, e.g. of lane markings; Recognition of the vehicle driving pattern in relation to the road · CPC title

  • Salient point detection; Corner detection · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10706582B2 cover?
Systems and methods are described for multithreaded navigation assistance by acquired with a single camera on-board a vehicle, using 2D-3D correspondences for continuous pose estimation, and combining the pose estimation with 2D-2D epipolar search to replenish 3D points.
Who is the assignee on this patent?
Nec Lab America Inc, Nec Corp
What technology area does this patent fall under?
Primary CPC classification G06T7/73. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 07 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).