What technology area does this patent fall under?

Primary CPC classification G06T7/246. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Jun 17 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Systems and methods for generic visual odometry using learned features via neural camera models

US12333750B2 · US · B2

Patent metadata
Field	Value
Publication number	US-12333750-B2
Application number	US-202217722360-A
Country	US
Kind code	B2
Filing date	Apr 17, 2022
Priority date	Sep 15, 2020
Publication date	Jun 17, 2025
Grant date	Jun 17, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods for self-supervised learning for visual odometry using camera images, may include: estimating correspondences between keypoints of a target camera image and keypoints of a context camera image; based on the keypoint correspondences, lifting a set of 2D keypoints to 3D, using a neural camera model; and projecting the 3D keypoints into the context camera image using the neural camera model. Some embodiments may use the neural camera model to achieve the lifting and projecting of keypoints without a known or calibrated camera model.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: estimating correspondences between keypoints of a target camera image and keypoints of a context camera image, wherein the target camera image and the context camera image are obtained from a monocular sequence; using a ray surface decoder to predict a ray surface from the target image, wherein the predicted ray surface associates a respective pixel in the target image with a corresponding direction; based on the keypoint correspondences and the predicted ray surface, lifting a set of 2D keypoints to 3D, using a neural camera model; and projecting the 3D keypoints into the context camera image using the neural camera model. 2. The method of claim 1 , wherein lifting to 3D and projection to 2D achieved through the neural camera model are performed without a known or calibrated camera model. 3. The method of claim 1 , wherein the neural camera model is configured to learn a pixel-wise ray surface that enables learning depth and pose estimates in a self-supervised way from a wider variety of camera geometries. 4. The method of claim 1 , further comprising using a key point network to learn a keypoint matrix for the target camera image and the context camera image, wherein estimating the correspondences between the keypoints of the target camera image and the keypoints of the context camera image comprises using learned descriptors from the keypoint matrix to estimate the correspondences between the keypoints of the target camera image and the keypoints of the context camera image. 5. The method of claim 4 , wherein using the learned descriptors to estimate the correspondences between the target camera image and the context camera image comprises computing a set of corresponding keypoints comprising a keypoint from the target image and a warped corresponding keypoint in the context image. 6. The method of claim 4 , wherein using the learned descriptors to estimate the correspondences between the target image and the context image comprises self-supervised 3D keypoint learning. 7. The method of claim 1 , wherein using the ray surface decoder to predict the ray surface from the target image comprises learning a residual ray surface and adding the residual ray surface to a fixed ray surface to produce the predicted ray surface.

Assignees

Toyota Res Inst Inc

Inventors

Classifications

G06T2207/20084
Artificial neural networks [ANN] · CPC title
G06T15/205
Image-based rendering · CPC title
G06T7/73
using feature-based methods · CPC title
G06T7/246Primary
using feature-based methods, e.g. the tracking of corners or segments · CPC title
G06T3/18
Image warping, e.g. rearranging pixels individually · CPC title

Patent family

Related publications grouped by family.

View patent family 80627845

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12333750B2 cover?: Systems and methods for self-supervised learning for visual odometry using camera images, may include: estimating correspondences between keypoints of a target camera image and keypoints of a context camera image; based on the keypoint correspondences, lifting a set of 2D keypoints to 3D, using a neural camera model; and projecting the 3D keypoints into the context camera image using the neural…
Who is the assignee on this patent?: Toyota Res Inst Inc
What technology area does this patent fall under?: Primary CPC classification G06T7/246. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Jun 17 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).