Adversarial scene adaptation for camera pose regression

US11600017B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11600017-B2
Application numberUS-202016861838-A
CountryUS
Kind codeB2
Filing dateApr 29, 2020
Priority dateApr 29, 2020
Publication dateMar 7, 2023
Grant dateMar 7, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A pose estimation training system includes: a first model configured to generate a first 6 degrees of freedom (DoF) pose of a first camera that captured a first image from a first domain; a second model configured to generate a second 6 DoF pose of a second camera that captured a second image from a second domain, where the second domain is different than the first domain; a discriminator module configured to, based on first and second outputs from the first and second encoder modules, generate a discriminator output indicative of whether the first and second images are from the same domain; and a training control module configured to, based on the discriminator output, selectively adjust at least one weight value shared by the first model and the second model.

First claim

Opening claim text (preview).

What is claimed is: 1. A training system, comprising: a first model including: a first encoder module configured to generate a first vector representation based on a first image from a first domain using first weight values; a first localizer module configured to generate a second vector representation based on the first vector representation; and a first regressor module configured to generate a first 6 degrees of freedom (DoF) pose of a first camera that captured the first image based on the second vector representation using second weight values; a second model including: a second encoder module configured to generate a third vector representation based on a second image from a second domain using the first weight values, wherein the second domain is different than the first domain; a second localizer module configured to generate a fourth vector representation based on the third vector representation; and a second regressor module configured to generate a second 6 DoF pose of a second camera that captured the second image based on the fourth vector representation and using the second weight values; a discriminator module configured to, based on first and second outputs from the first and second encoder modules, generate a discriminator output indicative of whether the first and second images are from the same domain; and a training control module configured to, based on the discriminator output, selectively adjust at least one of: at least one of the first weight values; and at least one of the second weight values. 2. The training system of claim 1 wherein the training control module is configured to adjust at least one of the first weight values until the discriminator output indicates that the first and second images are from the same domain. 3. The training system of claim 1 wherein the training control module is configured to input a different first image to the first model in response to the discriminator output indicating that the first and second images are from the same domain. 4. The training system of claim 1 wherein the training control module is configured to input a different second image to the second model in response to the discriminator output indicating that the first and second images are from the same domain. 5. The training system of claim 1 wherein the training control module is configured to, in response to the discriminator output indicating that the first and second images are from the same domain: apply a geometric transformation to the first image, thereby generating a geometrically transformed image; and input the geometrically transformed image to the first model. 6. The training system of claim 5 wherein the geometric transformation includes a rotation of the image by a predetermined angle. 7. The training system of claim 5 wherein the geometric transformation includes one of translation, affine warping, and distortion. 8. The training system of claim 1 wherein the training control module is further configured to store in memory the first model, the first weight values, and the second weight values. 9. The training system of claim 1 wherein: the first localizer module includes at least one fully connected layers configured to map the first vector representation to the second vector representation; and the second localizer module includes the at least one fully connected layers configured to map the third vector representation to the fourth vector representation. 10. The training system of claim 1 wherein the first and second regressor modules each include at least two separate connected layers. 11. The training system of claim 1 wherein the first and second encoder modules each include an encoder and a fully connected layer localizer. 12. A vehicle comprising: at least one propulsion device; memory including: the first model of claim 1 ; and the first and second weight values; a camera configured to capture an image; a pose module configured to determine a 6 DoF pose of the camera based on the image and using the first model and the first and second weight values; and a control module configured to actuate the at least one propulsion device based on the 6 DoF pose of the camera. 13. The training system of claim 1 wherein the first and second DoF poses are absolute 6 DoF poses. 14. The training system of claim 1 wherein the first and second DoF poses are relative 6 DoF poses. 15. The training system of claim 1 wherein the training control module is configured to adjust at least one of the first weight values based on minimizing a loss of the discriminator module. 16. The training system of claim 1 wherein the training control module is configured to adjust at least one of the second weight values based on minimizing a loss of the first and second regressor modules. 17. The training system of claim 1 wherein: the first 6 DoF includes: a first three dimensional position of the first camera that captured the first image; and a first three dimensional orientation of the first camera that captured the first image; and the second 6 DoF includes: a second three dimensional position of the second camera that captured the second image; and a second three dimensional orientation of the second camera that captured the second image. 18. A training method, comprising: by a first model: generating a first vector representation based on a first image from a first domain using first weight values; generating a second vector representation based on the first vector representation; and generating a first 6 degrees of freedom (DoF) pose of a first camera that captured the first image based on the second vector representation using second weight values; by a second model: generating a third vector representation based on a second image from a second domain using the first weight values, wherein the second domain is different than the first domain; generating a fourth vector representation based on the third vector representation; and generating a second 6 DoF pose of a second camera that captured the second image based on the fourth vector representation and using the second weight values; based on first and second outputs from the first and second models, generating a discriminator output indicative of whether the first and second images are from the same domain; and based on the discriminator output, selectively adjusting at least one of: at least one of the first weight values; and at least one of the second weight values. 19. A non-transitory computer readable medium comprising code that, when executed by one or more processors, performs a training method, comprising: by a first model: generating a first vector representation based on a first image from a first domain using first weight values; generating a second vector representation based on the first vector representation; and generating a first 6 degrees of freedom (DoF) pose of a first camera that captured the first image based on the second vector representation using second weight values; by a second model: generating a third vector representation based on a second image from a second domain using the first weight values, wherein the second domain is different than the first domain; generating a fourth vector representation based on the third vector representation; and generating a second 6 DoF pose of a second camera that captured the second image based on the fourth vector representation and using the second weight values; based on first and second outpu

Assignees

Inventors

Classifications

  • G06N20/20Primary

    Ensemble learning · CPC title

  • G06T7/73Primary

    using feature-based methods · CPC title

  • Rotation of whole images or parts thereof · CPC title

  • Training; Learning · CPC title

  • G06T7/75Primary

    involving models · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11600017B2 cover?
A pose estimation training system includes: a first model configured to generate a first 6 degrees of freedom (DoF) pose of a first camera that captured a first image from a first domain; a second model configured to generate a second 6 DoF pose of a second camera that captured a second image from a second domain, where the second domain is different than the first domain; a discriminator modul…
Who is the assignee on this patent?
Naver Corp, Naver Labs Corp
What technology area does this patent fall under?
Primary CPC classification G06N20/20. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 07 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).