Who is the assignee on this patent?

Microsoft Technology Licensing, LLC

What technology area does this patent fall under?

Primary CPC classification G06T17/00. Mapped technology areas include Physics.

When was this patent published?

Publication date: Jun 8, 2021 (kind code B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Pose synthesis

US11030772B2 · US · B2

Patent metadata
Field	Value
Publication number	US-11030772-B2
Application number	US-201916429696-A
Country	US
Kind code	B2
Filing date	Jun 3, 2019
Priority date	Jun 3, 2019
Publication date	Jun 8, 2021
Grant date	Jun 8, 2021
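The identifiers and dates in the table above can be handled programmatically. The following is a minimal sketch, assuming the hyphen-separated COUNTRY-NUMBER-KIND layout and the "Jun 3, 2019" date style shown in the table. The helper names `parse_publication_number` and `parse_date` are hypothetical and are not part of any patent-office API.

```python
from datetime import date, datetime


def parse_publication_number(pub: str) -> dict:
    """Split a hyphenated publication number such as 'US-11030772-B2'."""
    country, number, kind = pub.split("-")
    return {"country": country, "number": number, "kind": kind}


def parse_date(text: str) -> date:
    """Parse a date written like 'Jun 3, 2019' (the style used in the table)."""
    return datetime.strptime(text, "%b %d, %Y").date()


parts = parse_publication_number("US-11030772-B2")
filing = parse_date("Jun 3, 2019")
grant = parse_date("Jun 8, 2021")

# Days elapsed between the filing date and the grant date.
pendency_days = (grant - filing).days

print(parts)          # {'country': 'US', 'number': '11030772', 'kind': 'B2'}
print(pendency_days)  # 736
```

Because the priority date equals the filing date here, the same 736-day span also separates priority from grant.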

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Examples are disclosed that relate to computing devices and methods for synthesizing a novel pose of an object. One example provides a method comprising receiving a reference image of an object corresponding to an original viewpoint. The reference image of the object is translated into a depth map of the object, and a new depth map of the object is synthesized to correspond to a new viewpoint. A new image of the object is generated from the new viewpoint based on the new depth map of the object and the reference image of the object.
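The data flow described in the abstract can be sketched as three chained stages. This is an illustration only, under loud assumptions: the function `synthesize_novel_pose`, the type aliases, and the three placeholder stages are all hypothetical, and the placeholders are trivial arithmetic stand-ins, not the models disclosed in the patent.

```python
from typing import Callable, List

Image = List[List[float]]     # toy grayscale image: rows of pixel values
DepthMap = List[List[float]]  # toy depth map with the same layout


def synthesize_novel_pose(
    reference_image: Image,
    to_depth: Callable[[Image], DepthMap],
    to_new_view: Callable[[DepthMap], DepthMap],
    recover_identity: Callable[[Image, DepthMap], Image],
) -> Image:
    """Chain the three stages named in the abstract."""
    reference_depth = to_depth(reference_image)  # image -> depth map
    new_depth = to_new_view(reference_depth)     # original -> new viewpoint
    # The final stage receives BOTH the new depth map and the reference image.
    return recover_identity(reference_image, new_depth)


# Placeholder stages (hypothetical, NOT the patented models); they exist
# only so that the data flow can be executed end to end.
def fake_depth(img: Image) -> DepthMap:
    return [[1.0 - p for p in row] for row in img]


def fake_new_view(depth: DepthMap) -> DepthMap:
    return [row[::-1] for row in depth]  # pretend the new viewpoint is a mirror


def fake_recover(img: Image, depth: DepthMap) -> Image:
    # A real model would take appearance from `img`; this stand-in ignores it.
    return [[1.0 - d for d in row] for row in depth]


reference = [[0.0, 0.5, 1.0]]
new_image = synthesize_novel_pose(reference, fake_depth, fake_new_view, fake_recover)
print(new_image)  # [[1.0, 0.5, 0.0]]
```

Passing the stages in as callables keeps the sketch neutral about how each stage is implemented, which the abstract itself does not specify.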

First claim

Opening claim text (preview).

The invention claimed is:
1. Enacted on a computing system, a method for synthesizing a novel pose of an object, the method comprising: receiving a reference image of an object corresponding to an original viewpoint; translating the reference image of the object into a reference depth map of the object; synthesizing a new depth map of the object corresponding to a new viewpoint; and inputting the reference image of the object and the new depth map of the object into an identity recovery model to generate a new image of the object from the new viewpoint.
2. The method of claim 1, wherein translating the reference image of the object into the reference depth map of the object comprises: inputting the reference image of the object into a domain transfer module; and receiving the reference depth map of the object from the domain transfer module.
3. The method of claim 2, further comprising receiving a foreground mask from the domain transfer module, the foreground mask identifying pixels associated with the object.
4. The method of claim 2, wherein the domain transfer module comprises a domain transfer model, the method further comprising training the domain transfer model on a dataset of paired images and depth maps.
5. The method of claim 1, wherein synthesizing the new depth map of the object corresponding to the new viewpoint comprises: inputting the reference depth map of the object into a depth map generator; receiving the new depth map of the object from the depth map generator; and refining the new depth map of the object using a 3D depth refinement module.
6. The method of claim 5, further comprising receiving a sequence of new depth maps from the depth map generator, and wherein refining the new depth map comprises using a 3D convolutional neural network to enforce consistency among the sequence of new depth maps.
7. The method of claim 1, wherein generating the new image of the object comprises: mapping the reference image of the object to an appearance parameter; mapping the new depth map of the object to a shape parameter; and combining the shape parameter and the appearance parameter to generate the new image of the object from the new viewpoint.
8. The method of claim 1, further comprising training the identity recovery model on unpaired depth and image data.
9. The method of claim 1, further comprising training the identity recovery model by: using a first structure encoder to map the reference image to a reference shape parameter; using a second structure encoder to map the new depth map to a new shape parameter; using a first appearance encoder to map the reference image to a reference appearance parameter; using a second appearance encoder to map the new depth map to a new appearance parameter; and combining each of the reference shape parameter and the new shape parameter with one of the reference appearance parameter and the new appearance parameter to generate an image.
10. The method of claim 1, further comprising training the identity recovery model using supervised learning and unsupervised learning.
11. The method of claim 10, further comprising directly supervising the training using the reference image of the object, the reference depth map, and the new depth map.
12. A computing device comprising: a processor; and a storage device storing instructions executable by the processor to receive a reference image of an object corresponding to an original viewpoint; translate the reference image of the object into a reference depth map of the object; synthesize a new depth map of the object corresponding to a new viewpoint; and input the reference image of the object and the new depth map of the object into an identity recovery model to generate a new image of the object from the new viewpoint.
13. The computing device of claim 12, wherein the instructions are further executable to translate the reference image of the object into the reference depth map of the object by: inputting the reference image of the object into a domain transfer module; and receiving the reference depth map of the object from the domain transfer module.
14. The computing device of claim 13, wherein the instructions are further executable to receive a foreground mask from the domain transfer module, the foreground mask identifying pixels associated with the object.
15. The computing device of claim 12, wherein the instructions are further executable to generate the new image of the object by: mapping the reference image of the object to an appearance parameter; mapping the new depth map of the object to a shape parameter; and combining the shape parameter and the appearance parameter to generate the new image of the object from the new viewpoint.
16. The computing device of claim 12, wherein the instructions are further executable to train the identity recovery model by: using a first structure encoder to map the reference image to a reference shape parameter; using a second structure encoder to map the new depth map to a new shape parameter; using a first appearance encoder to map the reference image to a reference appearance parameter; using a second appearance encoder to map the new depth map to a new appearance parameter; and combining each of the reference shape parameter and the new shape parameter with one of the reference appearance parameter and the new appearance parameter to generate an image.
17. The computing device of claim 16, wherein the instructions are further executable to directly supervise the training using the reference image of the object, the reference depth map, and the new depth map.
18. A computing device comprising: a processor; and a storage device storing instructions executable by the processor to receive a reference image of an object corresponding to an original viewpoint; translate the reference image of the object into a reference depth map of the object; synthesize a new depth map of the object corresponding to a new viewpoint; and input the reference image of the object and the new depth map of the object into an identity recovery model to generate a new image of the object from the new viewpoint, wherein the instructions are further executable to generate the new image of the object by: mapping the reference image of the object to an appearance parameter; mapping the new depth map of the object to a shape parameter; and combining the shape parameter and the appearance parameter to generate a new image of the object from the new viewpoint.
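When reading the claims, it helps to see which ones stand alone and which refer back to an earlier claim. The sketch below transcribes the "of claim N" references from the claim text above into a lookup table. The names `PARENT`, `independent_claims`, and `chain` are hypothetical helpers introduced for illustration.

```python
# Parent claim for each dependent claim, transcribed from the
# "The method of claim N" / "The computing device of claim N" references.
PARENT = {
    2: 1, 3: 2, 4: 2, 5: 1, 6: 5, 7: 1, 8: 1, 9: 1, 10: 1, 11: 10,
    13: 12, 14: 13, 15: 12, 16: 12, 17: 16,
}
ALL_CLAIMS = range(1, 19)  # claims 1 through 18


def independent_claims() -> list:
    """Claims that do not refer back to any other claim."""
    return [c for c in ALL_CLAIMS if c not in PARENT]


def chain(claim: int) -> list:
    """Return the claim followed by each claim it depends on, up to its root."""
    path = [claim]
    while path[-1] in PARENT:
        path.append(PARENT[path[-1]])
    return path


print(independent_claims())  # [1, 12, 18]
print(chain(6))              # [6, 5, 1]
print(chain(17))             # [17, 16, 12]
```

A dependent claim incorporates every limitation of the claims in its chain, so claim 6, for example, is read together with claims 5 and 1.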

Assignees

Microsoft Technology Licensing, LLC

Inventors

Classifications

G06T2207/30244
Camera pose
G06T2207/20221
Image fusion; Image merging
G06T2200/04
involving 3D image data
G06T17/00 (primary)
Three-dimensional [3D] modelling for computer graphics
G06T7/73
using feature-based methods
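The classification symbols listed above follow the general CPC layout of section letter, two-digit class, subclass letter, main group, and subgroup. The following is a minimal sketch of splitting a symbol into those parts, assuming that layout. The pattern `CPC_PATTERN` and the function `parse_cpc` are hypothetical names for illustration, not an official parser.

```python
import re

# Section (A-H or Y), two-digit class, subclass letter,
# main group (1-4 digits), "/" and subgroup (2-6 digits).
CPC_PATTERN = re.compile(r"^([A-HY])(\d{2})([A-Z])(\d{1,4})/(\d{2,6})$")


def parse_cpc(symbol: str) -> dict:
    """Split a CPC symbol such as 'G06T17/00' into its hierarchical parts."""
    match = CPC_PATTERN.match(symbol)
    if match is None:
        raise ValueError(f"not a recognizable CPC symbol: {symbol!r}")
    section, cls, subclass, main_group, subgroup = match.groups()
    return {
        "section": section,
        "class": section + cls,
        "subclass": section + cls + subclass,
        "main_group": main_group,
        "subgroup": subgroup,
    }


primary = parse_cpc("G06T17/00")
indexing = parse_cpc("G06T2207/30244")
print(primary)
print(indexing["main_group"], indexing["subgroup"])  # 2207 30244
```

Section G is the CPC section titled Physics, which is why this page maps the primary classification G06T17/00 to the Physics technology area.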

Patent family

Related publications grouped by family.

View patent family 70554253

External sources



Related patents

Registration of range images using virtual gimbal information

Method and device for determining the shape of an object represented in an image, corresponding computer program product and computer-readable medium

Real-time 3D reconstruction with power efficient depth sensor usage

Learning image processing tasks from scene reconstructions
