Who is the assignee on this patent?

Microsoft Technology Licensing Llc

What technology area does this patent fall under?

Primary CPC classification G06V20/20. Mapped technology areas include Physics.

When was this patent published?

Publication date Thu Oct 04 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Camera/object pose from predicted coordinates

US2018285697A1 · US · A1

Patent metadata
Field	Value
Publication number	US-2018285697-A1
Application number	US-201815895990-A
Country	US
Kind code	A1
Filing date	Feb 13, 2018
Priority date	Feb 22, 2013
Publication date	Oct 4, 2018
Grant date	—

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Camera or object pose calculation is described, for example, to relocalize a mobile camera (such as on a smart phone) in a known environment or to compute the pose of an object moving relative to a fixed camera. The pose information is useful for robotics, augmented reality, navigation and other applications. In various embodiments where camera pose is calculated, a trained machine learning system associates image elements from an image of a scene, with points in the scene's 3D world coordinate frame. In examples where the camera is fixed and the pose of an object is to be calculated, the trained machine learning system associates image elements from an image of the object with points in an object coordinate frame. In examples, the image elements may be noisy and incomplete and a pose inference engine calculates an accurate estimate of the pose.

First claim

Opening claim text (preview).

1 . A method of calculating pose of an entity comprising: receiving, at a processor, at least one image where the image is either: of the entity and captured by a fixed camera, or of a scene captured by a mobile camera in the scene; applying image elements of the at least one image to a trained machine learning system to obtain a plurality of associations between image elements and points in either entity coordinates or scene coordinates; and calculating the pose of the entity from the associations. 2 . A method as claimed in claim 1 where the entity is a mobile camera and the pose of the camera is calculated. 3 . A method as claimed in claim 1 where the entity is an object and the pose of the object is calculated using the at least one image captured by a fixed camera. 4 . A method as claimed in claim 1 comprising calculating the pose of the entity as parameters having six degrees of freedom, three indicating rotation of the entity and three indicating position of the entity. 5 . A method as claimed in claim 1 , the machine learning system having been trained using images with image elements labeled either with scene coordinates or object coordinates. 6 . A method as claimed in claim 1 where the machine learning system is a random decision forest. 7 . A method as claimed in claim 1 where the machine learning system comprises a plurality of trained random forests and the method comprises applying the image elements of the at least one image to the plurality of trained random forests, each random forest having been trained using images from a different one of a plurality of scenes, and calculating which of the scenes the mobile camera was in when the at least one image was captured. 8 . A method as claimed in claim 1 the machine learning system having been trained using images of a plurality of scenes with image elements labeled with scene identifiers and labeled with scene coordinates of points in the scene the image elements depict. 9 . A method as claimed in claim 1 comprising applying only a subsample of the image elements of the at least one image to the trained machine learning system. 10 . A method as claimed in claim 1 comprising calculating the pose by searching amongst a set of possible pose candidates and using samples of associations between image elements and points to assess the pose candidates. 11 . A method as claimed in claim 1 comprising receiving at the processor, a stream of images, and calculating the pose by searching amongst a set of possible pose candidates which includes a pose calculated from another image in the stream. 12 . A method as claimed in claim 1 at least partially carried out using hardware logic selected from any one or more of: a field-programmable gate array, a program-specific integrated circuit, a program-specific standard product, a system-on-a-chip, a complex programmable logic device, a graphics processing unit. 13 . A method as claimed in claim 1 where the entity is a mobile camera and the pose of the camera is calculated, the method comprising accessing a 3D model of the scene and refining the camera pose using the accessed 3D model. 14 . A method comprising: receiving, at a processor, a plurality of images, each image having a plurality of image elements labeled with coordinates of points either in a scene the image elements depict or of an object the image elements depict; training a machine learning system using the received plurality of images such that when an image element from another image is applied to the machine learning system, an estimate of a coordinate of a point the image element depicts is produced. 15 . A method as claimed in claim 14 comprising receiving a plurality of images of sub-scenes, each having a plurality of image elements labeled with scene coordinates in a space in which the sub-scenes are embedded; and training the machine learning system using the received plurality of images of sub-scenes. 16 . A pose tracker comprising: a processor arranged to receive at least one image either of an object captured by a fixed camera, or of a scene captured by a mobile camera; the processor arranged to apply image elements of the at least one image to a trained machine learning system to obtain a plurality of associations between image elements and points in either object coordinates or scene coordinates; and a pose inference engine arranged to calculate a position and orientation of either the object or the mobile camera from the associations. 17 . A pose tracker as claimed in claim 16 the processor arranged to apply only a subsample of the image elements of the at least one image to the trained machine learning system. 18 . A pose tracker as claimed in claim 16 the pose inference engine arranged to calculate the pose by searching amongst a set of possible pose candidates and using samples of associations between image elements and points in either object coordinates or scene coordinates to assess the pose candidates. 19 . A pose tracker as claimed in claim 16 the processor arranged to receive a stream of images, and comprising a pose inference engine arranged to calculate the pose by searching amongst a set of possible pose candidates which includes a pose calculated from another image in the stream. 20 . A pose tracker as claimed in claim 16 at least partially implemented using hardware logic selected from any one or more of: a field-programmable gate array, a program-specific integrated circuit, a program-specific standard product, a system-on-a-chip, a complex programmable logic device, a graphics processing unit.

Assignees

Microsoft Technology Licensing Llc

Inventors

Classifications

G06V10/7625
Hierarchical techniques, i.e. dividing or merging patterns to obtain a tree-like representation; Dendograms · CPC title
G06V10/774
Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title
G06V10/764
using classification, e.g. of video objects · CPC title
G06V20/20Primary
in augmented reality scenes · CPC title
G06F18/24323
Tree-organised classifiers · CPC title

Patent family

Related publications grouped by family.

View patent family 50239978

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2018285697A1 cover?: Camera or object pose calculation is described, for example, to relocalize a mobile camera (such as on a smart phone) in a known environment or to compute the pose of an object moving relative to a fixed camera. The pose information is useful for robotics, augmented reality, navigation and other applications. In various embodiments where camera pose is calculated, a trained machine learning sys…
Who is the assignee on this patent?: Microsoft Technology Licensing Llc
What technology area does this patent fall under?: Primary CPC classification G06V20/20. Mapped technology areas include Physics.
When was this patent published?: Publication date Thu Oct 04 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).