Method and apparatus for tracking object, and method and apparatus for calculating object pose information
US-2015131859-A1 · May 14, 2015 · US
US9672441B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9672441-B2 |
| Application number | US-201514959891-A |
| Country | US |
| Kind code | B2 |
| Filing date | Dec 4, 2015 |
| Priority date | Jan 17, 2012 |
| Publication date | Jun 6, 2017 |
| Grant date | Jun 6, 2017 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Enhanced contrast between an object of interest and background surfaces visible in an image is provided using controlled lighting directed at the object. Exploiting the falloff of light intensity with distance, a light source (or multiple light sources), such as an infrared light source, can be positioned near one or more cameras to shine light onto the object while the camera(s) capture images. The captured images can be analyzed to distinguish object pixels from background pixels.
Opening claim text (preview).
The invention claimed is: 1. A method for analyzing captured images, the method comprising: capturing a first plurality of digital images of a field of view using a camera and at least one light source; identifying pixels of the first plurality of digital images that correspond to an object rather than to a background, wherein the pixels corresponding to the object are identified based on differences between first, second and third digital images captured by the capturing; capturing a second plurality of digital images; identifying pixels of the second plurality of digital images that correspond to the object rather than to the background; detecting a motion of the object based at least in part upon the identified pixels of the first plurality of digital images and the identified pixels of the second plurality of digital images; comparing the motion of the object to a library; and enabling a particular gesture associated with the motion of the object. 2. The method of claim 1 , wherein the capturing of the first plurality of digital images comprises: pulsing the at least one light source from the first illumination power to the second illumination power. 3. The method of claim 2 , wherein the pulsing comprises: pulsing the at least one light source at regular intervals while the camera captures the first plurality of digital images, such that a light intensity is stronger for object pixels than for background pixels. 4. The method of claim 3 , wherein the identifying of the pixels of the first plurality of digital images comprises: comparing pixels in successive digital images to distinguish the object pixels from the background pixels. 5. The method of claim 1 , wherein the identifying of the pixels of the first plurality of digital images comprises: identifying pixels that correspond to a hand rather than to a ceiling. 6. The method of claim 1 , wherein the identifying of the pixels of the first plurality of digital images comprises: sending a reduced representation of each digital image of the first plurality of digital images in which identified background pixels are zeroed out. 7. The method of claim 1 , wherein the capturing of the first plurality of digital images further comprises: adjusting an intensity of the at least one light source, such that the pixels corresponding to the object are saturated due to a brightness level of the object. 8. The method of claim 7 , wherein the identifying comprises: identifying the saturated pixels as pixels that correspond to the object. 9. The method of claim 1 , wherein the capturing of the first plurality of digital images further comprises: filtering light outside a band around a peak frequency of the at least one light source. 10. The method of claim 1 , wherein the identifying comprises: detecting in at least one digital image, of the first plurality of digital images, a Gaussian brightness falloff pattern indicative of a rounded object. 11. The method of claim 1 , wherein the constructing of the model further comprises: reconstructing a position of the object from an outline of a shape of the object, as seen from a particular vantage point, and defined by the identified pixels of the first plurality of digital images, the reconstruction of the position including: defining a plurality of lines tangent to the object in the first plurality of digital images from the particular vantage point in planes, referred to herein as “slices”, from the particular vantage point to the object in a given slice; and determining the position and an approximate cross-section of the object in the slice using one or more simple closed curves. 12. A wearable goggle, comprising: at least one camera oriented toward a field of view; at least one light source disposed on a same side of the field of view as the at least one camera and oriented to illuminate the field of view; and a processor coupled to the at least one camera and the at least one light source and configured to: capture a first plurality of digital images of the field of view using the at least one camera and the at least one light source; identify pixels of the first plurality of digital images that correspond to an object rather than to a background, wherein the pixels corresponding to the object are identified based on differences between first, second and third digital images; capture a second plurality of digital images; identify pixels of the second plurality of digital images that correspond to the object rather than to the background; detect a motion of the object based at least in part upon the identified pixels of the first plurality of digital images and the identified pixels of the second plurality of digital images; compare the motion of the object to a library; and enable a particular gesture associated with the motion of the object, the gesture indicating a user interaction with a virtual or augmented environment. 13. The wearable goggle of claim 12 , wherein the virtual or augmented environment comprises: a representation of a user's hand based at least in part upon the captured first plurality of digital images and the captured second plurality of digital images. 14. A computer system comprising: at least one camera oriented toward a field of view; at least one light source disposed on a same side of the field of view as the at least one camera and oriented to illuminate the field of view; and a processor coupled to the at least one camera and the at least one light source and configured to: capture a first plurality of digital images of the field of view using the at least one camera and the at least one light source; identify pixels of the first plurality of digital images that correspond to an object rather than to a background, wherein the pixels corresponding to the object are identified based on differences between first, second and third digital images; capture a second plurality of digital images; identify pixels of the second plurality of digital images that correspond to the object rather than to the background; detect a motion of the object based at least in part upon the identified pixels of the first plurality of digital images and the identified pixels of the second plurality of digital images; compare the motion of the object to a library; and enable a particular gesture associated with the motion of the object, the gesture indicating a user input to the computer system. 15. A tablet computer system comprising: at least one camera having a field of view and oriented toward a front surface so as to capture a motion of a user; at least one light source disposed on a same side of the field of view as the at least one camera and oriented to illuminate the field of view; and a processor coupled to the at least one camera and the at least one light source and configured to: capture a first plurality of digital images of the field of view using the at least one camera and the at least one light source; identify pixels of the first plurality of digital images that correspond to an object rather than to a background, wherein the pixels corresponding to the object are identified based on differences between first, second and third digital images; capture a second plurality of images; identify pixels of the second plurality of digital images that correspond to the object rather than to the background; detect a motion of the object based at least in part upon the identified pixels of the first plurality of digital images and the identified pixels of the second plurality of digital images; compare the motion of the object to a libr
Gesture based interaction, e.g. based on a set of recognized hand gestures (interaction based on gestures traced on a digitiser G06F3/04883) · CPC title
Transmitting camera control signals through networks, e.g. control via the Internet · CPC title
Motion-based segmentation · CPC title
involving reference images or patches · CPC title
Determining position or orientation of objects or cameras (camera calibration G06T7/80) · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.