Adaptive sampling of training views

US10769437B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10769437-B2
Application numberUS-201815949728-A
CountryUS
Kind codeB2
Filing dateApr 10, 2018
Priority dateApr 10, 2018
Publication dateSep 8, 2020
Grant dateSep 8, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A head-mounted display, a method, and a non-transitory computer readable medium are provided. An embodiment of a method for obtaining training sample views of an object includes the step of storing, in a memory, multiple views of an object. The method also includes the step of deriving similarity scores between adjacent views and then a sampling density is varied based on the similarity scores.

First claim

Opening claim text (preview).

What is claimed is: 1. A non-transitory computer-readable medium storing thereon a computer program that, when executed by one or more processors, performs a method comprising the steps of: (a) receiving, in a memory, feature data sets of (i) a reference object or (ii) a 3D model, each feature data set being obtained from an image captured or generated from a primary set of views within a view range around the reference object or the 3D model, (b) deriving similarity scores between each pair of adjacent views among the primary set of views using the feature data sets, (c) sampling a secondary set of views disposed between each pair of the adjacent primary views that have the similarity score being equal to or less than a threshold so that sampling density varies across the view range, and (d) generating data corresponding to (i) the primary set of views including each pair of the adjacent primary views that have the similarity score being equal to or less than the threshold and (ii) the secondary set of views. 2. The non-transitory computer-readable medium of claim 1 , further performing the steps of: sampling at least one view from a group of the primary set of views and the secondary set of views at a coarse sampling resolution at equal angles from each other with respect to azimuth and/or elevation; and storing the sampled views in the memory. 3. The non-transitory computer-readable medium of claim 1 , the method further comprising performing sampling at a finer sampling resolution, relative to a sampling resolution of the sampling in step (c), near adjacent views having similarity scores lower than a threshold to obtain further sampling views. 4. The non-transitory computer-readable medium of claim 3 , the method further performing the steps of: (e) deriving similarity scores between the additional views sampled at the finer sampling resolution; (f) performing an even finer sampling resolution near the additional views having low similarity scores; and (g) repeating steps (e) and (f) until the similarity scores exceed a minimum threshold. 5. The non-transitory computer-readable medium of claim 1 , wherein similarity scores are based on appearance and/or shape of adjacent views of the reference object or the 3D model. 6. The non-transitory computer-readable medium of claim 1 , the method further performing the step of: clustering images generated from adjacent views having high similarity scores to reduce a total number of images. 7. A method comprising the steps of: (a) receiving, in a memory, feature data sets of (i) a reference object or (ii) a 3D model, each feature data set being obtained from an image captured or generated from a primary set of views within a view range around the reference object or the 3D model, (b) deriving similarity scores between each pair of adjacent views among the primary set of views using the feature data sets, (c) sampling a secondary set of views disposed between each pair of the adjacent primary views that have the similarity score being equal to or less than a threshold so that sampling density varies across the view range, and (d) generating data corresponding to (i) the primary set of views including each pair of the adjacent primary views that have the similarity score being equal to or less than the threshold and (ii) the secondary set of views. 8. The method of claim 7 , wherein the step of sampling the views includes sampling views denser between a pair of adjacent views than between another pair of adjacent views, depending on the similarity scores. 9. The method of claim 7 , further comprising the steps of: storing the sampled views in the memory. 10. The method of claim 7 , wherein the adjacent views are sampled at equal distances from a primary view. 11. The method of claim 7 , further comprising performing additional sampling at a finer sampling resolution, relative to a sampling resolution of the sampling in step (c), near adjacent views having similarity scores lower than a threshold to obtain further sampling views. 12. The method of claim 11 , further comprising the steps of: a) deriving similarity scores between the additional views sampled at the finer sampling resolution; b) performing an even finer sampling resolution near the additional views having low similarity scores; and c) repeating steps (a) and (b) until the similarity scores exceed a minimum threshold. 13. The method of claim 7 , wherein the data as training data is used for detecting the object. 14. The method of claim 13 , wherein, after the views are sampled and the training data is accumulated, a head-mounted display device is configured to detect the object using the training data. 15. The method of claim 7 , wherein similarity scores are based on appearance and/or shape of the object. 16. A head-mounted display device comprising: memory configured to store data used as training data for detecting an object, the data derived from multiple views of the object; a processing device configured to utilize the training data to detect the object; and a display component configured to display images of the detected object; wherein, before object detection, the multiple views are obtained using a varying sampling density based on similarity scores of adjacent views by the steps of: (a) receiving feature data sets of (i) a reference object or (ii) a 3D model, each feature data set being obtained from an image captured or generated from a primary set of views within a view range around the reference object or the 3D model, (b) deriving similarity scores between each pair of adjacent views among the primary set of views using the feature data sets, (c) sampling a secondary set of views disposed between each pair of the adjacent primary views that have the similarity score being equal to or less than a threshold so that sampling density varies across the view range, and (d) generating the data corresponding to (i) the primary set of views including each pair of the adjacent primary views that have the similarity score being equal to or less than the threshold and (ii) the secondary set of views. 17. The head-mounted display device of claim 16 , wherein, before object detection, the multiple views are originally sampled at a coarse sampling resolution at locations surrounding the object. 18. The head-mounted display device of claim 17 , wherein, before object detection, the multiple views are sampled by: performing a finer sampling resolution near adjacent views having low similarity scores to obtain additional views; deriving similarity scores between the additional views sampled at the finer sampling resolution; performing an even finer sampling resolution near the additional views having low similarity scores; and repeating the deriving and performing steps until the similarity scores exceed a minimum threshold. 19. The head-mounted display device of claim 18 , wherein the step of sampling the views includes sampling views denser between a pair of adjacent views than between another pair of adjacent views, depending on the similarity scores. 20. The non-transitory computer-readable medium according to claim 1 , wherein the method further comprises: receiving information identifying an object detection device including a camera for use in detecting a pose of an object in a real scene, the object corresponding to the reference object or the 3D model; acquiring, based at least in part on the information identifying the object detection device, a camera parameter se

Assignees

Inventors

Classifications

  • Obtaining sets of training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

  • Organisation of the matching processes, e.g. simultaneous or sequential comparisons of image or video features; Coarse-fine approaches, e.g. multi-scale approaches; using context analysis; Selection of dictionaries · CPC title

  • Terrestrial scenes (scenes under surveillance with static cameras G06V20/52; scenes perceived from the exterior of a vehicle G06V20/56; scenes perceived from the interior of a vehicle G06V20/59) · CPC title

  • G06T19/006Primary

    Mixed reality (object pose determination, tracking or camera calibration for mixed reality G06T7/00) · CPC title

  • Generating training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10769437B2 cover?
A head-mounted display, a method, and a non-transitory computer readable medium are provided. An embodiment of a method for obtaining training sample views of an object includes the step of storing, in a memory, multiple views of an object. The method also includes the step of deriving similarity scores between adjacent views and then a sampling density is varied based on the similarity scores.
Who is the assignee on this patent?
Seiko Epson Corp
What technology area does this patent fall under?
Primary CPC classification G06T19/006. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 08 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 9 related publications on this page (citations in our corpus or others sharing the same primary CPC).