Systems and methods for scale invariant 3D object detection leveraging processor architecture

US9659217B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9659217-B2
Application numberUS-201615219798-A
CountryUS
Kind codeB2
Filing dateJul 26, 2016
Priority dateAug 22, 2014
Publication dateMay 23, 2017
Grant dateMay 23, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An example method includes receiving a plurality of templates of a plurality of objects, where a template comprises feature values sampled at corresponding points of a two-dimensional grid of points positioned over a particular view of an object and scaled based on a depth of the object at the particular view. The method may further include receiving an image of an environment and determining a matrix representative of the image, where a row of the matrix comprises feature values sampled at a particular point of the two-dimensional grid positioned over one or more locations within the image and scaled based on depths of the one or more locations. The method may additionally include determining at least one similarity vector corresponding to at least one template and using the at least one similarity vector to identify at least one matching template for at least one object located within the image.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: receiving, at a computing device, a plurality of templates of a plurality of objects, wherein a template of an object comprises a plurality of feature values sampled at corresponding points of a two-dimensional grid of points positioned over a particular view of the object and scaled based on a depth of the object at the particular view; receiving, at the computing device, an image of an environment; determining, by the computing device, feature values sampled at each point of the two-dimensional grid when the two-dimensional grid is positioned over a plurality of locations within the image of the environment and scaled based on respective depths of the plurality of locations within the image of the environment; and identifying, by the computing device based on the determined feature values, a matching template from the plurality of templates for a particular object located at one of the plurality of locations within the image of the environment. 2. The method of claim 1 , wherein the image of the environment is received from an optical sensor, and wherein the two-dimensional grid is scaled relative to a canonical depth that is chosen based on a predicted viewing range of the optical sensor. 3. The method of claim 1 , wherein the plurality of feature values in each template of the plurality of templates comprise angles of two-dimensional gradient vectors representative of directional color changes. 4. The method of claim 1 , wherein the plurality of locations within the image comprise equally spaced pixels within the image. 5. The method of claim 1 , further comprising determining a matrix representative of the image, wherein a row of the matrix comprises the feature values sampled at a particular point of the two-dimensional grid positioned over the plurality locations within the image and scaled based on the respective depths of the plurality of locations within the image; and using the matrix representative of the image to identify the matching template for the particular object. 6. The method of claim 5 , further comprising storing the matrix in a memory storage, wherein the rows of the matrix are stored linearly within the memory storage. 7. The method of claim 5 , wherein the rows of the matrix representative of the image are ordered based on a numbering of the points of the two-dimensional grid, wherein the numbering starts at a center point of the two-dimensional grid and increases according to a spiral of grid points extending from the center point. 8. The method of claim 7 , further comprising: determining additional feature values from plurality of locations within the image at a plurality of additional grid points, wherein the plurality of additional grid points expand the two-dimensional grid; and determining a plurality of additional rows for the matrix comprising the additional feature values, wherein the plurality of additional rows are ordered by numbering the plurality of additional grid points according to the spiral of grid points. 9. The method of claim 8 , further comprising: identifying an object at a location from the plurality of locations within the image that is outside of the two-dimensional grid positioned over the location and scaled based on the depth of the location; and adding the plurality of additional grid points to expand the two-dimensional grid to cover the identified object. 10. A non-transitory computer readable medium having stored therein instructions, that when executed by a computing system, cause the computing system to perform functions comprising: receiving a plurality of templates of a plurality of objects, wherein a template of an object comprises a plurality of feature values sampled at corresponding points of a two-dimensional grid of points positioned over a particular view of the object and scaled based on a depth of the object at the particular view; receiving an image of an environment; determining feature values sampled at each point of the two-dimensional grid when the two-dimensional grid is positioned over a plurality of locations within the image of the environment and scaled based on respective depths of the plurality of locations within the image of the environment; and identifying, based on the determined feature values, a matching template from the plurality of templates for a particular object located at one of the plurality of locations within the image of the environment. 11. The non-transitory computer readable medium of claim 10 , wherein the image of the environment is received from an optical sensor, and wherein the two-dimensional grid is scaled relative to a canonical depth that is chosen based on a predicted viewing range of the optical sensor. 12. The non-transitory computer readable medium of claim 10 , wherein the plurality of feature values in each template of the plurality of templates comprise numerical representations of normal vectors from surfaces of objects. 13. The non-transitory computer readable medium of claim 10 , wherein the plurality of locations within the image comprise equally spaced pixels within the image. 14. The non-transitory computer readable medium of claim 10 , the functions further comprising determining a matrix representative of the image, wherein a row of the matrix comprises the feature values sampled at a particular point of the two-dimensional grid positioned over the plurality locations within the image and scaled based on the respective depths of the plurality of locations within the image; and using the matrix representative of the image to identify the matching template for the particular object. 15. The non-transitory computer readable medium of claim 14 , the functions further comprising storing the matrix in a memory storage, wherein the rows of the matrix are stored linearly within the memory storage. 16. The non-transitory computer readable medium of claim 14 , wherein the rows of the matrix representative of the image are ordered based on a numbering of the points of the two-dimensional grid, wherein the numbering starts at a center point of the two-dimensional grid and increases according to a spiral of grid points extending from the center point. 17. The non-transitory computer readable medium of claim 16 , the functions further comprising: determining additional feature values from plurality of locations within the image at a plurality of additional grid points, wherein the plurality of additional grid points expand the two-dimensional grid; and determining a plurality of additional rows for the matrix comprising the additional feature values, wherein the plurality of additional rows are ordered by numbering the plurality of additional grid points according to the spiral of grid points. 18. A system, comprising: at least one optical sensor; at least one processor; and a non-transitory computer readable medium having stored therein instructions, that when executed by the at least one processor, cause the at least one processor to: receive a plurality of templates of a plurality of objects, wherein a template of an object comprises a plurality of feature values sampled at corresponding points of a two-dimensional grid of points positioned over a particular view of the object and scaled based on a depth of the object at the particular view; receive, from the at least one optical sensor, an image of an environment; determine feature values sampled at each point of the two-dimensional grid when the two-dimensional grid is positioned over a plurality of locations within the image

Assignees

Inventors

Classifications

  • B25J5/007Primary

    mounted on wheels · CPC title

  • Terrestrial scenes (scenes under surveillance with static cameras G06V20/52; scenes perceived from the exterior of a vehicle G06V20/56; scenes perceived from the interior of a vehicle G06V20/59) · CPC title

  • Matching criteria, e.g. proximity measures · CPC title

  • Encoded features or binary features, e.g. local binary patterns [LBP] · CPC title

  • including video camera means · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9659217B2 cover?
An example method includes receiving a plurality of templates of a plurality of objects, where a template comprises feature values sampled at corresponding points of a two-dimensional grid of points positioned over a particular view of an object and scaled based on a depth of the object at the particular view. The method may further include receiving an image of an environment and determining a…
Who is the assignee on this patent?
X Dev Llc
What technology area does this patent fall under?
Primary CPC classification B25J5/007. Mapped technology areas include Operations & Transport.
When was this patent published?
Publication date Tue May 23 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).