Semantically aware keypoint matching

US12183087B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12183087-B2
Application numberUS-202318489687-A
CountryUS
Kind codeB2
Filing dateOct 18, 2023
Priority dateApr 14, 2020
Publication dateDec 31, 2024
Grant dateDec 31, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for keypoint matching performed by a semantically aware keypoint matching model includes generating a semanticly segmented image from an image captured by a sensor of an agent, the semanticly segmented image associating a respective semantic label with each pixel of a group of pixels associated with the image. The method also includes generating a set of augmented keypoint descriptors by augmenting, for each keypoint of the set of keypoints associated with the image, a keypoint descriptor with semantic information associated with one or more pixels, of the semantically segmented image, corresponding to the keypoint. The method further includes controlling an action of the agent in accordance with identifying a target image having one or more first augmented keypoint descriptors that match one or more second augmented keypoint descriptors of the set of augmented keypoint descriptors.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for keypoint matching performed by a semantically aware keypoint matching model, the method comprising: generating a semanticly segmented image from an image captured by a sensor of an agent, the semanticly segmented image associating a respective semantic label with each pixel of a group of pixels associated with the image; generating a set of augmented keypoint descriptors by augmenting, for each keypoint of the set of keypoints associated with the image, a keypoint descriptor with semantic information associated with one or more pixels, of the semanticaly segmented image, corresponding to the keypoint; and controlling an action of the agent in accordance with identifying a target image having one or more first augmented keypoint descriptors that match one or more second augmented keypoint descriptors of the set of augmented keypoint descriptors. 2. The method of claim 1 , further comprising identifying the one or more first augmented keypoint descriptors of the target image matching the one or more second augmented keypoint descriptors of the image based on a nearest neighbor matching function. 3. The method of claim 2 , further comprising biasing the matching of the one or more first augmented keypoint descriptors of the target image and the one or more second augmented keypoint descriptors toward two or more descriptors having matching semantic labels. 4. The method of claim 1 , further comprising identifying a current location of the agent based on the target image, wherein controlling the action of the agent comprises navigating to a new location from the current location. 5. The method of claim 1 , wherein: the target image is identified from a plurality of stored images; and each one of the stored images is associated with one or more augmented keypoint descriptors. 6. The method of claim 1 , wherein the agent is an autonomous vehicle or a semi-autonomous vehicle. 7. The method of claim 1 , wherein the image is a monocular image. 8. An apparatus for keypoint matching performed by a semantically aware keypoint matching model, comprising: one or more processors; and one or more memories coupled with the one or more processors and storing instructions operable, when executed by the one or more processors, to cause the apparatus to: generate a semanticly segmented image from an image captured by a sensor of an agent, the semanticly segmented image associating a respective semantic label with each pixel of a group of pixels associated with the image; generate a set of augmented keypoint descriptors by augmenting, for each keypoint of the set of keypoints associated with the image, a keypoint descriptor with semantic information associated with one or more pixels, of the semanticaly segmented image, corresponding to the keypoint; and control an action of the agent in accordance with identifying a target image having one or more first augmented keypoint descriptors that match one or more second augmented keypoint descriptors of the set of augmented keypoint descriptors. 9. The apparatus of claim 8 , wherein execution of the instructions further cause the apparatus to identify the one or more first augmented keypoint descriptors of the target image matching the one or more second augmented keypoint descriptors of the image based on a nearest neighbor matching function. 10. The apparatus of claim 9 , wherein execution of the instructions further cause the apparatus to bias the matching of the one or more first augmented keypoint descriptors of the target image and the one or more second augmented keypoint descriptors toward two or more descriptors having matching semantic labels. 11. The apparatus of claim 8 , wherein execution of the instructions further cause the apparatus to identify a current location of the agent based on the target image, wherein controlling the action of the agent comprises navigating to a new location from the current location. 12. The apparatus of claim 8 , wherein: the target image is identified from a plurality of stored images; and each one of the stored images is associated with one or more augmented keypoint descriptors. 13. The apparatus of claim 8 , wherein the agent is an autonomous vehicle or a semi-autonomous vehicle. 14. The apparatus of claim 8 , wherein the image is a monocular image. 15. A non-transitory computer-readable medium having program code recorded thereon for keypoint matching performed by a semantically aware keypoint matching model, the program code executed by a processor and comprising: program code to generate a semanticly segmented image from an image captured by a sensor of an agent, the semanticly segmented image associating a respective semantic label with each pixel of a group of pixels associated with the image; program code to generate a set of augmented keypoint descriptors by augmenting, for each keypoint of the set of keypoints associated with the image, a keypoint descriptor with semantic information associated with one or more pixels, of the semanticaly segmented image, corresponding to the keypoint; and program code to control an action of the agent in accordance with identifying a target image having one or more first augmented keypoint descriptors that match one or more second augmented keypoint descriptors of the set of augmented keypoint descriptors. 16. The non-transitory computer-readable medium of claim 15 , wherein the program code further comprises program code to identify the one or more first augmented keypoint descriptors of the target image matching the one or more second augmented keypoint descriptors of the image based on a nearest neighbor matching function. 17. The non-transitory computer-readable medium of claim 16 , wherein the program code further comprises program code to bias the matching of the one or more first augmented keypoint descriptors of the target image and the one or more second augmented keypoint descriptors toward two or more descriptors having matching semantic labels. 18. The non-transitory computer-readable medium of claim 15 , wherein the program code further comprises program code to identify a current location of the agent based on the target image, wherein controlling the action of the agent comprises navigating to a new location from the current location. 19. The non-transitory computer-readable medium of claim 15 , wherein: the target image is identified from a plurality of stored images; and each one of the stored images is associated with one or more augmented keypoint descriptors. 20. The non-transitory computer-readable medium of claim 15 , wherein the agent is an autonomous vehicle or a semi-autonomous vehicle.

Assignees

Inventors

Classifications

  • Means capturing signals occurring naturally from the environment, e.g. ambient optical, acoustic, gravitational or magnetic signals (using passive navigation aids external to the vehicle G05D1/244; using signals from positioning sensors located off-board the vehicle G05D1/249) · CPC title

  • using artificial intelligence [AI] techniques · CPC title

  • Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching · CPC title

  • involving a learning process · CPC title

  • using a video camera in combination with image processing means · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12183087B2 cover?
A method for keypoint matching performed by a semantically aware keypoint matching model includes generating a semanticly segmented image from an image captured by a sensor of an agent, the semanticly segmented image associating a respective semantic label with each pixel of a group of pixels associated with the image. The method also includes generating a set of augmented keypoint descriptors …
Who is the assignee on this patent?
Toyota Res Inst Inc
What technology area does this patent fall under?
Primary CPC classification G06V20/56. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 31 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).