Key points detection using multiple image modalities

US11967102B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11967102-B2
Application numberUS-202117378495-A
CountryUS
Kind codeB2
Filing dateJul 16, 2021
Priority dateJul 16, 2021
Publication dateApr 23, 2024
Grant dateApr 23, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Image-based key points detection using a convolutional neural network (CNN) may be impacted if the key points are occluded in the image. Images obtained from additional imaging modalities such as depth and/or thermal images may be used in conjunction with RGB images to reduce or minimize the impact of the occlusion. The additional images may be used to determine adjustment values that are then applied to the weights of the CNN so that the convolution operations may be performed in a modality aware manner to increase the robustness, accuracy, and efficiency of key point detection.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method for detecting key points of an object, the method comprising: obtaining a first image of the object, wherein the first image comprises a first imaging property of the object; obtaining a second image of the object, wherein the second image comprises a second imaging property of the object; and processing the first image through a convolutional neural network (CNN) based on at least the first imaging property and the second imaging property to determine one or more key points of the object, wherein: the CNN comprises a first convolution layer that includes multiple kernels each associated with a respective set of weights; processing the first image through the CNN comprises performing, via the first convolution layer of the CNN, a first set of convolution operations on the first image based on the first imaging property of the object; and performing the first set of convolution operations on the first image comprises, for a pixel that belongs to a patch of the first image: identifying a corresponding patch in the second image; adjusting a weight of the first convolution layer to be applied to the pixel, wherein the adjustment is made based on the second imaging property associated with two or more pixels of the corresponding patch in the second image; and performing the first set of convolution operations based on the adjusted weight of the first convolution layer. 2. The method of claim 1 , wherein the first image comprises a red/green/blue (RGB) image of the object and the second image comprises a depth or thermal image of the object. 3. The method of claim 1 , wherein the one or more key points are occluded in the first image. 4. The method of claim 1 , wherein the CNN further comprises a second convolution layer that includes multiple kernels each associated with a respective second set of weights, the method further comprises performing a second set of convolution operations on an output of the first convolution layer, and wherein the second set of weights associated with each of the multiple kernels of the second convolution layer is adjusted based on the second imaging property of the object during the performance of the second set of convolution operations. 5. The method of claim 1 , wherein the CNN further comprises a second convolution layer that includes multiple kernels each associated with a respective second set of weights, the method further comprises performing a second set of convolution operations on an output of the first convolution layer, and wherein the second set of weights associated with each of the multiple kernels of the second convolution layer is not adjusted based on the second imaging property of the object during the performance of the second set of convolution operations. 6. The method of claim 1 , wherein the CNN comprises multiple additional convolution layers each associated with a respective set of weights, the method further comprises performing convolution operations on the first image via the multiple additional convolution layers, and wherein the respective set of weights associated with each of the multiple additional convolution layers is adjusted based on the second imaging property of the object during the performance of the convolution operations. 7. The method of claim 1 , wherein the one or more key points of the object are determined using a single branch of the CNN configured to process the first image in conjunction with the second image. 8. The method of claim 1 , wherein the one or more key points include a joint location. 9. An apparatus configured to detect key points of an object, the apparatus comprising: one or more processors configured to: obtain a first image of the object, wherein the first image comprises a first imaging property of the object; obtain a second image of the object, wherein the second image comprises a second imaging property of the object; and process the first image through a convolutional neural network (CNN) based on at least the first imaging property and the second imaging property to determine one or more key points of the object, wherein: the CNN comprises a first convolution layer that includes multiple kernels each associated with a respective set of weights; the one or more processors being configured to process the first image through the CNN comprises the one or more processors being configured to perform, via the first convolution layer of the CNN, a first set of convolution operations on the first image based on the first imaging property; and the one or more processors being configured to perform the first set of convolution operations on the first image comprises the one or more processors being configured to, for a pixel that belongs to a patch of the first image: identify a corresponding patch in the second image; adjust a weight of the first convolution layer to be applied to the pixel, wherein the adjustment is made based on the second imaging property associated with two or more pixels of the corresponding patch in the second image; and perform the first set of convolution operations based on the adjusted weight of the first convolution layer. 10. The apparatus of claim 9 , wherein the first image comprises a red/green/blue (RGB) image of the object and the second image comprises a depth or thermal image of the object. 11. The apparatus of claim 9 , wherein the one or more key points are occluded in the first image. 12. The apparatus of claim 9 , wherein the CNN further comprises a second convolution layer that includes multiple kernels each associated with a respective second set of weights, the one or more processors are further configured to perform a second set of convolution operations on an output of the first convolution layer, and wherein the second set of weights associated with each of the multiple kernels of the second convolution layer is adjusted based on the second imaging property of the object during the performance of the second set of convolution operations. 13. The apparatus of claim 9 , wherein the CNN further comprises a second convolution layer that includes multiple kernels each associated with a respective second set of weights, the one or more processors are further configured to perform a second set of convolution operations on an output of the first convolution layer, and wherein the second set of weights associated with each of the multiple kernels of the second convolution layer is not adjusted based on the second imaging property of the object. 14. The apparatus of claim 9 , wherein the CNN comprises multiple additional convolution layers each associated with a respective set of weights, the one or more processors are further configured to perform convolution operations on the first image via the multiple additional convolution layers, and wherein the respective set of weights associated with each of the multiple additional convolution layers is adjusted based on the second imaging property of the object during the performance of the convolution operations. 15. The apparatus of claim 9 , wherein the one or more key points of the object are determined using a single branch of the CNN configured to process the first image in conjunction with the second image. 16. The apparatus of claim 9 , wherein the one or more key points include a joint location.

Assignees

Inventors

Classifications

  • G06T7/73Primary

    using feature-based methods · CPC title

  • Combinations of networks · CPC title

  • Biomedical image inspection · CPC title

  • Static body considered as a whole, e.g. static pedestrian or occupant recognition · CPC title

  • Color image · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11967102B2 cover?
Image-based key points detection using a convolutional neural network (CNN) may be impacted if the key points are occluded in the image. Images obtained from additional imaging modalities such as depth and/or thermal images may be used in conjunction with RGB images to reduce or minimize the impact of the occlusion. The additional images may be used to determine adjustment values that are then …
Who is the assignee on this patent?
Shanghai United Imaging Intelligence Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06T7/73. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 23 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).