Optimizations for dynamic object instance detection, segmentation, and structure mapping

US10586350B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10586350-B2
Application numberUS-201815972035-A
CountryUS
Kind codeB2
Filing dateMay 4, 2018
Priority dateDec 3, 2017
Publication dateMar 10, 2020
Grant dateMar 10, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In one embodiment, a system accesses pose probability models for predetermined parts of a body depicted in an image. Each of the pose probability models is configured for determining a probability of the associated predetermined body part being at a location in the image. The system determines a candidate pose that is defined by a set of coordinates representing candidate locations of the predetermined body parts. The system further determines a first probability score for the candidate pose based on the pose probability models and the set of coordinates of the candidate pose. A pose representation is generated for the candidate pose using a transformation model and the candidate pose. The system determines a second probability score for the pose representation based on a pose-representation probability model. The system selects the candidate pose to represent a pose of the body based on at least the first and second probability scores.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising, by a computing system: accessing a plurality of pose probability models for a plurality of predetermined parts of a body that is depicted in an image, respectively, wherein each of the plurality of pose probability models is configured for determining a probability of the associated predetermined part of the body being at a location in the image, wherein the plurality of pose probability models is generated by a machine-learning model; determining a candidate pose that is defined by a set of coordinates representing candidate locations of the predetermined parts of the body in the image; determining a first probability score for the candidate pose based on the plurality of pose probability models and the set of coordinates of the candidate pose; generating a pose representation for the candidate pose using a transformation model and the candidate pose; determining a second probability score for the pose representation based on a pose-representation probability model; and selecting the candidate pose to represent a pose of the body depicted in the image based on at least the first probability score and the second probability score. 2. The method of claim 1 , wherein each coordinate in the set of coordinates of the candidate pose is defined in a first coordinate system of the image; wherein the pose representation is defined in a first spatial dimension and is generated by applying the transformation model to a set of normalized coordinates that correspond to the set of coordinates of the candidate pose, respectively; and wherein each coordinate in the set of normalized coordinates is defined in a second coordinate system that is different from the first coordinate system. 3. The method of claim 2 , further comprising: reprojecting the pose representation from the first spatial dimension into a second spatial dimension associated with the second coordinate system; and computing a reprojection error based on the reprojected pose representation and the normalized coordinates; wherein the selection of the candidate pose is further based on the reprojection error. 4. The method of claim 2 , wherein the pose representation is generated based on differences between the set of normalized coordinates and an aggregate representation of a plurality of sets of normalized coordinates that are associated with a plurality of poses, respectively. 5. The method of claim 2 , wherein the second coordinate system is defined relative to one or more of the predetermined parts of the body. 6. The method of claim 1 , wherein each of the plurality of probability models is a probability heat map. 7. The method of claim 1 , wherein the transformation model is generated using principal component analysis. 8. The method of claim 1 , wherein at least one of the plurality of predetermined parts of the body corresponds to a joint of the body. 9. A system comprising: one or more processors and one or more computer-readable non-transitory storage media coupled to one or more of the processors, the one or more computer-readable non-transitory storage media comprising instructions operable when executed by one or more of the processors to cause the system to perform operations comprising: accessing a plurality of pose probability models for a plurality of predetermined parts of a body that is depicted in an image, respectively, wherein each of the plurality of pose probability models is configured for determining a probability of the associated predetermined part of the body being at a location in the image, wherein the plurality of pose probability models is generated by a machine-learning model; determining a candidate pose that is defined by a set of coordinates representing candidate locations of the predetermined parts of the body in the image; determining a first probability score for the candidate pose based on the plurality of pose probability models and the set of coordinates of the candidate pose; generating a pose representation for the candidate pose using a transformation model and the candidate pose; determining a second probability score for the pose representation based on a pose-representation probability model; and selecting the candidate pose to represent a pose of the body depicted in the image based on at least the first probability score and the second probability score. 10. The system of claim 9 , wherein each coordinate in the set of coordinates of the candidate pose is defined in a first coordinate system of the image; wherein the pose representation is defined in a first spatial dimension and is generated by applying the transformation model to a set of normalized coordinates that correspond to the set of coordinates of the candidate pose, respectively; and wherein each coordinate in the set of normalized coordinates is defined in a second coordinate system that is different from the first coordinate system. 11. The system of claim 10 , wherein the processors are further operable when executing the instructions to perform operations comprising: reprojecting the pose representation from the first spatial dimension into a second spatial dimension associated with the second coordinate system; and computing a reprojection error based on the reprojected pose representation and the normalized coordinates; wherein the selection of the candidate pose is further based on the reprojection error. 12. The system of claim 10 , wherein the pose representation is generated based on differences between the set of normalized coordinates and an aggregate representation of a plurality of sets of normalized coordinates that are associated with a plurality of poses, respectively. 13. The system of claim 10 , wherein the second coordinate system is defined relative to one or more of the predetermined parts of the body. 14. The system of claim 9 , wherein each of the plurality of probability models is a probability heat map. 15. One or more computer-readable non-transitory storage media embodying software that is operable when executed to cause one or more processors to perform operations comprising: accessing a plurality of pose probability models for a plurality of predetermined parts of a body that is depicted in an image, respectively, wherein each of the plurality of pose probability models is configured for determining a probability of the associated predetermined part of the body being at a location in the image, wherein the plurality of pose probability models is generated by a machine-learning model; determining a candidate pose that is defined by a set of coordinates representing candidate locations of the predetermined parts of the body in the image; determining a first probability score for the candidate pose based on the plurality of pose probability models and the set of coordinates of the candidate pose; generating a pose representation for the candidate pose using a transformation model and the candidate pose; determining a second probability score for the pose representation based on a pose-representation probability model; and selecting the candidate pose to represent a pose of the body depicted in the image based on at least the first probability score and the second probability score. 16. The media of claim 15 , wherein each coordinate in the set of coordinates of the candidate pose is defined in a first coordinate system of the image; wherein the pose representation is defined in a first spatial dimension and is generated by applying the transformation model to a set of normalized coordinates that correspond to the set of coordi

Assignees

Inventors

Classifications

  • using classification, e.g. of video objects · CPC title

  • G06T7/75Primary

    involving models · CPC title

  • G06T7/73Primary

    using feature-based methods · CPC title

  • Artificial neural networks [ANN] · CPC title

  • Probabilistic image processing · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10586350B2 cover?
In one embodiment, a system accesses pose probability models for predetermined parts of a body depicted in an image. Each of the pose probability models is configured for determining a probability of the associated predetermined body part being at a location in the image. The system determines a candidate pose that is defined by a set of coordinates representing candidate locations of the prede…
Who is the assignee on this patent?
Facebook Inc
What technology area does this patent fall under?
Primary CPC classification G06T7/75. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 10 2020 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).