Systems and methods for identifying users in media content based on poselets and neural networks

US9704029B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9704029-B2
Application numberUS-201615284296-A
CountryUS
Kind codeB2
Filing dateOct 3, 2016
Priority dateDec 17, 2014
Publication dateJul 11, 2017
Grant dateJul 11, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems, methods, and non-transitory computer-readable media can receive a first image including a representation of a first user. A second image including a representation of a second user can be received. A first set of poselets associated with the first user can be detected in the first image. A second set of poselets associated with the second user can be detected in the second image. The first image including the first set of poselets can be inputted into a first instance of a neural network to generate a first multi-dimensional vector. The second image including the second set of poselets can be inputted into a second instance of the neural network to generate a second multi-dimensional vector. A first distance metric between the first multi-dimensional vector and the second multi-dimensional vector can be determined.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method comprising: determining, by a computing system, a first distance between a first value and a second value, wherein the first value is based on provision of a first image including a first set of poselets associated with a first user into a first instance of a neural network and the second value is based on provision of a second image including a second set of poselets associated with the first user into a second instance of the neural network; determining, by the computing system, a second distance between the first value and a third value, wherein the third value is based on provision of a third image including a third set of poselets associated with a second user into a third instance of the neural network; and training, by the computing system, the neural network to cause the first distance to be less than the second distance. 2. The computer-implemented method of claim 1 , wherein at least one of the first value, the second value, and the third value is a multi-dimensional vector. 3. The computer-implemented method of claim 1 , wherein the first image corresponds to a query image, wherein the second image corresponds to a positive image, wherein the third image corresponds to a negative image, wherein the first image, the second image, and the third image are included in a set of training images. 4. The computer-implemented method of claim 1 , wherein the training of the neural network includes modifying one or more weights associated with the neural network via one or more neural network back-propagation processes. 5. The computer-implemented method of claim 1 , wherein the training of the neural network to cause the first distance to be lesser than the second distance is based on minimizing a loss metric, and wherein the loss metric is determined by calculating a maximum value between zero and (1−the second distance+the first distance). 6. The computer-implemented method of claim 5 , wherein the first distance corresponds to a first Euclidean distance between the first value and the second value, and wherein the second distance corresponds to a second Euclidean distance between the first value and the third value. 7. The computer-implemented method of claim 1 , further comprising: determining that the first distance is less than a specified distance threshold. 8. The computer-implemented method of claim 1 , wherein the first set of poselets and the second set of poselets are included within a set of defined poselets. 9. The computer-implemented method of claim 8 , wherein the set of defined poselets is associated with at least one of a body portion, a combination of multiple body portions, a front facial portion, a side facial portion, a head portion, a hair portion, a wearable article portion, a perspective, or a pose. 10. The computer-implemented method of claim 1 , wherein each of the first value and the second value is associated with 256 dimensions. 11. A system comprising: at least one processor; and a memory storing instructions that, when executed by the at least one processor, cause the system to perform: determining a first distance between a first value and a secondvalue, wherein the first value is based on provision of a first image including a first set of poselets associated with a first user into a first instance of a neural network and the second value is based on provision of a second image including a second set of poselets associated with the first user into a second instance of the neural network; determining a second distance between the first value and a third value, wherein the third value is based on provision of a third image including a third set of poselets associated with a second user into a third instance of the neural network; and training the neural network to cause the first distance to be less than the second distance. 12. The computer-implemented method of claim 11 , wherein at least one of the first value, the second value, and the third value is a multi-dimensional vector. 13. The system of claim 11 , wherein the training of the neural network to cause the first distance to be lesser than the second distance is based on minimizing a loss metric, and wherein the loss metric is determined by calculating a maximum value between zero and (1−the second distance+the first distance). 14. The system of claim 13 , wherein the first distance corresponds to a first Euclidean distance between the first value and the second value, and wherein the second distance corresponds to a second Euclidean distance between the first value and the third value. 15. The system of claim 11 , wherein the instructions cause the system to further perform: determining that the first distance is less than a specified distance threshold. 16. A non-transitory computer-readable storage medium including instructions that, when executed by at least one processor of a computing system, cause the computing system to perform a method comprising: determining a first distance between a first value and a second value, wherein the first value is based on provision of a first image including a first set of poselets associated with a first user into a first instance of a neural network and the second value is based on provision of a second image including a second set of poselets associated with the first user into a second instance of the neural network; determining a second distance between the first value and a third value, wherein the third value is based on provision of a third image including a third set of poselets associated with a second user into a third instance of the neural network; and training the neural network to cause the first distance to be less than the second distance. 17. The computer-implemented method of claim 16 , wherein at least one of the first value, the second value, and the third value is a multi-dimensional vector. 18. The non-transitory computer-readable storage medium of claim 16 , wherein the training of the neural network to cause the first distance to be lesser than the second distance is based on minimizing a loss metric, and wherein the loss metric is determined by calculating a maximum value between zero and (1−the second distance+the first distance). 19. The non-transitory computer-readable storage medium of claim 18 , wherein the first distance corresponds to a first Euclidean distance between the first value and the second value, and wherein the second distance corresponds to a second Euclidean distance between the first value and the third value. 20. The non-transitory computer-readable storage medium of claim 16 , wherein the instructions cause the system to further perform: determining that the first distance is less than a specified distance threshold.

Assignees

Inventors

Classifications

  • Classification techniques · CPC title

  • using neural networks · CPC title

  • Proximity, similarity or dissimilarity measures · CPC title

  • Matching criteria, e.g. proximity measures · CPC title

  • Distances to prototypes · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9704029B2 cover?
Systems, methods, and non-transitory computer-readable media can receive a first image including a representation of a first user. A second image including a representation of a second user can be received. A first set of poselets associated with the first user can be detected in the first image. A second set of poselets associated with the second user can be detected in the second image. The f…
Who is the assignee on this patent?
Facebook Inc
What technology area does this patent fall under?
Primary CPC classification G06V40/10. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 11 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).