Semi-supervised learning using clustering as an additional constraint

US11954881B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11954881-B2
Application numberUS-201916513991-A
CountryUS
Kind codeB2
Filing dateJul 17, 2019
Priority dateAug 28, 2018
Publication dateApr 9, 2024
Grant dateApr 9, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In some implementations a neural network is trained to perform a main task using a clustering constraint, for example, using both a main task training loss and a clustering training loss. Training inputs are inputted into a main task neural network to produce output labels predicting locations of the parts of the objects in the training inputs. Data from pooled layers of the main task neural network is inputted into a clustering neural network. The main task neural network and the clustering neural network are trained based on a main task loss from the main task neural network and a clustering loss from the clustering neural network. The main task loss is determined by comparing differences between the output labels and the training labels. The clustering loss encourages the clustering network to learn to label the parts of the objects individually, e.g., to learn groups corresponding to the object parts.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: at an electronic device with one or more processors: obtaining a training set of training inputs and corresponding training labels, the training labels identifying known locations of parts of objects in the training inputs; inputting the training inputs into a main task neural network to produce output labels predicting locations of the parts of the objects in the training inputs; inputting data from pooled layers of the main task neural network into a clustering neural network to identify clusters comprising a first group of features in a first subset of the pooled layers that correspond with a first part of at least one object of the objects and a second group of features in a second subset of the pooled layers that correspond with a second part of the at least one object of the objects, wherein the first subset is different than the second subset of pooled; and training the main task neural network and the clustering neural network based on a main task loss from the main task neural network and a clustering loss from the clustering neural network. 2. The method of claim 1 further comprising: inputting additional inputs into the main task neural network to produce additional output labels and corresponding confidence values; selecting, based on the confidence values, an automatically-labeled training set of data comprising a subset of the additional inputs and a corresponding subset of the additional output labels; and further training the main task neural network and the clustering neural network using the automatically-labeled training set of data. 3. The method of claim 1 further comprising determining the main task loss by comparing the output labels and the training labels. 4. The method of claim 1 further comprising determining the main task loss using learned quality assurance metrics. 5. The method of claim 1 wherein the clustering loss is configured to cause the clustering neural network to learn to label the parts of the objects individually. 6. The method of claim 1 , wherein the clustering loss is configured to cause the clustering neural network to learn groups corresponding to the parts of the objects. 7. The method of claim 1 , wherein the clustering neural network is trained to identify a first group of the pooled layers corresponding to a first pattern and a second group of the pooled layers corresponding to a second pattern. 8. The method of claim 1 , wherein the main task neural network and the clustering neural network are trained together using the main task loss and the clustering loss to cause clusters learnt by the clustering neural network to correspond to the parts of the objects. 9. The method of claim 1 , wherein the main task neural network and the clustering neural network are trained together using the main task loss and the clustering loss to cause similarity between sub-parts of feature maps across multiple images. 10. The method of claim 1 , wherein groups learned by the clustering neural network correspond to human body parts. 11. The method of claim 1 , wherein a number of groups learned by the clustering neural network corresponds to a number of the parts of the objects. 12. The method of claim 1 , wherein the main task neural network is trained for human pose estimation, wherein the parts of the objects correspond to parts of a skeleton representing human pose. 13. The method of claim 1 , wherein the main task neural network is trained for hand tracking, body tracking, or gaze tracking. 14. The method of claim 1 , wherein the main task neural network is trained for semantic segmentation of audio. 15. The method of claim 1 , wherein the main task neural network is trained for semantic segmentation of text. 16. The method of claim 1 further comprising integrating the main task neural network into an application stored on a non-transitory computer-readable medium. 17. A system comprising: a non-transitory computer-readable storage medium; and one or more processors coupled to the non-transitory computer-readable storage medium, wherein the non-transitory computer-readable storage medium comprises program instructions that, when executed on the one or more processors, cause the system to perform operations comprising: obtaining a training set of training inputs and corresponding training labels, the training labels identifying known locations of parts of objects in the training inputs; inputting the training inputs into a main task neural network to produce output labels predicting locations of the parts of the objects in the training inputs; inputting data from pooled layers of the main task neural network into a clustering neural network to identify clusters comprising a first group of features in a first subset of the pooled layers that correspond with a first part of at least one object of the objects and a second group of features in a second subset of the pooled layers that correspond with a second part of the at least one object of the objects, wherein the first subset is different than the second subset; and training the main task neural network and the clustering neural network based on a main task loss from the main task neural network and a clustering loss from the clustering neural network. 18. The system of claim 17 , wherein the operations further comprise: inputting additional inputs into the main task neural network to produce additional output labels and corresponding confidence values; selecting, based on the confidence values, an automatically-labeled training set of data comprising a subset of the additional inputs and a corresponding subset of the additional output labels; and further training the main task neural network and the clustering neural network using the automatically-labeled training set of data. 19. The system of claim 17 , wherein the operations further comprise determining the main task loss by comparing the output labels and the training labels. 20. A non-transitory computer-readable storage medium, storing program instructions computer-executable on a computer to perform operations comprising: obtaining a training set of training inputs and corresponding training labels, the training labels identifying known locations of parts of objects in the training inputs; inputting the training inputs into a main task neural network to produce output labels predicting locations of the parts of the objects in the training inputs; inputting data from pooled layers of the main task neural network into a clustering neural network to identify clusters comprising a first group of features in a first subset of the pooled layers that correspond with a first part of at least one object of the objects and a second group of features in a second subset of the pooled layers that correspond with a second part of the at least one object of the objects, wherein the first subset is different than the second subset; and training the main task neural network and the clustering neural network based on a main task loss from the main task neural network and a clustering loss from the clustering neural network.

Assignees

Inventors

Classifications

  • Weakly supervised learning, e.g. semi-supervised or self-supervised learning · CPC title

  • Supervised learning · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • G06T7/73Primary

    using feature-based methods · CPC title

  • by evaluating different subsets according to an optimisation criterion, e.g. class separability, forward selection or backward elimination · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11954881B2 cover?
In some implementations a neural network is trained to perform a main task using a clustering constraint, for example, using both a main task training loss and a clustering training loss. Training inputs are inputted into a main task neural network to produce output labels predicting locations of the parts of the objects in the training inputs. Data from pooled layers of the main task neural ne…
Who is the assignee on this patent?
Apple Inc
What technology area does this patent fall under?
Primary CPC classification G06T7/73. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 09 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).