Label inheritance for soft label generation in information processing system

US12548309B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12548309-B2
Application numberUS-202217569030-A
CountryUS
Kind codeB2
Filing dateJan 5, 2022
Priority dateJan 5, 2022
Publication dateFeb 10, 2026
Grant dateFeb 10, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Label inheritance techniques are disclosed for soft label generation in an information processing system that uses machine learning. For example, a method generates at least one label for a given data instance from a training data set useable to train a machine learning-based model. The at least one label is generated by assigning one or more labels associated with one or more ancestors of the data instance such that the data instance inherits the one or more labels associated with the one or more ancestors as the at least one label.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method, comprising: training a capsule neural network that incorporates one or more dynamic routing algorithms to estimate features of objects using a training data set comprising a plurality of image data instances to generate one or more mutually orthogonal capsules corresponding to one or more data instances of the training data set, each mutually orthogonal capsule comprising a vector representing an estimate of a local feature of a corresponding image data instance; extracting the one or more mutually orthogonal capsules inside the trained capsule neural network; computing a probabilistic bag of images representation for each of the one or more mutually orthogonal capsules based on a relation between each mutually orthogonal capsule and the training data set, the computing comprising: generating a first matrix associated with the training data set and the one or more mutually orthogonal capsules; generating a second matrix associated with the one or more mutually orthogonal capsules and corresponding dimensions of the one or more mutually orthogonal capsules; and multiplying the first matrix and the second matrix; identifying one or more ancestors of a given data instance based on the probabilistic bag of images representation of a given mutually orthogonal capsule associated with the given data instance; generating at least one soft label for the given data instance from the training data set, wherein the at least one soft label is generated by: comparing the probabilistic bag of images representation to the identified one or more ancestors of the given data instance to compute a similarity value for each of the identified one or more ancestors; and assigning one or more labels associated with one or more ancestors of the given data instance based on the computed similarity values such that the given data instance inherits the one or more labels associated with at least one of the identified one or more ancestors of the given data instance as the at least one soft label; and iteratively re-training the capsule neural network by inputting one or more new data instances to generate at least one or more new soft labels for the one or more new data instances, the generating comprising generating a third matrix associated with the one or more new data instances and the one or more mutually orthogonal capsules to identify one or more ancestors of each of the one or more new data instances and to compute a similarity value for each of the identified one or more ancestors of each of the one or more new data instances, the one or more new soft labels further being associated with the one or more ancestors of the one or more new data instances; wherein the method is performed by at least one processor and at least one memory storing executable computer program instructions. 2 . The method of claim 1 , wherein the step of generating the at least one soft label for the given data instance further comprises selecting one or more similar samples from the probabilistic bag of images representation for each mutually orthogonal capsule, and assigning one or more probabilities associated with the one or more similar samples as a soft label for each mutually orthogonal capsule. 3 . The method of claim 1 , wherein each mutually orthogonal capsule represents a topic associated with the corresponding image data instance. 4 . The method of claim 1 , further comprising using each mutually orthogonal capsule as distilled data in a proxy data generation application. 5 . The method of claim 1 , further comprising using each mutually orthogonal capsule as distilled data in a neural network self-training application. 6 . The method of claim 1 , wherein identifying the one or more ancestors of the given data instance further comprises: normalizing the probabilistic bag of images representation; and selecting as the one or more ancestors a predetermined number of samples from the training data set having the highest computed similarity values relative to the given mutually orthogonal capsule. 7 . The method of claim 1 , wherein assigning the one or more labels comprises performing a weighted summation of labels associated with the identified one or more ancestors, wherein a weight for the label of each ancestor is based on its computed similarity value. 8 . The method of claim 1 , wherein each value in the probabilistic bag of images representation represents a similarity between a corresponding image data instance in the training data set and the given mutually orthogonal capsule. 9 . The method of claim 1 , wherein assigning the one or more labels comprises performing a weighted summation of labels associated with the identified one or more ancestors, wherein a weight for the label of each ancestor is based on its computed similarity value. 10 . The method of claim 1 , wherein each value in the probabilistic bag of images representation represents a similarity between a corresponding image data instance in the training data set and the given mutually orthogonal capsule. 11 . An apparatus, comprising: at least one processor and at least one memory storing computer program instructions wherein, when the at least one processor executes the computer program instructions, the apparatus is configured to: train a capsule neural network that incorporates one or more dynamic routing algorithms to estimate features of objects using a training data set comprising a plurality of image data instances to generate one or more mutually orthogonal capsules corresponding to one or more data instances of the training data set, each mutually orthogonal capsule comprising a vector representing an estimate of a local feature of a corresponding image data instance; extract the one or more mutually orthogonal capsules inside the trained capsule neural network; compute a probabilistic bag of images representation for each of the one or more mutually orthogonal capsules based on a relation between each mutually orthogonal capsule and the training data set, the computing comprising: generating a first matrix associated with the training data set and the one or more mutually orthogonal capsules; generating a second matrix associated with the one or more mutually orthogonal capsules and corresponding dimensions of the one or more mutually orthogonal capsules; and multiplying the first matrix and the second matrix; identify one or more ancestors of a given data instance based on the probabilistic bag of images representation of a given mutually orthogonal capsule associated with the given data instance; generate at least one soft label for the given data instance from the training data set, wherein the at least one soft label is generated by: comparing the probabilistic bag of images representation to the identified one or more ancestors of the given data instance to compute a similarity value for each of the identified one or more ancestors; and assigning one or more labels associated with one or more ancestors of the given data instance based on the computed similarity values such that the given data instance inherits the one or more labels associated with at least one of the identified one or more ancestors of the given data instance as the at least one soft label; and iteratively re-train the capsule neural network by inputting one or more new data instances to generate one or more new soft labels for the one or more new data instances, the generating comprising generating a third matrix associated with the one or more new data instances and the one or more mutually orthogonal capsules to identify one or more ancestors of each of the one or more new data instances and to com

Assignees

Inventors

Classifications

  • using metadata automatically derived from the content · CPC title

  • Clustering; Classification · CPC title

  • G06V10/82Primary

    using neural networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12548309B2 cover?
Label inheritance techniques are disclosed for soft label generation in an information processing system that uses machine learning. For example, a method generates at least one label for a given data instance from a training data set useable to train a machine learning-based model. The at least one label is generated by assigning one or more labels associated with one or more ancestors of the …
Who is the assignee on this patent?
Dell Products Lp
What technology area does this patent fall under?
Primary CPC classification G06V10/82. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 10 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).