Method for outlier robust subgroup inference via clustering in the gradient space

US2024362534A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2024362534-A1
Application numberUS-202318309755-A
CountryUS
Kind codeA1
Filing dateApr 28, 2023
Priority dateApr 28, 2023
Publication dateOct 31, 2024
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computer-implemented method for identifying relevant subgroups, which are relevant for training a subgroup-robust classifier, in a training dataset associated with a machine learning model includes receiving a classification dataset wherein subgroups are unlabeled. For each data point in the classification dataset, the method uses gradient space partitioning (GraSP) to identify a gradient representation of each data point by extracting an associated gradient of a logistic regression classification loss with respect to weights of a logistic regression. The gradient representations are clustered to provide estimated subgroup labels the cluster assignments are output as the estimated subgroup labels.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computer-implemented method for identifying relevant subgroups in a training dataset associated with a machine learning model, comprising: receiving a classification dataset wherein subgroups are unlabeled; for each data point in the classification dataset, using gradient space partitioning (GraSP) to identify a gradient representation of each data point by extracting an associated gradient of a logistic regression classification loss with respect to weights of a logistic regression; clustering the gradient representations to provide estimated subgroup labels; and outputting cluster assignments as the estimated subgroup labels. 2 . The computer-implemented method of claim 1 , further comprising using an outlier-robust clustering algorithm to perform the clustering of the gradient representations. 3 . The computer-implemented method of claim 1 , wherein classes are labeled in the classification dataset. 4 . The computer-implemented method of claim 1 , further comprising learning group annotations and identifying outliers of the classification dataset. 5 . The computer-implemented method of claim 1 , further comprising training a robust classifier using the estimated subgroup labels. 6 . The computer-implemented method of claim 5 , further comprising applying distributionally robust optimization (DRO) to train the robust classifier. 7 . The computer-implemented method of claim 1 , further comprising, in response to receiving the classification dataset, applying a non-robust neural network classifier, wherein a last layer representation of the non-robust neural network classifier is extracted as dimension-reduced features. 8 . The computer-implemented method of claim 7 , wherein the gradient space partitioning is performed on the last-layer representation. 9 . A computer-implemented method for identifying relevant subgroups, in a presence of outliers, for training a classifier to be robust to the identified subgroups, comprising: receiving a classification dataset wherein subgroups are unlabeled; for each data point in the classification dataset, using gradient space partitioning (GraSP) to identify a gradient representation of each data point by extracting an associated gradient of a logistic regression classification loss with respect to weights of a logistic regression, wherein the GraSP further learns group annotations and identify outliers; clustering the gradient representations to estimate subgroup labels, wherein clustering further comprises using an outlier-robust clustering algorithm to cluster the gradient representations; outputting cluster assignments as the estimated subgroup labels; and training a robust classifier using the estimated subgroup labels. 10 . The computer-implemented method of claim 9 , wherein classes are labeled in the classification dataset. 11 . The computer-implemented method of claim 9 , further comprising applying distributionally robust optimization (DRO) to train the robust classifier. 12 . The computer-implemented method of claim 9 , further comprising, in response to receiving the classification dataset, applying a non-robust neural network classifier, wherein a last layer representation of the non-robust neural network classifier is extracted as dimension-reduced features. 13 . The computer-implemented method of claim 12 , wherein the gradient space partitioning is performed on the last-layer representation. 14 . A non-transitory computer readable storage medium tangibly embodying a computer readable program code having computer readable instructions that, when executed, causes a computer device to carry out a method of identifying relevant subgroups in a training dataset associated with a machine learning model, the method comprising: receiving a classification dataset wherein subgroups are unlabeled; for each data point in the classification dataset, using gradient space partitioning (GraSP) to identify a gradient representation of each data point by extracting an associated gradient of a logistic regression classification loss with respect to weights of a logistic regression; clustering the gradient representations to provide estimated subgroup labels; and outputting cluster assignments as the estimated subgroup labels. 15 . The non-transitory computer readable storage medium of claim 14 , the method further comprising using an outlier-robust clustering algorithm to perform the clustering of the gradient representations. 16 . The non-transitory computer readable storage medium of claim 14 , the method further comprising learning group annotations and identifying outliers of the classification dataset. 17 . The non-transitory computer readable storage medium of claim 14 , the method further comprising training a robust classifier using the estimated subgroup labels. 18 . The non-transitory computer readable storage medium of claim 17 , the method further comprising applying distributionally robust optimization (DRO) to train the robust classifier. 19 . The non-transitory computer readable storage medium of claim 14 , the method further comprising, in response to receiving the classification dataset, applying a non-robust neural network classifier, wherein a last layer representation of the non-robust neural network classifier is extracted as dimension-reduced features. 20 . The non-transitory computer readable storage medium of claim 19 , wherein the gradient space partitioning is performed on the last-layer representation.

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2024362534A1 cover?
A computer-implemented method for identifying relevant subgroups, which are relevant for training a subgroup-robust classifier, in a training dataset associated with a machine learning model includes receiving a classification dataset wherein subgroups are unlabeled. For each data point in the classification dataset, the method uses gradient space partitioning (GraSP) to identify a gradient rep…
Who is the assignee on this patent?
IBM, Massachusetts Inst Technology
What technology area does this patent fall under?
Primary CPC classification G06N20/00. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Oct 31 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).