What technology area does this patent fall under?

Primary CPC classification G06N20/00. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Sep 13 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Enhancing fairness in transfer learning for machine learning models with missing protected attributes in source or target domains

US11443236B2 · US · B2

Patent metadata
Field	Value
Publication number	US-11443236-B2
Application number	US-201916692974-A
Country	US
Kind code	B2
Filing date	Nov 22, 2019
Priority date	Nov 22, 2019
Publication date	Sep 13, 2022
Grant date	Sep 13, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method of utilizing a computing device to correct source data used in machine learning includes receiving, by the computing device, first data. The computing device corrects the source data via an application of a covariate shift to the source data based upon the first data where the covariate shift re-weighs the source data.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of utilizing a computing device to correct source data used in machine learning, the method comprising: receiving, by a computing device, first data; providing, by the computing device, transfer learning processing in absence of one or more protected attributes using a covariate shift combined with re-weighing to reduce a difference in group-specific prevalences for the source data; wherein the source data is partially labeled data, the first data is target data that includes the one or more protected attributes, and the transfer learning processing uses a target-fair covariate shift: wherein the target-fair covariate shift uses weights that are chosen to minimize a linear combination of a fairness loss with a classification loss; and wherein the fairness loss is evaluated on the target data where the one or more protected attributes are available. 2. The method of claim 1 , further comprising: training, by the computing device one or more machine learning models using the re-weighed source data. 3. The method of claim 1 , wherein the source data is fully labeled data having the one or more protected attributes, the first data is target data, and the transfer learning processing uses a prevalence-constrained covariate shift. 4. The method of claim 3 , wherein the prevalence-constrained covariate shift uses learned weights based on a difference as compared to covariate shift weights subject to constraints on weighted prevalences. 5. A computer program product for correcting source data used in machine learning, the computer program product comprising a computer readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause the processor to: receive, by the processor, first data; provide, by the processor, transfer learning processing in absence of one or more protected attributes using a covariate shift combined with re-weighing to reduce a difference in group-specific prevalences for the source data; wherein the source data is partially labeled data, the first data is target data that includes the one or more protected attributes, and the transfer learning processing uses a target-fair covariate shift; wherein the target-fair covariate shift uses weights that are chosen to minimize a linear combination of a fairness loss with a classification loss; and wherein the fairness loss is evaluated on the target data where the one or more protected attributes are available. 6. The computer program product of claim 5 , wherein the program instructions executable by the processor further causes the processor to: train, by the processor, one or more machine learning models using the re-weighed source data. 7. The computer program product of claim 5 , wherein the source data is fully labeled data having the one or more protected attributes, the first data is target data, and the transfer learning processing uses a prevalence-constrained covariate shift. 8. The computer program product of claim 7 , wherein the prevalence-constrained covariate shift uses learned weights that are chosen based on a difference as compared to covariate shift weights subject to constraints on weighted prevalences. 9. An apparatus comprising: a memory configured to store instructions; and a processor configured to execute the instructions to: receive first data; provide transfer learning processing in absence of one or more protected attributes using a covariate shift combined with re-weighing to reduce a difference in group-specific prevalences for a source data; wherein the source data is partially labeled data, the first data is target data that includes the one or more protected attributes, and the transfer learning processing uses a target-fair covariate shift; and wherein the target-fair covariate shift uses weights that are chosen to minimize a linear combination of a fairness loss with a classification loss, and the fairness loss is evaluated on the target data where the one or more protected attributes are available. 10. The apparatus of claim 9 , wherein the processor is configured to further execute the instructions to: train one or more machine learning models using the re-weighed source data. 11. The apparatus of claim 9 , wherein the source data is fully labeled data having the one or more protected attributes, the first data is target data, and the transfer learning processing uses a prevalence-constrained covariate shift. 12. The apparatus of claim 11 , wherein the prevalence-constrained covariate shift uses learned weights that are chosen based on a difference as compared to covariate shift weights subject to constraints on weighted prevalences.

Assignees

Inventors

Classifications

G06N20/00Primary
Machine learning · CPC title

Patent family

Related publications grouped by family.

View patent family 75971307

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11443236B2 cover?: A method of utilizing a computing device to correct source data used in machine learning includes receiving, by the computing device, first data. The computing device corrects the source data via an application of a covariate shift to the source data based upon the first data where the covariate shift re-weighs the source data.
Who is the assignee on this patent?: IBM
What technology area does this patent fall under?: Primary CPC classification G06N20/00. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Sep 13 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

Methods and apparatus to perform malware detection using a generative adversarial network

Prediction model construction device, prediction model construction method and prediction model construction program recording medium

Machine learning predictive labeling system

Distributed event prediction and machine learning object recognition system

Training systems and methods for sequence taggers

Generative machine learning systems for drug design

Frequently asked questions