Learning Mahalanobis Distance Metrics from Data

US2022405529A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2022405529-A1
Application numberUS-202117345730-A
CountryUS
Kind codeA1
Filing dateJun 11, 2021
Priority dateJun 11, 2021
Publication dateDec 22, 2022
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present invention provides techniques for learning Mahalanobis distance similarity metrics from data for individually fair machine learning models. In one aspect, a method for learning a fair Mahalanobis distance similarity metric includes: obtaining data with similarity annotations; selecting, based on the data obtained, a model for learning a Mahalanobis covariance matrix Σ; and learning the Mahalanobis covariance matrix Σ from the data using the model selected, wherein the Mahalanobis covariance matrix Σ fully defines the fair Mahalanobis distance similarity metric.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method for learning a fair Mahalanobis distance similarity metric, the method comprising: obtaining data with similarity annotations; selecting, based on the data obtained, a model for learning a Mahalanobis covariance matrix Σ; and learning the Mahalanobis covariance matrix Σ from the data using the model selected, wherein the Mahalanobis covariance matrix Σ fully defines the fair Mahalanobis distance similarity metric. 2 . The method of claim 1 , wherein the fair Mahalanobis distance similarity metric is of a form: d x ( x 1 ,x 2 ) (φ( x 1 )−φ)( x 2 )Σ(φ( x 1 )−φ( x 2 ))), wherein φ(x):X→R d is an embedding map and Σ∈S + d . 3 . The method of claim 1 , wherein the data obtained comprises groups of comparable samples. 4 . The method of claim 3 , wherein the model selected comprises a factor model. 5 . The method of claim 4 , wherein the factor model comprises: φ i =A * u i +B * υ i +∈ i , wherein φ i ∈R d is a learned representation of x i , u i ∈R K is a sensitive attribute of x i for a task at hand, υ i ∈R L is a relevant attribute of x i for the task at hand, and ∈ i is an error term. 6 . The method of claim 5 , further comprising: choosing an orthogonal complement of ran (A * ) for the Mahalanobis covariance matrix Σ, wherein ran (A * ) is a column space of A * ; and solving for ran (A * ). 7 . The method of claim 1 , wherein the data obtained comprises pairs of samples that are comparable, incomparable, or combinations thereof. 8 . The method of claim 7 , wherein the model selected comprises a binary response model. 9 . The method of claim 8 , wherein the data comprises human user feedback in a form of triplets {(x i1 , x i2 , y i )} i=1 n , where y i ∈{0,1} indicates whether a human user considers x i1 and x i2 comparable, wherein (x i1 , x i2 , y i ) satisfies the binary response model: y i ❘ x i ⁢ 1 , x i ⁢ 2 ∼ Ber ⁢ ( 2 ⁢ σ ⁡ ( - d i ) ) , d i = Δ  φ i ⁢ 1 - φ i ⁢ 2  Σ 0 2 ⁢ = ( φ i ⁢ 1 - φ i ⁢ 2 ) T ⁢ Σ 0 ( φ i ⁢ 1 - φ i ⁢ 2 ) ⁢ = 〈 ( φ i ⁢ 1 - φ i ⁢ 2 ) ⁢ ( φ i ⁢ 1 - φ i

Assignees

Inventors

Classifications

  • G06F18/285Primary

    Selection of pattern recognition techniques, e.g. of classifiers in a multi-classifier system · CPC title

  • Generating training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

  • Software arrangements specially adapted for pattern recognition, e.g. user interfaces or toolboxes therefor · CPC title

  • G06F18/22Primary

    Matching criteria, e.g. proximity measures · CPC title

  • Learning methods · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2022405529A1 cover?
The present invention provides techniques for learning Mahalanobis distance similarity metrics from data for individually fair machine learning models. In one aspect, a method for learning a fair Mahalanobis distance similarity metric includes: obtaining data with similarity annotations; selecting, based on the data obtained, a model for learning a Mahalanobis covariance matrix Σ; and learning …
Who is the assignee on this patent?
IBM, Univ Michigan
What technology area does this patent fall under?
Primary CPC classification G06F18/285. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Dec 22 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 10 related publications on this page (citations in our corpus or others sharing the same primary CPC).