Interpretable clustering via multi-polytope machines

US2023267339A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2023267339-A1
Application numberUS-202217675202-A
CountryUS
Kind codeA1
Filing dateFeb 18, 2022
Priority dateFeb 18, 2022
Publication dateAug 24, 2023
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In unsupervised interpretable machine learning, one or more datasets having multiple features can be received. A machine can be trained to jointly cluster and interpret resulting clusters of the dataset by at least jointly clustering the dataset into clusters and generating hyperplanes in a multi-dimensional feature space of the dataset, where the hyperplanes separate pairs of the clusters, where a hyperplane separates a pair of clusters. Jointly clustering the dataset into clusters and generating hyperplanes can repeat until convergence, where the clustering in a subsequent iteration uses the generated hyperplanes from a previous iteration to optimize performance of the clustering. The hyperplanes can be adjusted to further improve the performance of the clustering. The clusters and interpretation of the clusters can be provided, where a cluster's interpretation is provided based on hyperplanes that construct a polytope containing the cluster.

First claim

Opening claim text (preview).

What is claimed is: 1 . A system for training a machine to perform unsupervised interpretable machine learning, comprising: at least one processor; and a memory device coupled with the at least one processor; the at least one processor configured to at least: receive a dataset having multiple features; train to jointly cluster and interpret resulting clusters of the dataset by at least: clustering the dataset into clusters; generating hyperplanes in a multi-dimensional feature space of the dataset, the hyperplanes separating pairs of the clusters, wherein a hyperplane separates a pair of clusters; repeating the clustering and generating until convergence, wherein the clustering in a subsequent iteration uses the generated hyperplanes from a previous iteration to optimize performance of the clustering; adjusting the hyperplanes to further improve the performance of the clustering; providing the clusters and interpretation of the clusters, wherein a cluster's interpretation is provided based on hyperplanes that construct a polytope containing the cluster. 2 . The system of claim 1 , wherein the clustering and the generating of the hyperplanes are performed as a single mixed integer non-linear programming that solves alternating minimization between the clustering and the hyperplane generating. 3 . The system of claim 1 , wherein the clustering is implemented using a representation aware k-means clustering that clusters with awareness of representation error using a clustering metric. 4 . The system of claim 1 , wherein the hyperplanes are generated based on configurable parameters that control sparsity of the hyperplanes for interpretability. 5 . The system of claim 1 , wherein the adjusting of the hyperplanes is performed based on a selected clustering metric. 6 . The system of claim 5 , wherein the selected clustering metric includes Silhouette index. 7 . The system of claim 5 , wherein the selected clustering metric includes Dunn index. 8 . A computer-implemented method of training a machine to perform unsupervised interpretable machine learning, comprising: receiving a dataset having multiple features; clustering the dataset into clusters; generating hyperplanes in a multi-dimensional feature space of the dataset, the hyperplanes separating pairs of the clusters, wherein a hyperplane separates a pair of clusters; repeating the clustering and generating until convergence, wherein the clustering in a subsequent iteration uses the generated hyperplanes from a previous iteration to optimize performance of the clustering; adjusting the hyperplanes to further improve the performance of the clustering; providing the clusters and interpretation of the clusters, wherein a cluster's interpretation is provided based on hyperplanes that construct a polytope containing the cluster, wherein the machine is trained to jointly cluster and interpret resulting clusters of the dataset. 9 . The computer-implemented method of claim 8 , wherein the clustering and the generating of the hyperplanes are performed as a single mixed integer non-linear programming that solves alternating minimization between the clustering and the hyperplane generating. 10 . The computer-implemented method of claim 8 , wherein the clustering is implemented using a representation aware k-means clustering that clusters with awareness of representation error using a clustering metric. 11 . The computer-implemented method of claim 8 , wherein the hyperplanes are generated based on configurable parameters that control sparsity of the hyperplanes for interpretability. 12 . The computer-implemented method of claim 8 , wherein the adjusting of the hyperplanes is performed based on a selected clustering metric. 13 . The computer-implemented method of claim 12 , wherein the selected clustering metric includes Silhouette index. 14 . The computer-implemented method of claim 12 , wherein the selected clustering metric includes Dunn index. 15 . A computer program product comprising a computer readable storage medium having program instructions embodied therewith, the program instructions readable by a device to cause the device to: receive a dataset having multiple features; train to jointly cluster and interpret resulting clusters of the dataset by at least: cluster the dataset into clusters; generate hyperplanes in a multi-dimensional feature space of the dataset, the hyperplanes separating pairs of the clusters, wherein a hyperplane separates a pair of clusters; repeat clustering and generating until convergence, wherein the clustering in a subsequent iteration uses the generated hyperplanes from a previous iteration to optimize performance of the clustering; adjust the hyperplanes to further improve the performance of the clustering; provide the clusters and interpretation of the clusters, wherein a cluster's interpretation is provided based on hyperplanes that construct a polytope containing the cluster. 16 . The computer program product of claim 15 , wherein the clustering and the generating of the hyperplanes are performed as a single mixed integer non-linear programming that solves alternating minimization between the clustering and the hyperplane generating. 17 . The computer program product of claim 15 , wherein the clustering is implemented using a representation aware k-means clustering that clusters with awareness of representation error using a clustering metric. 18 . The computer program product of claim 15 , wherein the hyperplanes are generated based on configurable parameters that control sparsity of the hyperplanes for interpretability. 19 . The computer program product of claim 15 , wherein the adjusting of the hyperplanes is performed based on a selected clustering metric. 20 . The computer program product of claim 19 , wherein the selected clustering metric includes at least one selected from the group of Silhouette index and Dunn index.

Assignees

Inventors

Classifications

  • G06N20/10Primary

    using kernel methods, e.g. support vector machines [SVM] · CPC title

  • G06N5/022Primary

    Knowledge engineering; Knowledge acquisition · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2023267339A1 cover?
In unsupervised interpretable machine learning, one or more datasets having multiple features can be received. A machine can be trained to jointly cluster and interpret resulting clusters of the dataset by at least jointly clustering the dataset into clusters and generating hyperplanes in a multi-dimensional feature space of the dataset, where the hyperplanes separate pairs of the clusters, whe…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06N20/10. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Aug 24 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).