Visualize data and significant records based on relationship with the model

US12293438B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12293438-B2
Application numberUS-202218064959-A
CountryUS
Kind codeB2
Filing dateDec 13, 2022
Priority dateDec 13, 2022
Publication dateMay 6, 2025
Grant dateMay 6, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

In an approach for post-modeling data visualization and analysis, a processor presents a first visualization of a training dataset in a first plot. Responsive to receiving a selection of a data group of the training dataset to analyze, a processor identifies three or fewer key model features of the data group of the training dataset. A processor ascertains a representative record of each key model feature of the three or fewer key model features using a Local Interpretable Model-Agnostic Explanation technique. A processor presents a second visualization of the three or fewer key model features and the representative record of each key model feature in a second plot.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method comprising: presenting, by one or more processors, a first visualization of a training dataset in a first plot; responsive to receiving a selection of a data group of the training dataset to analyze, identifying, by the one or more processors, three or fewer key model features of the data group of the training dataset; ascertaining, by the one or more processors, a representative record of each key model feature of the three or fewer key model features using a Local Interpretable Model-Agnostic Explanation technique; presenting, by the one or more processors, a second visualization of the three or fewer key model features and the representative record of each key model feature in a second plot; correcting or completing, by the one or more processors, the training dataset, wherein the training dataset is either incorrect or incomplete; prior to presenting the first visualization of the training dataset in the first plot, gathering, by one or more processors, the training dataset from one or more sources; determining, by one or more processors, a degree of importance of the one or more key model features; ranking, by one or more processors, the one or more key model features according to the degree of importance; and selecting, by one or more processors, the three or fewer key model features based on a set of criteria, wherein the set of criteria is selected from a group consisting of: a degree of accuracy of each key model feature of the training dataset and a pre-set configuration, further comprises: selecting, by one or more processors, two key model features of the three or fewer key model features selected; and condensing, by one or more processors, a key model feature of the three or fewer key model features not selected into a linear combination using a Principle Component Analysis to produce a three-dimension condensed data. 2. The computer-implemented method of claim 1 , further comprising: subsequent to selecting the three or fewer key model features based on the set of criteria, selecting, by one more processors, a first key model feature from the three or fewer key model features selected; assigning, by one or more processors, a first value to the first key model feature; and assigning, by one or more processors, a second value to a second key model feature and a third key model feature of the three or fewer key model features selected. 3. The computer-implemented method of claim 2 , further comprising: calculating, by one or more processors, the degree of accuracy of the second key model feature and the third key model feature; determining, by one or more processors, that the degree of accuracy of the second key model feature and the third key model feature exceeds a first threshold; and responsive to determining the degree of accuracy exceeds the first threshold, designating, by one or more processors, the key model feature as a valid feature. 4. The computer-implemented method of claim 2 , further comprising: calculating, by one or more processors, the degree of accuracy of the second key model feature; determining, by one or more processors, that the degree of accuracy of the second key model feature does not exceed the first threshold; and responsive to determining the degree of accuracy of the second key model feature does not exceed the first threshold, adding, by one or more processors, the second key model feature to a list of candidates. 5. The computer-implemented method of claim 1 , wherein selecting the three or fewer key model features based on the set of criteria further comprises: clustering, by one or more processors, the training dataset with the three or fewer key model features selected; comparing, by one or more processors, a first cluster with a baseline; and selecting, by one or more processors, the three or fewer key model features of the first cluster closest within a second threshold to the baseline. 6. The computer-implemented method of claim 1 , wherein ascertaining the representative record of each key model feature of the three or fewer key model features using a Local Interpretable Model-Agnostic Explanation technique further comprises: calculating, by one or more processors, a center of a selected cluster; comparing, by one or more processors, the representative record of each key model feature to the calculated center of the selected cluster; selecting, by one or more processors, the representative record of the key model feature closest within a fourth threshold to the calculated center of the selected cluster; and pairing, by one or more processors, the representative record of each key model feature. 7. A computer program product comprising: one or more computer readable storage media and program instructions stored on the one or more computer readable storage media, the program instructions comprising: program instructions to present a first visualization of a training dataset in a first plot; responsive to receiving a selection of a data group of the training dataset to analyze, program instructions to identify three or fewer key model features of the data group of the training dataset; program instructions to ascertain a representative record of each key model feature of the three or fewer key model features using a Local Interpretable Model-Agnostic Explanation technique, further comprises: program instructions to calculate a center of a selected cluster; program instructions to compare the representative record of each key model feature to the calculated center of the selected cluster; program instructions to select the representative record of the key model feature closest within a fourth threshold to the calculated center of the selected cluster; and program instructions to pair the representative record of each key model feature; program instructions to present a second visualization of the three or fewer key model features and the representative record of each key model feature in a second plot; and program instructions to correct or complete the training dataset, wherein the training dataset is either incorrect or incomplete. 8. The computer program product of claim 7 , further comprising: prior to presenting the first visualization of the training dataset in the first plot, program instructions to gather the training dataset from one or more sources; program instructions to identify one or more key model features of the training dataset; program instructions to determine a degree of importance of the one or more key model features; program instructions to rank the one or more key model features according to the degree of importance; and program instructions to select the three or fewer key model features based on a set of criteria, wherein the set of criteria is selected from a group consisting of: a degree of accuracy of each key model feature of the training dataset and a pre-set configuration. 9. The computer program product of claim 8 , further comprising: subsequent to selecting the three or fewer key model features based on the set of criteria, program instructions to select a first key model feature from the three or fewer key model features selected; program instructions to assign a first value to the first key model feature; and program instructions to assign a second value to a second key model feature and a third key model feature of the three or fewer key model features selected. 10. The computer program product of claim 9 , further comprising: program instructions to calculate the degree of accuracy of the second key model feature and the third key model feature; program instructions to determine that the degree of accuracy of

Assignees

Inventors

Classifications

  • G06T11/26Primary

    Drawing of charts or graphs · CPC title

  • G06N20/00Primary

    Machine learning · CPC title

  • G06T11/206Primary

    Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12293438B2 cover?
In an approach for post-modeling data visualization and analysis, a processor presents a first visualization of a training dataset in a first plot. Responsive to receiving a selection of a data group of the training dataset to analyze, a processor identifies three or fewer key model features of the data group of the training dataset. A processor ascertains a representative record of each key mo…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06T11/26. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 06 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).