Systems and methods for determining data criticality based on causal evaluation

US12505364B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12505364-B2
Application numberUS-202117236044-A
CountryUS
Kind codeB2
Filing dateApr 21, 2021
Priority dateApr 21, 2021
Publication dateDec 23, 2025
Grant dateDec 23, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques described herein relate to methods and systems for determining data asset criticality. Such techniques may include making a first determination that a plurality of data asset attributes are part of a causal attribute set; calculating a SHapeley Additive explanation (SHAP) value for each of the plurality of data asset attributes in the causal attribute set; and performing a weighted mean calculation using the SHAP values for each of the plurality of data asset attributes and a corresponding attribute value for each of the plurality of data asset attributes of a data asset to obtain a criticality score for the data asset.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method for determining data asset criticality, the method comprising: making a first determination that a plurality of data asset attributes are part of a causal attribute set; calculating a SHapley Additive explanation (SHAP) value for each of the plurality of data asset attributes in the causal attribute set; performing a weighted mean calculation using the SHAP values for each of the plurality of data asset attributes and a corresponding attribute value for each of the plurality of data asset attributes of a data asset to obtain a criticality score for the data asset; including the data asset in a ranking of data assets using the criticality score; and classifying the data asset into a criticality classification based on a criticality threshold; generating a backup recommendation set based on the criticality classification, wherein the backup recommendation set comprises backup actions comprising at least one selected from a group consisting of a key rotation frequency adjustment, a backup retention adjustment, a backup storage type adjustment, and a backup storage location adjustment; and adjusting an attribute of the data asset to include a backup action of the backup actions. 2 . The method of claim 1 , wherein making the first determination that the plurality of data asset attributes are part of the causal attribute set comprises: performing a data category value calculation using a historical attribute set of a plurality of data assets to obtain a plurality of data category values for a plurality of data assets; and performing a causal inference analysis using the historical attribute set, a causal graph, and the plurality of data category values to obtain the causal attribute set. 3 . The method of claim 2 , wherein the data category value calculation comprises a linear regression analysis. 4 . The method of claim 2 , wherein the causal graph is a directed acyclic graph (DAG). 5 . The method of claim 1 , wherein, before the weighted mean calculation, the SHAP values are scaled to values between zero and one. 6 . A non-transitory computer readable medium comprising computer readable program code, which when executed by a computer processor enables the computer processor to perform a method for determining data asset criticality, the method comprising: making a first determination that a plurality of data asset attributes are part of a causal attribute set; calculating a SHapley Additive explanation (SHAP) value for each of the plurality of data asset attributes in the causal attribute set; and performing a weighted mean calculation using the SHAP values for each of the plurality of data asset attributes and a corresponding attribute value for each of the plurality of data asset attributes of a data asset to obtain a criticality score for the data asset; including the data asset in a ranking of data assets using the criticality score; and classifying the data asset into a criticality classification based on a criticality threshold; generating a backup recommendation set based on the criticality classification, wherein the backup recommendation set comprises backup actions comprising at least one selected from a group consisting of a key rotation frequency adjustment, a backup retention adjustment, a backup storage type adjustment, and a backup storage location adjustment; and adjusting an attribute of the data asset to include a backup action of the backup actions. 7 . The non-transitory computer readable medium of claim 6 , wherein making the first determination that the plurality of data asset attributes are part of the causal attribute set comprises: performing a data category value calculation using a historical attribute set of a plurality of data assets to obtain a plurality of data category values for a plurality of data assets; and performing a causal inference analysis using the historical attribute set, a causal graph, and the plurality of data category values to obtain the causal attribute set. 8 . The non-transitory computer readable medium of claim 7 , wherein the data category value calculation comprises a linear regression analysis. 9 . The non-transitory computer readable medium of claim 7 , wherein the causal graph is a directed acyclic graph (DAG). 10 . The non-transitory computer readable medium of claim 6 , wherein, before the weighted mean calculation, the SHAP values are scaled to values between zero and one. 11 . A system for determining data asset criticality, the system comprising: a data valuator, comprising a processor, memory, and a storage device, operatively connected to a plurality of data assets, and configured to: make a first determination that a plurality of data asset attributes are part of a causal attribute set; calculate a SHapley Additive explanation (SHAP) value for each of the plurality of data asset attributes in the causal attribute set; and perform a weighted mean calculation using the SHAP values for each of the plurality of data asset attributes and a corresponding attribute value for each of the plurality of data asset attributes of a data asset to obtain a criticality score for the data asset; include the data asset in a ranking of data assets using the criticality score; and classify the data asset into a criticality classification based on a criticality threshold; generate a backup recommendation set based on the criticality classification, wherein the backup recommendation set comprises backup actions comprising at least one selected from a group consisting of a key rotation frequency adjustment, a backup retention adjustment, a backup storage type adjustment, and a backup storage location adjustment; and adjust an attribute of the data asset to include a backup action of the backup actions. 12 . The system of claim 11 , wherein, to make the first determination that the plurality of data asset attributes are part of the causal attribute set, the data valuator is further configured to: perform a data category value calculation using a historical attribute set of a plurality of data assets to obtain a plurality of data category values for a plurality of data assets; and perform a causal inference analysis using the historical attribute set, a causal graph, and the plurality of data category values to obtain the causal attribute set. 13 . The system of claim 12 , wherein the data category value calculation comprises a linear regression analysis, and the causal graph is a directed acyclic graph (DAG). 14 . The system of claim 11 , wherein, before the weighted mean calculation, the SHAP values are scaled to values between zero and one.

Assignees

Inventors

Classifications

  • based on approximation criteria, e.g. principal component analysis · CPC title

  • Generating training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

  • Inference or reasoning models · CPC title

  • Backup scheduling policy · CPC title

  • Machine learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12505364B2 cover?
Techniques described herein relate to methods and systems for determining data asset criticality. Such techniques may include making a first determination that a plurality of data asset attributes are part of a causal attribute set; calculating a SHapeley Additive explanation (SHAP) value for each of the plurality of data asset attributes in the causal attribute set; and performing a weighted m…
Who is the assignee on this patent?
Emc Ip Holding Co Llc
What technology area does this patent fall under?
Primary CPC classification G06N7/01. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 23 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 11 related publications on this page (citations in our corpus or others sharing the same primary CPC).