What technology area does this patent fall under?

Primary CPC classification G06F16/215. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Feb 11 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Smart datastore selection for protection engines based on uncertainty quantification

US12222910B2 · US · B2

Patent metadata
Field	Value
Publication number	US-12222910-B2
Application number	US-202217652405-A
Country	US
Kind code	B2
Filing date	Feb 24, 2022
Priority date	Feb 24, 2022
Publication date	Feb 11, 2025
Grant date	Feb 11, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

One example method includes collecting respective data for each datastore in a group of datastores, clustering the data, based on the clustering, applying one of two different mutually exclusive labels to each of the datastores, applying an uncertainty quantifier to determine, for each datastore, a confidence level that the label assigned to that datastore is the correct label, ranking the datastores according to their respective label and confidence level, and presenting, to a user, a list that includes at least a datastore with a highest confidence level.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: for each datastore in a group of datastores: collecting data; clustering the data based on parameters; based on the clustering, applying one of two different mutually exclusive labels to the datastore based on a determination of whether or not a protection engine is to be deployed to protect the data; performing a credibility assessment with respect to the data, and assigning a credibility score to each dataset by determining whether or not the parameters are related to health of each dataset; applying an uncertainty quantifier to determine, for the datastore, a confidence level that the label assigned to the datastore is correct; ranking the datastores of the group of datastores according to respective label and confidence level; and presenting, to a user, a list that includes whichever of the datastores has a highest confidence level. 2. The method as recited in claim 1 , wherein one of the two different mutually exclusive labels is a HIGH label, and the other of the two different mutually exclusive labels is a LOW label, and the HIGH label indicates a healthy datastore, and the LOW label indicates an unhealthy datastore. 3. The method as recited in claim 1 , wherein the data that is collected for one of the datastores comprises data indicative of a health of that one datastore. 4. The method as recited in claim 1 , wherein the clustering clusters the data into two clusters that collectively contain all the datastores, and a first one of the two different mutually exclusive labels is assigned to datastores in a first cluster of the two clusters, and a second one of the two different mutually exclusive labels is assigned to datastores in a second cluster of the two clusters. 5. The method as recited in claim 1 , wherein the clustering is performed using HDBSCAN (Hierarchical Density-Based Spatial Clustering of Applications with Noise). 6. The method as recited in claim 1 , wherein the label on the datastore with the highest confidence level is HIGH. 7. The method as recited in claim 1 , further comprising receiving a user selection of a datastore with the label HIGH and a highest confidence level of any of the datastores. 8. The method as recited in claim 1 , further comprising receiving a user command to restore protected data to a datastore with the label HIGH and a highest confidence level of any of the datastores. 9. The method as recited in claim 1 , further comprising receiving a user command to deploy a data protection resource to a datastore with the label HIGH and a highest confidence level of any of the datastores. 10. The method as recited in claim 1 , wherein, when the credibility score is low indicating that the parameters are not related to the health of each datastore, the method further comprising: collecting additional data to obtain an updated credibility. 11. A non-transitory storage medium having stored therein instructions that are executable by one or more hardware processors to perform operations comprising: for each datastore in a group of datastores: collecting data; clustering the data based on parameters; based on the clustering, applying one of two different mutually exclusive labels to the datastore based on a determination of whether or not a protection engine is to be deployed to protect the data; performing a credibility assessment with respect to the data, and assigning a credibility score to each dataset by determining whether or not the parameters are related to health of each dataset; applying an uncertainty quantifier to determine, for the datastore, a confidence level that the label assigned to the datastore is correct; ranking the datastores of the group of datastores according to respective label and confidence level; and presenting, to a user, a list that includes whichever of the datastores has a highest confidence level. 12. The non-transitory storage medium as recited in claim 11 , wherein one of the two different mutually exclusive labels is a HIGH label, and the other of the two different mutually exclusive labels is a LOW label, and the HIGH label indicates a healthy datastore, and the LOW label indicates an unhealthy datastore. 13. The non-transitory storage medium as recited in claim 11 , wherein the data that is collected for one of the datastores comprises data indicative of a health of that one datastore. 14. The non-transitory storage medium as recited in claim 11 , wherein the clustering clusters the data into two clusters that collectively contain all the datastores, and a first one of the two different mutually exclusive labels is assigned to datastores in a first cluster of the two clusters, and a second one of the two different mutually exclusive labels is assigned to datastores in a second cluster of the two clusters. 15. The non-transitory storage medium as recited in claim 11 , wherein the clustering is performed using HDBSCAN (Hierarchical Density-Based Spatial Clustering of Applications with Noise). 16. The non-transitory storage medium as recited in claim 11 , wherein the label on the datastore with the highest confidence level is HIGH. 17. The non-transitory storage medium as recited in claim 11 , further comprising receiving a user selection of a datastore with the label HIGH and a highest confidence level of any of the datastores. 18. The non-transitory storage medium as recited in claim 11 , further comprising receiving a user command to restore protected data to a datastore with the label HIGH and a highest confidence level of any of the datastores. 19. The non-transitory storage medium as recited in claim 11 , further comprising receiving a user command to deploy a data protection resource to a datastore with the label HIGH and a highest confidence level of any of the datastores. 20. The non-transitory storage medium as recited in claim 11 , wherein, when the credibility score is low indicating that the parameters are not related to the health of each datastore, the operations further comprise: collecting additional data to obtain an updated credibility.

Assignees

Dell Products Lp

Inventors

Classifications

G06F16/2365
Ensuring data consistency and integrity · CPC title
G06F16/285
Clustering or classification · CPC title
G06F11/1458
Management of the backup or restore process · CPC title
G06F16/215Primary
Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors · CPC title
G06F11/1448Primary
Management of the data involved in backup or backup restore · CPC title

Patent family

Related publications grouped by family.

View patent family 87574187

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12222910B2 cover?: One example method includes collecting respective data for each datastore in a group of datastores, clustering the data, based on the clustering, applying one of two different mutually exclusive labels to each of the datastores, applying an uncertainty quantifier to determine, for each datastore, a confidence level that the label assigned to that datastore is the correct label, ranking the data…
Who is the assignee on this patent?: Dell Products Lp
What technology area does this patent fall under?: Primary CPC classification G06F16/215. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Feb 11 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).