Data sensitivity classification using content-based datasets

US12423457B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12423457-B2
Application numberUS-202217975576-A
CountryUS
Kind codeB2
Filing dateOct 27, 2022
Priority dateOct 27, 2022
Publication dateSep 23, 2025
Grant dateSep 23, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Providing content-based sensitivity classification to content data in a data processing system, by defining sensitivity designations for data objects stored in the system, and creating datasets by grouping metadata for data objects that are intended to classified with a same sensitivity designation, wherein each dataset spans multiple storage devices of different storage types, and wherein each dataset defines a single data sensitivity unit for the data objects referenced by a respective dataset. A sensitivity classifier component tags each dataset with a sensitivity tag to specify a protection or control operation on the data objects referenced by the respective dataset, and a processing component then processes data objects for each tagged dataset in accordance with the specified protection or control operation.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method of providing content-based sensitivity classification to content data in a data processing system, comprising: defining sensitivity designations for data objects stored in the system; storing metadata of content data stored in disparate storage environments in a scanning database; creating a protection policy comprising a query representing data to be protected by the protection policy: executing the query comprising metadata selectors as dataset tags for matching against the scanning database to generate a dataset comprising a logical collection of metadata for unstructured data objects grouped together by one or more filters, and that represents data to be processed similarly with respect to a protection or control operation defined by the protection policy, and further wherein the data objects are classified with a same sensitivity designation, wherein the dataset spans multiple storage devices of different storage types, and defines a single data sensitivity unit for the data objects referenced by the dataset; tagging each dataset with a sensitivity tag to specify a protection or control operations on the data objects referenced by the dataset; and processing data objects for each tagged dataset in accordance with the specified protection or control operation, wherein the protection operation comprises backing up data from operating memory to certain specified storage devices and locations, and wherein the control operation comprises at least one of: defining access permissions to the data objects by users of the system, changing a read/write status, and enforcing security measures on the data objects through encryption. 2. The method of claim 1 wherein the dataset comprises a set of metadata organized in a table, and wherein the sensitivity tag comprises an alphanumeric label appended to the table. 3. The method of claim 1 further comprising: processing a new data object received by the system; and assigning, to an assigned dataset, the new data object based on an associated sensitivity designation of the new data object. 4. The method of claim 1 wherein the sensitivity designation characterizes a dataset as sensitive or not sensitive with respect to disclosure of or access to data objects referenced by the dataset. 5. The method of claim 4 wherein the sensitivity designation comprises a classification including public, internal, proprietary, secret, and top secret. 6. The method of claim 5 wherein the sensitivity designation is derived from a characteristic of the data objects reference by the dataset including at least one of: inherent character due to data type or source, or explicit attributes including secrecy markings. 7. The method of claim 1 wherein the dataset evolves over time through one or more milestone events that changes a sensitivity classification of the dataset due to change of context of the data objects, the method further comprising re-classifying a dataset containing data objects that change an initial sensitivity classification upon passing a milestone event. 8. The method of claim 7 wherein the referenced data objects are subject to different protection operations and different control operations between each milestone event. 9. A system for providing content-based sensitivity classification to content data in a data processing system, comprising: a sensitivity classifier component defining sensitivity designations for data objects stored in the system; a scanning database storing metadata of content data stored in disparate storage environments in a scanning database; a hardware-based dataset management component creating a protection policy comprising a query representing data to be protected by the protection policy and executing the query comprising metadata selectors as dataset tags for matching against the scanning database to generate a dataset comprising a logical collection of metadata for unstructured data objects grouped together by one or more filters, and that represents data to be processed similarly with respect to a protection or control operation defined by the protection policy, and further wherein the data objects are classified with a same sensitivity designation, wherein the dataset spans multiple storage devices of different storage types, and defines a single data sensitivity unit for the data objects referenced by the dataset, and tagging each dataset with a sensitivity tag to specify a protection or control operation on the data objects referenced by the dataset; and a data processing component processing data objects for each tagged dataset in accordance with the specified protection or control operation, wherein the protection operation comprises backing up data from operating memory to certain specified storage devices and locations, and wherein the control operation comprises at least one of: defining access permissions to the data objects by users of the system, changing a read/write status, and enforcing security measures on the data objects through encryption. 10. The system of claim 9 wherein the dataset comprises a set of metadata organized in a table, and wherein the sensitivity tag comprises an alphanumeric label appended to the table. 11. The system of claim 9 wherein the data processing component processes a new data object received by the system, and assigns, to an assigned dataset, the new data object based on an associated sensitivity designation of the new data object. 12. The system of claim 9 wherein the sensitivity designation characterizes a dataset as sensitive or not sensitive with respect to disclosure of or access to data objects referenced by the dataset, and wherein the sensitivity designation comprises a classification including public, internal, proprietary, secret, and top secret. 13. The system of claim 12 wherein the sensitivity designation is derived from a characteristic of the data objects reference by the dataset including at least one of: inherent character due to data type or source, or explicit attributes including secrecy markings. 14. The system of claim 9 wherein the dataset evolves over time through one or more milestone events that changes a sensitivity classification of the dataset due to change of context of the data objects, the method further comprising re-classifying a dataset containing data objects that change an initial sensitivity classification upon passing a milestone event. 15. The system of claim 14 wherein the referenced data objects are subject to different protection operations and different control operations between each milestone event. 16. A tangible computer program product comprising a non-transitory computer-readable medium having a computer-readable program code embodied therein, the computer-readable program code adapted to be executed by one or more processors to implement a method of providing content-based sensitivity classification to content data in a data processing system, comprising: defining sensitivity designations for data objects stored in the system; storing metadata of content data stored in disparate storage environments in a scanning database; creating a protection policy comprising a query representing data to be protected by the protection policy: executing the query comprising metadata selectors as dataset tags for matching against the scanning database to generate a dataset comprising a logical collection of metadata for unstructured data objects grouped together by one or more filters, and that represents data to be processed similarly with respect to a protection or control operation def

Assignees

Inventors

Classifications

  • Clustering or classification · CPC title

  • where protection concerns the structure of data, e.g. records, types, queries · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12423457B2 cover?
Providing content-based sensitivity classification to content data in a data processing system, by defining sensitivity designations for data objects stored in the system, and creating datasets by grouping metadata for data objects that are intended to classified with a same sensitivity designation, wherein each dataset spans multiple storage devices of different storage types, and wherein each…
Who is the assignee on this patent?
Dell Products Lp
What technology area does this patent fall under?
Primary CPC classification G06F21/6227. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 23 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).