Data sensitivity classification using content-based datasets

US2024143822A1 · US · A1

Patent metadata
Field               Value
Publication number  US-2024143822-A1
Application number  US-202217975576-A
Country             US
Kind code           A1
Filing date         Oct 27, 2022
Priority date       Oct 27, 2022
Publication date    May 2, 2024
Grant date          (none listed)

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Providing content-based sensitivity classification to content data in a data processing system, by defining sensitivity designations for data objects stored in the system, and creating datasets by grouping metadata for data objects that are intended to be classified with a same sensitivity designation, wherein each dataset spans multiple storage devices of different storage types, and wherein each dataset defines a single data sensitivity unit for the data objects referenced by a respective dataset. A sensitivity classifier component tags each dataset with a sensitivity tag to specify a protection or control operation on the data objects referenced by the respective dataset, and a processing component then processes data objects for each tagged dataset in accordance with the specified protection or control operation.
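The flow in the abstract (define designations, group metadata into datasets, tag each dataset, process by tag) can be illustrated with a minimal Python sketch. This is not the patented implementation: the class names, the `S<n>` tag format, the device and storage-type strings, and the `OPERATIONS` policy table are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from enum import Enum


class Sensitivity(Enum):
    # Range named in the claims; the numeric values are an assumption.
    PUBLIC = 0
    INTERNAL = 1
    PROPRIETARY = 2
    SECRET = 3
    TOP_SECRET = 4


@dataclass(frozen=True)
class ObjectMetadata:
    object_id: str
    device: str          # hypothetical device name
    storage_type: str    # hypothetical type, e.g. "file", "object", "tape"
    designation: Sensitivity


@dataclass
class Dataset:
    designation: Sensitivity
    members: list = field(default_factory=list)  # metadata only, not the data itself
    tag: str = ""


# Hypothetical tag-to-operation policy; the patent does not fix one.
OPERATIONS = {
    "S0": "backup",
    "S1": "backup",
    "S2": "backup+restrict_access",
    "S3": "backup+encrypt",
    "S4": "backup+encrypt",
}


def create_datasets(metadata):
    """Group metadata by designation, regardless of where each object is stored."""
    datasets = {}
    for item in metadata:
        dataset = datasets.setdefault(item.designation, Dataset(item.designation))
        dataset.members.append(item)
    return datasets


def tag_datasets(datasets):
    """Attach one alphanumeric sensitivity tag per dataset."""
    for designation, dataset in datasets.items():
        dataset.tag = f"S{designation.value}"


def process(datasets):
    """Return (object_id, operation) pairs chosen by each dataset's tag."""
    return [
        (member.object_id, OPERATIONS[dataset.tag])
        for dataset in datasets.values()
        for member in dataset.members
    ]


catalog = [
    ObjectMetadata("a", "nas-01", "file", Sensitivity.SECRET),
    ObjectMetadata("b", "bucket-7", "object", Sensitivity.SECRET),
    ObjectMetadata("c", "vault-2", "tape", Sensitivity.PUBLIC),
]
datasets = create_datasets(catalog)
tag_datasets(datasets)
actions = process(datasets)
```

In this sketch the `SECRET` dataset references objects on two devices of different storage types, which mirrors the idea of a dataset acting as a single sensitivity unit across storage; a policy change then touches one tag rather than each object.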

First claim

Opening claim text (preview).

What is claimed is:

1. A computer-implemented method of providing content-based sensitivity classification to content data in a data processing system, comprising: defining sensitivity designations for data objects stored in the system; creating datasets by grouping metadata for data objects that will result in the data objects being classified with a same sensitivity designation, wherein each dataset spans multiple storage devices of different storage types, and wherein each dataset defines a single data sensitivity unit for the data objects referenced by a respective dataset; tagging each dataset with a sensitivity tag to specify a protection or control operation on the data objects referenced by the respective dataset; and processing data objects for each tagged dataset in accordance with the specified protection or control operation.

2. The method of claim 1 wherein the dataset comprises a set of metadata organized in a table, and wherein the sensitivity tag comprises an alphanumeric label appended to the table.

3. The method of claim 1 further comprising: processing a new data object received by the system; and assigning, to an assigned dataset, the new data object based on an associated sensitivity designation of the new data object.

4. The method of claim 1 wherein the sensitivity designation characterizes a dataset as sensitive or not sensitive with respect to disclosure of or access to data objects referenced by the dataset.

5. The method of claim 4 wherein the sensitivity designation comprises a classification along a range including public, internal, proprietary, secret, and top secret.

6. The method of claim 5 wherein the sensitivity designation is derived from a characteristic of the data objects referenced by the dataset including at least one of: inherent character due to data type or source, or explicit attributes including secrecy markings.

7. The method of claim 1 wherein the protection operation comprises at least one of: backing up data from operating memory to certain specified storage devices and locations, and wherein the control operation comprises at least one of: defining access permissions to the data objects by users of the system, changing a read/write status, or enforcing security measures on the data objects through encryption.

8. The method of claim 1 wherein the dataset evolves over time through one or more milestone events that changes a sensitivity classification of the dataset due to change of context of the data objects, the method further comprising re-classifying a dataset containing data objects that change an initial sensitivity classification upon passing a milestone event.

9. The method of claim 8 wherein the referenced data objects are subject to different protection operations and different control operations between each milestone event.

10. A system for providing content-based sensitivity classification to content data in a data processing system, comprising: a sensitivity classifier component defining sensitivity designations for data objects stored in the system; a hardware-based dataset management component creating datasets by grouping metadata for data objects that will result in the data objects being classified with a same sensitivity designation, wherein each dataset spans multiple storage devices of different storage types, and wherein each dataset defines a single data sensitivity unit for the data objects referenced by a respective dataset, and tagging each dataset with a sensitivity tag to specify a protection or control operation on the data objects referenced by the respective dataset; and a data processing component processing data objects for each tagged dataset in accordance with the specified protection or control operation.

11. The system of claim 10 wherein the dataset comprises a set of metadata organized in a table, and wherein the sensitivity tag comprises an alphanumeric label appended to the table.

12. The system of claim 10 wherein the data processing component processes a new data object received by the system, and assigns, to an assigned dataset, the new data object based on an associated sensitivity designation of the new data object.

13. The system of claim 10 wherein the sensitivity designation characterizes a dataset as sensitive or not sensitive with respect to disclosure of or access to data objects referenced by the dataset, and wherein the sensitivity designation comprises a classification along a range including public, internal, proprietary, secret, and top secret.

14. The system of claim 13 wherein the sensitivity designation is derived from a characteristic of the data objects referenced by the dataset including at least one of: inherent character due to data type or source, or explicit attributes including secrecy markings.

15. The system of claim 10 wherein the protection operation comprises at least one of: backing up data from operating memory to certain specified storage devices and locations, and wherein the control operation comprises at least one of: defining access permissions to the data objects by users of the system, changing a read/write status, or enforcing security measures on the data objects through encryption.

16. The system of claim 10 wherein the dataset evolves over time through one or more milestone events that changes a sensitivity classification of the dataset due to change of context of the data objects, the method further comprising re-classifying a dataset containing data objects that change an initial sensitivity classification upon passing a milestone event.

17. The system of claim 16 wherein the referenced data objects are subject to different protection operations and different control operations between each milestone event.

18. A tangible computer program product having stored thereon, programming code that, when executed by a processor, causes the processor to perform a method of providing content-based sensitivity classification to content data in a data processing system, comprising: defining sensitivity designations for data objects stored in the system; creating datasets by grouping metadata for data objects that will result in the data objects being classified with a same sensitivity designation, wherein each dataset spans multiple storage devices of different storage types, and wherein each dataset defines a single data sensitivity unit for the data objects referenced by a respective dataset; tagging each dataset with a sensitivity tag to specify a protection or control operation on the data objects referenced by the respective dataset; and processing data objects for each tagged dataset in accordance with the specified protection or control operation.

19. The computer program product of claim 18 wherein the sensitivity designation characterizes a dataset as sensitive or not sensitive with respect to disclosure of or access to data objects referenced by the dataset, and further wherein the sensitivity designation comprises a classification along a range including public, internal, proprietary, secret, and top secret.

20. The computer program product of claim 19 wherein the dataset comprises a set of metadata organized in a table, and wherein the sensitivity tag comprises an alphanumeric label appended to the table.
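The milestone-based re-classification in claims 8 and 9 may be easier to follow with a small example. The Python sketch below is one possible reading, not the claimed implementation: the `Dataset` fields, the `S<n>` tag format, the `POLICY` table, and the `earnings_released` event are hypothetical names chosen for illustration. Only the five-level range comes from the claims.

```python
from dataclasses import dataclass, field

# Ordered range named in claims 5, 13, and 19.
LEVELS = ["public", "internal", "proprietary", "secret", "top secret"]

# Hypothetical per-level control operations (not taken from the patent).
POLICY = {
    "public": {"encrypt": False, "read_only": False},
    "internal": {"encrypt": False, "read_only": False},
    "proprietary": {"encrypt": True, "read_only": False},
    "secret": {"encrypt": True, "read_only": True},
    "top secret": {"encrypt": True, "read_only": True},
}


@dataclass
class Dataset:
    name: str
    level: str
    rows: list = field(default_factory=list)        # metadata table, one row per object
    milestones: dict = field(default_factory=dict)  # event name -> level after the event

    @property
    def tag(self) -> str:
        # Alphanumeric label derived from the current level, e.g. "S3".
        return f"S{LEVELS.index(self.level)}"

    @property
    def policy(self) -> dict:
        # Control operations that currently apply to every referenced object.
        return POLICY[self.level]


def pass_milestone(dataset: Dataset, event: str) -> bool:
    """Re-classify the dataset if the event is one of its milestones."""
    new_level = dataset.milestones.get(event)
    if new_level is None:
        return False  # unrelated event, classification unchanged
    if new_level not in LEVELS:
        raise ValueError(f"unknown sensitivity level: {new_level}")
    dataset.level = new_level
    return True


earnings = Dataset("q3-earnings", "secret", milestones={"earnings_released": "public"})
before = (earnings.tag, earnings.policy["encrypt"])
changed = pass_milestone(earnings, "earnings_released")
after = (earnings.tag, earnings.policy["encrypt"])
```

Because the tag and policy are derived from the dataset's current level, re-classifying the dataset once changes the operations applied to every object it references, which is the practical point of treating the dataset as the unit of sensitivity.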

Assignees

Dell Products Lp

Inventors

Classifications

  • where protection concerns the structure of data, e.g. records, types, queries · CPC title

  • Clustering or classification · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2024143822A1 cover?
Providing content-based sensitivity classification to content data in a data processing system, by defining sensitivity designations for data objects stored in the system, and creating datasets by grouping metadata for data objects that are intended to be classified with a same sensitivity designation, wherein each dataset spans multiple storage devices of different storage types, and wherein each…
Who is the assignee on this patent?
Dell Products Lp
What technology area does this patent fall under?
Primary CPC classification G06F21/6227. Mapped technology areas include Physics.
When was this patent published?
Published May 2, 2024 (kind code A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 7 related publications on this page (citations in our corpus or others sharing the same primary CPC).