Multi-network data management using content-based datasets and distributed tagging
US-2024143822-A1 · May 2, 2024 · US
US2024143822A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2024143822-A1 |
| Application number | US-202217975576-A |
| Country | US |
| Kind code | A1 |
| Filing date | Oct 27, 2022 |
| Priority date | Oct 27, 2022 |
| Publication date | May 2, 2024 |
| Grant date | — |
Official abstract text for this publication.
Providing content-based sensitivity classification to content data in a data processing system, by defining sensitivity designations for data objects stored in the system, and creating datasets by grouping metadata for data objects that are intended to be classified with a same sensitivity designation, wherein each dataset spans multiple storage devices of different storage types, and wherein each dataset defines a single data sensitivity unit for the data objects referenced by a respective dataset. A sensitivity classifier component tags each dataset with a sensitivity tag to specify a protection or control operation on the data objects referenced by the respective dataset, and a processing component then processes data objects for each tagged dataset in accordance with the specified protection or control operation.
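The flow described in the abstract (group metadata by sensitivity designation, tag each dataset, then process the referenced objects according to the tag) can be illustrated with a minimal Python sketch. This is not the implementation disclosed in the publication: the class names, the `SENS:` tag format, the `OPERATIONS` policy table, and the device names are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class ObjectMetadata:
    """Metadata for one data object (hypothetical fields)."""
    object_id: str
    storage_device: str  # e.g. "nas-01" (assumed name)
    storage_type: str    # e.g. "file", "object", "block"
    designation: str     # sensitivity designation, e.g. "secret"


@dataclass
class Dataset:
    """A group of metadata records sharing one sensitivity designation."""
    designation: str
    members: List[ObjectMetadata] = field(default_factory=list)
    tag: Optional[str] = None


# Assumed policy table mapping a designation to an operation name.
OPERATIONS: Dict[str, str] = {"public": "backup", "secret": "encrypt"}


def build_datasets(metadata: List[ObjectMetadata]) -> Dict[str, Dataset]:
    """Group metadata by designation, regardless of where objects are stored."""
    datasets: Dict[str, Dataset] = {}
    for record in metadata:
        dataset = datasets.setdefault(record.designation, Dataset(record.designation))
        dataset.members.append(record)
    return datasets


def tag_datasets(datasets: Dict[str, Dataset]) -> None:
    """Attach an alphanumeric sensitivity tag to each dataset."""
    for dataset in datasets.values():
        dataset.tag = f"SENS:{dataset.designation.upper()}"


def process(datasets: Dict[str, Dataset]) -> List[Tuple[str, str, str]]:
    """Return (object_id, storage_device, operation) for every referenced object."""
    actions = []
    for dataset in datasets.values():
        operation = OPERATIONS.get(dataset.designation, "backup")
        for record in dataset.members:
            actions.append((record.object_id, record.storage_device, operation))
    return actions


records = [
    ObjectMetadata("a", "nas-01", "file", "secret"),
    ObjectMetadata("b", "s3-east", "object", "secret"),
    ObjectMetadata("c", "nas-01", "file", "public"),
]
datasets = build_datasets(records)
tag_datasets(datasets)
actions = process(datasets)
```

In this sketch the "secret" dataset references objects on two devices of different storage types, yet it is tagged and processed as a single unit, which is the property the abstract emphasizes.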
Opening claim text (preview).
What is claimed is:
1. A computer-implemented method of providing content-based sensitivity classification to content data in a data processing system, comprising: defining sensitivity designations for data objects stored in the system; creating datasets by grouping metadata for data objects that will result in the data objects being classified with a same sensitivity designation, wherein each dataset spans multiple storage devices of different storage types, and wherein each dataset defines a single data sensitivity unit for the data objects referenced by a respective dataset; tagging each dataset with a sensitivity tag to specify a protection or control operation on the data objects referenced by the respective dataset; and processing data objects for each tagged dataset in accordance with the specified protection or control operation.
2. The method of claim 1 wherein the dataset comprises a set of metadata organized in a table, and wherein the sensitivity tag comprises an alphanumeric label appended to the table.
3. The method of claim 1 further comprising: processing a new data object received by the system; and assigning, to an assigned dataset, the new data object based on an associated sensitivity designation of the new data object.
4. The method of claim 1 wherein the sensitivity designation characterizes a dataset as sensitive or not sensitive with respect to disclosure of or access to data objects referenced by the dataset.
5. The method of claim 4 wherein the sensitivity designation comprises a classification along a range including public, internal, proprietary, secret, and top secret.
6. The method of claim 5 wherein the sensitivity designation is derived from a characteristic of the data objects referenced by the dataset including at least one of: inherent character due to data type or source, or explicit attributes including secrecy markings.
7. The method of claim 1 wherein the protection operation comprises at least one of: backing up data from operating memory to certain specified storage devices and locations, and wherein the control operation comprises at least one of: defining access permissions to the data objects by users of the system, changing a read/write status, or enforcing security measures on the data objects through encryption.
8. The method of claim 1 wherein the dataset evolves over time through one or more milestone events that change a sensitivity classification of the dataset due to change of context of the data objects, the method further comprising re-classifying a dataset containing data objects that change an initial sensitivity classification upon passing a milestone event.
9. The method of claim 8 wherein the referenced data objects are subject to different protection operations and different control operations between each milestone event.
10. A system for providing content-based sensitivity classification to content data in a data processing system, comprising: a sensitivity classifier component defining sensitivity designations for data objects stored in the system; a hardware-based dataset management component creating datasets by grouping metadata for data objects that will result in the data objects being classified with a same sensitivity designation, wherein each dataset spans multiple storage devices of different storage types, and wherein each dataset defines a single data sensitivity unit for the data objects referenced by a respective dataset, and tagging each dataset with a sensitivity tag to specify a protection or control operation on the data objects referenced by the respective dataset; and a data processing component processing data objects for each tagged dataset in accordance with the specified protection or control operation.
11. The system of claim 10 wherein the dataset comprises a set of metadata organized in a table, and wherein the sensitivity tag comprises an alphanumeric label appended to the table.
12. The system of claim 10 wherein the data processing component processes a new data object received by the system, and assigns, to an assigned dataset, the new data object based on an associated sensitivity designation of the new data object.
13. The system of claim 10 wherein the sensitivity designation characterizes a dataset as sensitive or not sensitive with respect to disclosure of or access to data objects referenced by the dataset, and wherein the sensitivity designation comprises a classification along a range including public, internal, proprietary, secret, and top secret.
14. The system of claim 13 wherein the sensitivity designation is derived from a characteristic of the data objects referenced by the dataset including at least one of: inherent character due to data type or source, or explicit attributes including secrecy markings.
15. The system of claim 10 wherein the protection operation comprises at least one of: backing up data from operating memory to certain specified storage devices and locations, and wherein the control operation comprises at least one of: defining access permissions to the data objects by users of the system, changing a read/write status, or enforcing security measures on the data objects through encryption.
16. The system of claim 10 wherein the dataset evolves over time through one or more milestone events that change a sensitivity classification of the dataset due to change of context of the data objects, the method further comprising re-classifying a dataset containing data objects that change an initial sensitivity classification upon passing a milestone event.
17. The system of claim 16 wherein the referenced data objects are subject to different protection operations and different control operations between each milestone event.
18. A tangible computer program product having stored thereon, programming code that, when executed by a processor, causes the processor to perform a method of providing content-based sensitivity classification to content data in a data processing system, comprising: defining sensitivity designations for data objects stored in the system; creating datasets by grouping metadata for data objects that will result in the data objects being classified with a same sensitivity designation, wherein each dataset spans multiple storage devices of different storage types, and wherein each dataset defines a single data sensitivity unit for the data objects referenced by a respective dataset; tagging each dataset with a sensitivity tag to specify a protection or control operation on the data objects referenced by the respective dataset; and processing data objects for each tagged dataset in accordance with the specified protection or control operation.
19. The computer program product of claim 18 wherein the sensitivity designation characterizes a dataset as sensitive or not sensitive with respect to disclosure of or access to data objects referenced by the dataset, and further wherein the sensitivity designation comprises a classification along a range including public, internal, proprietary, secret, and top secret.
20. The computer program product of claim 19 wherein the dataset comprises a set of metadata organized in a table, and wherein the sensitivity tag comprises an alphanumeric label appended to the table.
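The dependent claims on the classification range (claim 5), assignment of new data objects (claim 3), and milestone-driven re-classification (claims 8 and 9) can be pictured with a short Python sketch. Everything here is an illustrative assumption rather than the claimed implementation: the `Sensitivity` enum ordering, the `MILESTONES` mapping, the event names, and the operation names returned by `operations_for` are hypothetical.

```python
from enum import IntEnum


class Sensitivity(IntEnum):
    """Ordered range named in the claims: public through top secret."""
    PUBLIC = 0
    INTERNAL = 1
    PROPRIETARY = 2
    SECRET = 3
    TOP_SECRET = 4


# Assumed mapping from a milestone event to the classification that applies afterwards.
MILESTONES = {
    "product_launch": Sensitivity.PUBLIC,
    "merger_announced": Sensitivity.INTERNAL,
}


def operations_for(level):
    """Pick protection and control operations for a level (assumed policy)."""
    if level >= Sensitivity.SECRET:
        return {"protection": "backup_to_restricted_storage", "control": "encrypt"}
    if level >= Sensitivity.INTERNAL:
        return {"protection": "backup_to_standard_storage", "control": "restrict_access"}
    return {"protection": "backup_to_standard_storage", "control": "read_only"}


class TaggedDataset:
    """A dataset whose tag follows its current sensitivity level."""

    def __init__(self, name, level):
        self.name = name
        self.level = level
        self.object_ids = []

    @property
    def tag(self):
        return self.level.name

    def pass_milestone(self, event):
        """Re-classify on a known milestone; unknown events leave the level unchanged."""
        self.level = MILESTONES.get(event, self.level)
        return operations_for(self.level)


def assign(datasets, object_id, level):
    """Place a new object in the first dataset carrying the same designation."""
    for dataset in datasets:
        if dataset.level == level:
            dataset.object_ids.append(object_id)
            return dataset
    raise LookupError(f"no dataset with designation {level.name}")


roadmap = TaggedDataset("roadmap", Sensitivity.SECRET)
before = operations_for(roadmap.level)
after = roadmap.pass_milestone("product_launch")
pool = [TaggedDataset("handbook", Sensitivity.INTERNAL), roadmap]
target = assign(pool, "obj-1", Sensitivity.PUBLIC)
```

Here the same dataset is subject to encryption before the hypothetical `product_launch` milestone and to a read-only control afterwards, mirroring the idea that operations differ between milestone events.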
where protection concerns the structure of data, e.g. records, types, queries · CPC title
Clustering or classification · CPC title