Multi-tenancy data analytics platform
US-2022012251-A1 · Jan 13, 2022 · US
US12423457B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12423457-B2 |
| Application number | US-202217975576-A |
| Country | US |
| Kind code | B2 |
| Filing date | Oct 27, 2022 |
| Priority date | Oct 27, 2022 |
| Publication date | Sep 23, 2025 |
| Grant date | Sep 23, 2025 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Providing content-based sensitivity classification to content data in a data processing system, by defining sensitivity designations for data objects stored in the system, and creating datasets by grouping metadata for data objects that are intended to classified with a same sensitivity designation, wherein each dataset spans multiple storage devices of different storage types, and wherein each dataset defines a single data sensitivity unit for the data objects referenced by a respective dataset. A sensitivity classifier component tags each dataset with a sensitivity tag to specify a protection or control operation on the data objects referenced by the respective dataset, and a processing component then processes data objects for each tagged dataset in accordance with the specified protection or control operation.
Opening claim text (preview).
What is claimed is: 1. A computer-implemented method of providing content-based sensitivity classification to content data in a data processing system, comprising: defining sensitivity designations for data objects stored in the system; storing metadata of content data stored in disparate storage environments in a scanning database; creating a protection policy comprising a query representing data to be protected by the protection policy: executing the query comprising metadata selectors as dataset tags for matching against the scanning database to generate a dataset comprising a logical collection of metadata for unstructured data objects grouped together by one or more filters, and that represents data to be processed similarly with respect to a protection or control operation defined by the protection policy, and further wherein the data objects are classified with a same sensitivity designation, wherein the dataset spans multiple storage devices of different storage types, and defines a single data sensitivity unit for the data objects referenced by the dataset; tagging each dataset with a sensitivity tag to specify a protection or control operations on the data objects referenced by the dataset; and processing data objects for each tagged dataset in accordance with the specified protection or control operation, wherein the protection operation comprises backing up data from operating memory to certain specified storage devices and locations, and wherein the control operation comprises at least one of: defining access permissions to the data objects by users of the system, changing a read/write status, and enforcing security measures on the data objects through encryption. 2. The method of claim 1 wherein the dataset comprises a set of metadata organized in a table, and wherein the sensitivity tag comprises an alphanumeric label appended to the table. 3. The method of claim 1 further comprising: processing a new data object received by the system; and assigning, to an assigned dataset, the new data object based on an associated sensitivity designation of the new data object. 4. The method of claim 1 wherein the sensitivity designation characterizes a dataset as sensitive or not sensitive with respect to disclosure of or access to data objects referenced by the dataset. 5. The method of claim 4 wherein the sensitivity designation comprises a classification including public, internal, proprietary, secret, and top secret. 6. The method of claim 5 wherein the sensitivity designation is derived from a characteristic of the data objects reference by the dataset including at least one of: inherent character due to data type or source, or explicit attributes including secrecy markings. 7. The method of claim 1 wherein the dataset evolves over time through one or more milestone events that changes a sensitivity classification of the dataset due to change of context of the data objects, the method further comprising re-classifying a dataset containing data objects that change an initial sensitivity classification upon passing a milestone event. 8. The method of claim 7 wherein the referenced data objects are subject to different protection operations and different control operations between each milestone event. 9. A system for providing content-based sensitivity classification to content data in a data processing system, comprising: a sensitivity classifier component defining sensitivity designations for data objects stored in the system; a scanning database storing metadata of content data stored in disparate storage environments in a scanning database; a hardware-based dataset management component creating a protection policy comprising a query representing data to be protected by the protection policy and executing the query comprising metadata selectors as dataset tags for matching against the scanning database to generate a dataset comprising a logical collection of metadata for unstructured data objects grouped together by one or more filters, and that represents data to be processed similarly with respect to a protection or control operation defined by the protection policy, and further wherein the data objects are classified with a same sensitivity designation, wherein the dataset spans multiple storage devices of different storage types, and defines a single data sensitivity unit for the data objects referenced by the dataset, and tagging each dataset with a sensitivity tag to specify a protection or control operation on the data objects referenced by the dataset; and a data processing component processing data objects for each tagged dataset in accordance with the specified protection or control operation, wherein the protection operation comprises backing up data from operating memory to certain specified storage devices and locations, and wherein the control operation comprises at least one of: defining access permissions to the data objects by users of the system, changing a read/write status, and enforcing security measures on the data objects through encryption. 10. The system of claim 9 wherein the dataset comprises a set of metadata organized in a table, and wherein the sensitivity tag comprises an alphanumeric label appended to the table. 11. The system of claim 9 wherein the data processing component processes a new data object received by the system, and assigns, to an assigned dataset, the new data object based on an associated sensitivity designation of the new data object. 12. The system of claim 9 wherein the sensitivity designation characterizes a dataset as sensitive or not sensitive with respect to disclosure of or access to data objects referenced by the dataset, and wherein the sensitivity designation comprises a classification including public, internal, proprietary, secret, and top secret. 13. The system of claim 12 wherein the sensitivity designation is derived from a characteristic of the data objects reference by the dataset including at least one of: inherent character due to data type or source, or explicit attributes including secrecy markings. 14. The system of claim 9 wherein the dataset evolves over time through one or more milestone events that changes a sensitivity classification of the dataset due to change of context of the data objects, the method further comprising re-classifying a dataset containing data objects that change an initial sensitivity classification upon passing a milestone event. 15. The system of claim 14 wherein the referenced data objects are subject to different protection operations and different control operations between each milestone event. 16. A tangible computer program product comprising a non-transitory computer-readable medium having a computer-readable program code embodied therein, the computer-readable program code adapted to be executed by one or more processors to implement a method of providing content-based sensitivity classification to content data in a data processing system, comprising: defining sensitivity designations for data objects stored in the system; storing metadata of content data stored in disparate storage environments in a scanning database; creating a protection policy comprising a query representing data to be protected by the protection policy: executing the query comprising metadata selectors as dataset tags for matching against the scanning database to generate a dataset comprising a logical collection of metadata for unstructured data objects grouped together by one or more filters, and that represents data to be processed similarly with respect to a protection or control operation def
Clustering or classification · CPC title
where protection concerns the structure of data, e.g. records, types, queries · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.