Dataset lifecycle management using monitoring and ACL control for content-based datasets

US12292989B2 · US · B2

Patent metadata
Publication number: US-12292989-B2
Application number: US-202217975429-A
Country: US
Kind code: B2
Filing date: Oct 27, 2022
Priority date: Oct 27, 2022
Publication date: May 6, 2025
Grant date: May 6, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Managing a lifecycle of data by identifying data objects that are subject to same control rules in each stage of the lifecycle as grouped data, where the control rules allow only authorized access to or authorized operations on the grouped data based on a current stage of the lifecycle. A dataset is generated for the grouped data by identifying metadata of the grouped data to be processed similarly within the lifecycle, and storing the metadata in the dataset. The control rules are associated with the grouped data as stage tags for the dataset. Actions performed on the data referenced by the dataset are monitored to ensure that the monitored actions comply with control rules using the stage tags of the dataset.
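The mechanism summarized in the abstract can be pictured with a minimal Python sketch. Everything here is an illustrative assumption rather than the patented implementation: the names `Stage`, `StageTag`, `Dataset`, `is_compliant`, and `monitor` are hypothetical, the stage names are borrowed from claim 2, and denying actions when a stage has no tag is a design choice made only for this example.

```python
from dataclasses import dataclass, field
from enum import Enum


class Stage(Enum):
    # Stage names follow claim 2; a real system may define others.
    CREATED = "created"
    PROCESSED = "processed"
    MODIFIED = "modified"
    DESTROYED = "destroyed"


@dataclass(frozen=True)
class StageTag:
    """Hypothetical control rule for one stage: who may do what."""
    allowed_users: frozenset
    allowed_operations: frozenset


@dataclass
class Dataset:
    """Stores metadata about the grouped data objects, not the data itself."""
    name: str
    object_metadata: list
    stage_tags: dict = field(default_factory=dict)
    current_stage: Stage = Stage.CREATED


def is_compliant(dataset, user, operation):
    """Check one action against the stage tag of the current stage."""
    tag = dataset.stage_tags.get(dataset.current_stage)
    if tag is None:
        return False  # assumption: deny when the stage has no rule
    return user in tag.allowed_users and operation in tag.allowed_operations


def monitor(dataset, actions):
    """Return the (user, operation) pairs that violate the current stage tag."""
    return [(u, op) for u, op in actions if not is_compliant(dataset, u, op)]


ds = Dataset(
    name="q3-reports",
    object_metadata=[
        {"path": "/a/report.pdf", "type": "pdf"},
        {"path": "/b/report.docx", "type": "docx"},
    ],
    stage_tags={
        Stage.CREATED: StageTag(frozenset({"alice"}), frozenset({"read", "write"})),
        Stage.PROCESSED: StageTag(frozenset({"alice", "bob"}), frozenset({"read"})),
    },
)
violations = monitor(ds, [("alice", "write"), ("bob", "read")])
print(violations)  # [('bob', 'read')]
```

Because the rules hang off the dataset rather than off individual files, advancing `current_stage` changes what is permitted for every referenced object at once, which is the "single control unit" idea described in claim 3.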

First claim

Opening claim text (preview).

What is claimed is:

1. A computer-implemented method of managing a lifecycle of data processed through a plurality of stages in a system using content-based datasets, comprising: identifying data objects of disparate file formats that are subject to same control rules in each stage of the lifecycle as grouped data, wherein the control rules provide access only to authorized users or perform only authorized operations on the grouped data based on a current stage of the lifecycle, and further wherein the data objects are protected by different data protection policies utilizing the control rules; generating a dataset for the grouped data by scanning the data objects to identify metadata of the grouped data to be processed similarly within the lifecycle, and storing the identified metadata in the dataset, wherein the lifecycle includes a backup operation implementing the data protection policies; iteratively processing the dataset to tag the data objects according to a native file format; attaching multiple tags to the dataset to indicate that the data objects of the dataset are of different file types according to the disparate file formats; merging the protection policies to back up the dataset under a merged protection policy; associating the control rules to the grouped data as stage tags for the dataset; monitoring actions performed on and by the data objects referenced by the dataset in each stage of the lifecycle; and ensuring that the monitored actions comply with control rules using the stage tags of the dataset.

2. The method of claim 1 wherein the lifecycle of a dataset comprises different stages along a timeline and processing the data as it is first created, processed, modified, and destroyed, and wherein the data is subject to different access permissions and operations in each stage of the lifecycle.

3. The method of claim 2 wherein the dataset defines a single control unit for the referenced data objects of the dataset, and further wherein the stage tags are associated with the referenced data objects through the dataset as a single unit based on data content rather than data location.

4. The method of claim 2 wherein the control actions comprise access restrictions to the data using at least one of access control list (ACL) membership or role-based access control (RBAC) process to limit user access to the user for each stage of the lifecycle.

5. The method of claim 4 wherein the control actions further comprise at least one of: data operations including imposing data restrictions including status as confidential or sensitive data, data protection through encryption, routing to storage targets, and changing a read/write or read-only status.

6. The method of claim 1 wherein the dataset generating step comprises: gathering the identified metadata for storage in a data catalog; and executing a user entered query against the catalog to generate the dataset.

7. The method of claim 6 wherein the query comprises metadata selectors as dataset tags for matching against the cataloged metadata.

8. The method of claim 7 wherein the metadata selectors comprise tags consisting of alphanumeric strings applied to respective data objects based on user-defined rules, and wherein the tags define at least one of a file type, name, location, creation time, or characteristic.

9. The method of claim 8 wherein the dataset is organized into collection information and per file and object information, and further wherein the collection information comprises a dataset creation time, the query, role-based access control (RBAC) for the dataset, and first free-form metadata, and wherein the per file and object information comprises location of data of the dataset, unstructured metadata information, and second free-form metadata.

10. A computer-implemented method of managing a lifecycle processing data through a plurality of stages in a system using content-based datasets, comprising: identifying control rules applied to data processed in each stage of the lifecycle; identifying data objects of disparate file formats but subject to the same control rules in each stage of the lifecycle, wherein the data objects are protected by different data protection policies utilizing the control rules, and further wherein the lifecycle includes a backup operation implementing the data protection policies; forming a dataset comprising metadata corresponding to each data object of the identified data objects; iteratively processing the dataset to tag the data objects according to a native file format; attaching multiple tags to the dataset to indicate that the data objects of the dataset are of different file types according to the disparate file formats; merging the protection policies to back up the dataset under a merged protection policy; assigning a stage tag to the dataset, wherein the stage tag specifies the same control rules to be applied in each stage; and applying the corresponding control rule at each stage on corresponding data objects referenced by the dataset as the data is processed from one stage to a next stage in the lifecycle.

11. The method of claim 10 wherein the dataset defines a single control unit for the referenced data objects of the dataset, and further wherein the stage tags are associated with the referenced data objects through the dataset as a single unit based on data content rather than data location.

12. The method of claim 10 wherein the dataset forming step comprises: gathering the identified metadata for storage in a data catalog; and executing a user entered query against the catalog to generate the dataset.

13. The method of claim 12 wherein the query comprises metadata selectors as dataset tags for matching against the cataloged metadata, and further wherein the metadata selectors comprise tags consisting of alphanumeric strings applied to respective data objects based on user-defined rules, and wherein the tags define at least one of a file type, name, location, creation time, or characteristic.

14. The method of claim 10 wherein the lifecycle of a dataset comprises different stages along a timeline and processing the data as it is first created, processed, modified, and destroyed, and wherein the data is subject to different access permissions and operations in each stage of the lifecycle as defined by the control rules.

15. The method of claim 14 wherein the control rules specify access restrictions to the data using at least one of access control list (ACL) membership or role-based access control (RBAC) process to limit user access to the user for each stage of the lifecycle.

16. The method of claim 15 wherein the control rules further specify at least one of: data operations including imposing data restrictions including status as confidential or sensitive data, data protection through encryption, routing to storage targets, and changing a read/write or read-only status.

17. A computer-implemented method of managing a lifecycle processing data through a plurality of stages in a system using content-based datasets, comprising: deriving control rules applied to the data objects of disparate file formats as they are processed in each stage of the plurality of stages; determining data objects having the same applied control rules throughout the lifecycle, wherein the data objects are protected by different data protection policies utilizing the control rules, and further wherein the lifecycle includes a backup operation implementing the data protection policies; forming a dataset comprising metadata corresponding to each data object o
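The catalog-query steps recited in claims 6 to 8 and the per-format tagging recited in claim 1 can be sketched roughly as follows. This is a hedged illustration, not the claimed implementation: `CatalogEntry`, `Dataset`, `build_dataset`, the sample paths, and the rule that an entry matches only when it carries every selector are all assumptions introduced for this example.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass
class CatalogEntry:
    """Hypothetical cataloged metadata for one data object."""
    location: str
    file_type: str
    tags: set  # alphanumeric tags applied by user-defined rules


@dataclass
class Dataset:
    """Collection information plus per-object entries (loosely after claim 9)."""
    query: set
    created: datetime
    entries: list
    format_tags: set = field(default_factory=set)


def build_dataset(catalog, selectors):
    """Run a selector query against the catalog and tag the result by format."""
    # Assumption: an entry matches only if it carries every selector tag.
    matches = [entry for entry in catalog if selectors <= entry.tags]
    dataset = Dataset(
        query=set(selectors),
        created=datetime.now(timezone.utc),
        entries=matches,
    )
    # Iterate over the dataset and attach one tag per native file format,
    # so a mixed-format dataset ends up carrying multiple format tags.
    for entry in dataset.entries:
        dataset.format_tags.add(entry.file_type)
    return dataset


catalog = [
    CatalogEntry("/nas1/hr/payroll.xlsx", "xlsx", {"hr", "confidential"}),
    CatalogEntry("s3://bucket/hr/policy.pdf", "pdf", {"hr", "confidential"}),
    CatalogEntry("/nas2/eng/design.pdf", "pdf", {"engineering"}),
]
dataset = build_dataset(catalog, {"hr", "confidential"})
print(sorted(dataset.format_tags))  # ['pdf', 'xlsx']
print([e.location for e in dataset.entries])
```

The two matching objects live in different locations and formats, yet they land in one dataset because selection is driven by cataloged metadata rather than by path. How the differing protection policies are then merged is not described in the text shown on this page, so the sketch leaves that step out.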

Assignees

Inventors

Classifications

  • characterised by the use of retention policies (retention policies for HSM systems G06F16/185) · CPC title

  • Access rights, e.g. capability lists, access control lists, access tables, access matrices · CPC title

  • Auditing as a secondary aspect · CPC title

  • to a system of files or objects, e.g. local or distributed file system or database · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12292989B2 cover?
Managing a lifecycle of data by identifying data objects that are subject to same control rules in each stage of the lifecycle as grouped data, where the control rules allow only authorized access to or authorized operations on the grouped data based on a current stage of the lifecycle. A dataset is generated for the grouped data by identifying metadata of the grouped data to be processed simil…
Who is the assignee on this patent?
Dell Products LP
What technology area does this patent fall under?
Primary CPC classification G06F21/6218. Mapped technology areas include Physics.
When was this patent published?
Publication date May 6, 2025 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 9 related publications on this page (citations in our corpus or others sharing the same primary CPC).