Shadow data lakes
US-11720548-B1 · Aug 8, 2023 · US
US12566672B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12566672-B2 |
| Application number | US-202217952500-A |
| Country | US |
| Kind code | B2 |
| Filing date | Sep 26, 2022 |
| Priority date | Sep 26, 2022 |
| Publication date | Mar 3, 2026 |
| Grant date | Mar 3, 2026 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A method and system for leveraging backup process metadata for cloud object storage selective deletions. Under cloud object storage architecture, any unstructured data may be managed and stored in the cloud as objects. Objects thus provide an elastic, scalable format through which unstructured data may be maintained for a variety of purposes, including those directed to data backup, archiving, and/or disaster recovery. Further, any unstructured data, stored within any object, may pertain to one or many file(s). Should data, from multiple files, be stored within a shared object, any post-upload activity (e.g., modifications or deletions in keeping with data regulatory compliances) targeting said data, belonging to any one file or a subset of files, may be complex and resource-expensive. In addressing these undesirable qualities, embodiments disclosed herein use metadata, produced during and/or following backup processes protecting file data, to fulfill said post-upload activity resource-efficiently and without data compromising effects.
Opening claim text (preview).
What is claimed is: 1 . A method for implementing selective overwrites in backup data, the method comprising: identifying, by a first computing system comprising a processor and memory, an item representing select backup host information stored on a backup target associated with a selective deletion request, wherein: the backup target comprises at least a second computing system residing on a cloud computing environment, and the first computing system is operatively connected to the backup target; identifying, by the first computing system, in association with the item, a data object stored on the backup target and a data object portion of the data object, wherein: the data object comprises a portion of unstructured data associated with the item, and the portion of unstructured data comprises at least one of: videos, audio, images, emails, text documents, sensor data, application logs, social media data, location or geo-positioning data, and transactions, and the data object is identified from a backup process metadata record associated with the item, and the backup process metadata record comprises a status describing a state of the portion of unstructured data within the data object; overwriting, by the first computing system, a sub-portion of the unstructured data of the item reflected in the data object portion stored in the backup target using an artifact specified by the selective deletion request; and modifying, after the overwriting and by the first computing system, in the backup process metadata record, the status to reflect: a reason for overwriting the sub-portion of unstructured data, wherein the reason is a data regulatory compliance reason, wherein the data regulatory compliance reason is prevention of certain unstructured data from being stored in a specific geographical region, and an action associated with overwriting the sub-portion of unstructured data, wherein the action comprises modification or deletion of any instance of the sub-portion of unstructured data affected by the data regulatory compliance reason. 2 . The method of claim 1 , wherein the data object comprises the data object portion and a second data object portion, and wherein the second data object portion reflects second item content associated with a second item representing second select backup host information. 3 . The method of claim 1 , wherein the metadata comprises a data offset and a data size both associated with the item. 4 . The method of claim 3 , wherein the data object portion is identified using the data offset in conjunction with the data size. 5 . The method of claim 3 , the method further comprising: identifying, by the first computing system, in association with the item, a second data object stored on the backup target and a second data object portion of the second data object; and overwriting, by the first computing system, second item content of the item reflected in the second data object portion. 6 . A non-transitory computer readable medium (CRM) comprising computer readable program code, which when executed by a computer processor, enables the computer processor to perform a method for implementing selective overwrites in backup data, the method comprising: identifying, by a first computing system comprising a processor and memory, an item representing select backup host information stored on a backup target associated with a selective deletion request, wherein: the backup target comprises at least a second computing system residing on a cloud computing environment, and the first computing system is operatively connected to the backup target; identifying, by the first computing system, in association with the item, a data object stored on the backup target and a data object portion of the data object, wherein: the data object comprises a portion of unstructured data associated with the item, and the portion of unstructured data comprises at least one of: videos, audio, images, emails, text documents, sensor data, application logs, social media data, location or geo-positioning data, and transactions, and the data object is identified from a backup process metadata record associated with the item, and the backup process metadata record comprises a status describing a state of the portion of unstructured data within the data object; overwriting, by the first computing system, a sub-portion of the unstructured data of the item reflected in the data object portion stored in the backup target using an artifact specified by the selective deletion request; and modifying, after the overwriting and by the first computing system, in the backup process metadata record, the status to reflect: a reason for overwriting the sub-portion of unstructured data, wherein the reason is a data regulatory compliance reason, wherein the data regulatory compliance reason is prevention of certain unstructured data from being stored in a specific geographical region, and an action associated with overwriting the sub-portion of unstructured data, wherein the action comprises modification or deletion of any instance of the sub-portion of unstructured data affected by the data regulatory compliance reason. 7 . The non-transitory CRM of claim 6 , wherein the data object comprises the data object portion and a second data object portion, and wherein the second data object portion reflects second item content associated with a second item representing second select backup host information. 8 . The non-transitory CRM of claim 6 , wherein the metadata comprises a data offset and a data size both associated with the item. 9 . The non-transitory CRM of claim 8 , wherein the data object portion is identified using the data offset in conjunction with the data size. 10 . The non-transitory CRM of claim 8 , the method further comprising: identifying, by the first computing system, in association with the item, a second data object stored on the backup target and a second data object portion of the second data object; and overwriting, by the first computing system, second item content of the item reflected in the second data object portion. 11 . A system, the system comprising: a backup target storing backup host information, wherein the backup target comprises at least a second computing system residing on a cloud computing environment; and a first computing system comprising a computer processor operatively connected to the backup target, and configured to perform a method for implementing selective overwrites in backup data, the method comprising: identifying, by the first computing system, an item representing select backup host information stored on a backup target associated with a selective deletion request; identifying, by the first computing system, in association with the item, a data object stored on the backup target and a data object portion of the data object, wherein: the data object comprises a portion of unstructured data associated with the item, and the portion of unstructured data comprises at least one of: videos, audio, images, emails, text documents, sensor data, application logs, social media data, location or geo-positioning data, and transactions, and the data object is identified from a backup process metadata record associated with the item, and the backup process metadata record comprises a status describing a state of the portion of unstructured data within the data object; overwriting, by the first computing system, a sub-portion of the unstructured data of the item reflected in the data object portion stored in the backup target using an artifact specified by the selective deletion request; and modifying, after the overwriting and by t
for networked environments · CPC title
Using snapshots, i.e. a logical point-in-time copy of the data · CPC title
by selection of backup contents · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.