Efficient cleanup/defragmentation mechanism for expired retention locked (compliance & governance) segments in deduped cloud objects
US-2022121529-A1 · Apr 21, 2022 · US
US12105595B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12105595-B2 |
| Application number | US-202217955125-A |
| Country | US |
| Kind code | B2 |
| Filing date | Sep 28, 2022 |
| Priority date | Sep 28, 2022 |
| Publication date | Oct 1, 2024 |
| Grant date | Oct 1, 2024 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Embodiments for providing file immutability for cloud storage data in a deduplicating filesystem and using a new filesystem that is spawned to receive redirected live data, after which the old filesystem is expired. Data objects are stored in the cloud by defining a protection duration from a first date to a fixed future date, and applying a retention lock to one or more data objects stored in cloud storage during the protection duration, the retention lock preventing unauthorized deletion, modification or movement of the data. A renew threshold date is defined within the protection duration, and a new filesystem is spawned on this date. All new data is redirected to the new filesystem and locked for the protection duration, and the first file system is destroyed.
Opening claim text (preview).
What is claimed is: 1. A computer-implemented method, comprising: defining a protection duration from a first date to a fixed future date; applying a retention lock to one or more data objects stored in a first filesystem in cloud storage during the protection duration, the retention lock preventing deletion, modification or movement of the data objects by an unauthorized entity; defining a renew threshold date within the protection duration; spawning, on the renew threshold date, a new filesystem in the cloud storage; writing a data object to the cloud through a cloud bucket using a PUT request; adding appropriate headers in the PUT request to ensure objects are locked as part of a write operation of the data object itself; redirecting all new data objects to the new filesystem, wherein the new data objects are locked for the protection duration; and destroying the first filesystem. 2. The method of claim 1 further comprising: cleaning up expired objects from the first filesystem; and spawning additional filesystems repeatedly during the protection duration to provide complete protection to all data objects in the cloud storage. 3. The method of claim 1 wherein the protection duration is selected based on a data ingest rate and an amount of data turned over by a garbage collection cycle, and further wherein the retention lock is a compliance retention lock. 4. The method of claim 1 wherein the redirecting step is performed using an operating system scale-out architecture that a manages storage resources for space balancing. 5. The method of claim 1 wherein the redirecting step is performed by one of: using automated processes to make clients aware of the new filesystem, or using manual modifications in the clients to point to the new filesystem. 6. The method of claim 1 wherein the first date corresponds to a date that a first data object is written to the cloud storage, and wherein the renew threshold date is selected based on one of: a proportion of the protection duration, or a total cleanable space. 7. The method of claim 1 wherein the cloud storage comprises part of a deduplication backup system including a process executed by a data storage server running a deduplication filesystem. 8. The method of claim 7 wherein each of the first and new filesystems are deduplication filesystems. 9. A computer-implemented method, comprising: defining a repeatable retention lock period recurring from a fixed date, with each retention lock period having a fixed end date; applying a retention lock to one or more data objects stored in a first filesystem in cloud storage during each retention lock period, wherein the retention lock protects data objects written any time during a respective retention lock period only up to the corresponding fixed end date; defining a renew threshold date for each retention lock period within the protection duration; spawning, on each renew threshold date, a new filesystem in the cloud storage; writing a data object to the cloud through a cloud bucket using a PUT request; adding appropriate headers in the PUT request to ensure objects are locked as part of a write operation of the data object itself; redirecting all new data objects to the corresponding new filesystem, wherein the new data objects are locked for the protection duration; and destroying the first filesystem. 10. The method of claim 9 wherein the redirecting step is performed using an operating system scale-out architecture that a manages storage resources for space balancing. 11. The method of claim 9 wherein the redirecting step is performed by one of: using automated processes to make clients aware of the new filesystem, or using manual modifications in the clients to point to the new filesystem. 12. The method of claim 9 wherein the retention lock preventing deletion, modification or movement of the data objects by an unauthorized entity. 13. The method of claim 12 wherein the repeatable retention lock period is recurred as long as new data objects are written to cloud storage, or live data objects are carried forward by a garbage collection operation, and wherein the renew threshold date is selected based on one of: a proportion of the retention lock period, or a total cleanable space. 14. The method of claim 13 wherein the retention lock period is selected based on a data ingest rate and an amount of data turned over by each garbage collection cycle, and wherein the retention lock is a compliance retention lock. 15. The method of claim 9 wherein the cloud storage comprises part of a deduplication backup system including a process executed by a data storage server running a deduplication filesystem, and wherein each of the first and new filesystems are deduplication filesystems.
Delete operations (erasing in storage systems G06F3/0652) · CPC title
characterised by the use of retention policies (retention policies for HSM systems G06F16/185) · CPC title
De-duplication implemented within the file system, e.g. based on file segments (de-duplication techniques in storage systems for the management of data blocks G06F3/0641) · CPC title
Using snapshots, i.e. a logical point-in-time copy of the data · CPC title
Details of migration of file systems (migration mechanisms in storage systems G06F3/0647) · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.