Utilizing volume-level policies in a storage system
US-12061822-B1 · Aug 13, 2024 · US
US12436844B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12436844-B2 |
| Application number | US-202217951445-A |
| Country | US |
| Kind code | B2 |
| Filing date | Sep 23, 2022 |
| Priority date | Sep 23, 2022 |
| Publication date | Oct 7, 2025 |
| Grant date | Oct 7, 2025 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A distributed and deduplicating file system that stores backup data to object storage is described. A node, in a cluster of nodes, receives a write operation to back up a data file, and identifies backup data associated with the write operation. The node identifies deduplicated data by deduplicating the backup data. The cluster stores the deduplicated data to each of the buffers which corresponds to a node in the cluster. If the node determines that the size of information, which includes the deduplicated data, stored to each of the buffers has accumulated to become greater than or equal to an object threshold, then the node writes the information, which includes the deduplicated data, stored in each of the buffers to an object storage which stores data for the nodes in the cluster.
Opening claim text (preview).
What is claimed is: 1. A computer-implemented method for a distributed and deduplicating file system that stores backup data to object storage comprising: identifying, by a node in a cluster of a plurality of nodes, backup data associated with a write operation to back up a data file, in response to receiving the write operation; identifying, by the node, deduplicated data by deduplicating the backup data; replicating, by the cluster, the deduplicated data to each of a plurality of distributed buffers corresponding to the plurality of nodes; determining, by the node, whether a size of information, which comprises the deduplicated data, replicating to each of the plurality of distributed buffers has accumulated to become at least an object threshold; and writing, by the node, the information, which comprises the deduplicated data, replicated in each of the plurality of distributed buffers to an object storage that stores data for the plurality of the nodes, in response to a determination that the size of the information replicated to each of the plurality of distributed buffers has accumulated to become at least the object threshold. 2. The computer-implemented method of claim 1 , wherein writing the backup data to the object storage comprises identifying a quantity of the backup data which equals the object threshold, and writing the quantity of the backup data to the object storage. 3. The computer-implemented method of claim 2 , wherein the quantity of the backup data comprises a packed object created by compressing the backup data into a compression region, and packing the compressed region with other compressed regions. 4. The computer-implemented method of claim 1 , wherein the object threshold is based on a size of a data structure which is internally managed as a chunk in the object storage. 5. The computer-implemented method of claim 1 , wherein the object storage responds to a determination that a data object was written and then deleted by a file system by releasing the data object without needing to perform any copy-forward process on the data object. 6. The computer-implemented method of claim 1 , wherein the plurality of distributed buffers is implemented by a corresponding plurality of flash memory devices, and the object storage stores data in hard disk drives. 7. A computer program product comprising a non-transitory computer-readable medium having a computer-readable program code embodied therein to be executed by one or more processors, the program code including instructions to: identify, by a node in a cluster of a plurality of nodes, backup data associated with a write operation to back up a data file, in response to receiving the write operation; identify, by the node, deduplicated data by deduplicating the backup data; replicate, by the cluster, the deduplicated data to each of a plurality of distributed buffers corresponding to the plurality of nodes; determine, by the node, whether a size of information, which comprises the deduplicated data, replicated to each of the plurality of distributed buffers has accumulated to become at least an object threshold; and write, by the node, the information, which comprises the deduplicated data, replicated in each of the plurality of distributed buffers to an object storage that stores data for the plurality of the nodes, in response to a determination that the size of the information replicated to each of the plurality of distributed buffers has accumulated to become at least the object threshold. 8. The computer program product of claim 7 , wherein writing the information to the object storage comprises identifying a quantity of the information which equals the object threshold, and writing the quantity of the information to the object storage. 9. The computer program product of claim 8 , wherein the quantity of the information comprises a packed object created by compressing the backup data into a compression region, and packing the compressed region with other compressed regions. 10. The computer program product of claim 7 , wherein the object threshold is based on a size of a data structure which is internally managed as a chunk in the object storage. 11. The computer program product of claim 7 , wherein the object storage responds to a determination that a data object was written and then deleted by a file system by releasing the data object without needing to perform any copy-forward process on the data object. 12. The computer program product of claim 7 , wherein the plurality of distributed buffers is implemented by a corresponding plurality of flash memory devices. 13. The computer program product of claim 7 , wherein the object storage stores data in hard disk drives. 14. A distributed and deduplicating file system that stores backup data to object storage comprising: one or more processors; and a non-transitory computer readable medium storing a plurality of instructions, which when executed, cause the one or more processors to: identify, by a node in a cluster of a plurality of nodes, backup data associated with a write operation to back up a data file, in response to receiving the write operation; identify, by the node, deduplicated data by deduplicating the backup data; replicate, by the cluster, the deduplicated data to each of a plurality of distributed buffers corresponding to the plurality of nodes; determine, by the node, whether a size of information, which comprises the deduplicated data, replicated to each of the plurality of distributed buffers has accumulated to become at least an object threshold; and write, by the node, the information, which comprises the deduplicated data, replicated in each of the plurality of distributed buffers to an object storage that stores data for the plurality of the nodes, in response to a determination that the size of the information replicated to each of the plurality of distributed buffers has accumulated to become at least the object threshold. 15. The distributed and deduplicating file system of claim 14 , wherein writing the information to the object storage comprises identifying a quantity of the information which equals the object threshold, and writing the quantity of the information to the object storage. 16. The distributed and deduplicating file system of claim 15 , wherein the quantity of the information comprises a packed object created by compressing the backup data into a compression region, and packing the compressed region with other compressed regions. 17. The distributed and deduplicating file system of claim 14 , wherein the object threshold is based on a size of a data structure which is internally managed as a chunk in the object storage. 18. The distributed and deduplicating file system of claim 14 , wherein the object storage responds to a determination that a data object was written and then deleted by a file system by releasing the data object without needing to perform any copy-forward process on the data object. 19. The distributed and deduplicating file system of claim 14 , wherein the plurality of distributed buffers is implemented by a corresponding plurality of flash memory devices. 20. The distributed and deduplicating file system of claim 14 , wherein the object storage stores data in hard disk drives.
De-duplication techniques · CPC title
Backup restoration techniques · CPC title
for networked environments · CPC title
using de-duplication of the data · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.