Distributed and deduplicating file system for storing backup data to object storage

US12436844B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12436844-B2
Application numberUS-202217951445-A
CountryUS
Kind codeB2
Filing dateSep 23, 2022
Priority dateSep 23, 2022
Publication dateOct 7, 2025
Grant dateOct 7, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A distributed and deduplicating file system that stores backup data to object storage is described. A node, in a cluster of nodes, receives a write operation to back up a data file, and identifies backup data associated with the write operation. The node identifies deduplicated data by deduplicating the backup data. The cluster stores the deduplicated data to each of the buffers which corresponds to a node in the cluster. If the node determines that the size of information, which includes the deduplicated data, stored to each of the buffers has accumulated to become greater than or equal to an object threshold, then the node writes the information, which includes the deduplicated data, stored in each of the buffers to an object storage which stores data for the nodes in the cluster.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method for a distributed and deduplicating file system that stores backup data to object storage comprising: identifying, by a node in a cluster of a plurality of nodes, backup data associated with a write operation to back up a data file, in response to receiving the write operation; identifying, by the node, deduplicated data by deduplicating the backup data; replicating, by the cluster, the deduplicated data to each of a plurality of distributed buffers corresponding to the plurality of nodes; determining, by the node, whether a size of information, which comprises the deduplicated data, replicating to each of the plurality of distributed buffers has accumulated to become at least an object threshold; and writing, by the node, the information, which comprises the deduplicated data, replicated in each of the plurality of distributed buffers to an object storage that stores data for the plurality of the nodes, in response to a determination that the size of the information replicated to each of the plurality of distributed buffers has accumulated to become at least the object threshold. 2. The computer-implemented method of claim 1 , wherein writing the backup data to the object storage comprises identifying a quantity of the backup data which equals the object threshold, and writing the quantity of the backup data to the object storage. 3. The computer-implemented method of claim 2 , wherein the quantity of the backup data comprises a packed object created by compressing the backup data into a compression region, and packing the compressed region with other compressed regions. 4. The computer-implemented method of claim 1 , wherein the object threshold is based on a size of a data structure which is internally managed as a chunk in the object storage. 5. The computer-implemented method of claim 1 , wherein the object storage responds to a determination that a data object was written and then deleted by a file system by releasing the data object without needing to perform any copy-forward process on the data object. 6. The computer-implemented method of claim 1 , wherein the plurality of distributed buffers is implemented by a corresponding plurality of flash memory devices, and the object storage stores data in hard disk drives. 7. A computer program product comprising a non-transitory computer-readable medium having a computer-readable program code embodied therein to be executed by one or more processors, the program code including instructions to: identify, by a node in a cluster of a plurality of nodes, backup data associated with a write operation to back up a data file, in response to receiving the write operation; identify, by the node, deduplicated data by deduplicating the backup data; replicate, by the cluster, the deduplicated data to each of a plurality of distributed buffers corresponding to the plurality of nodes; determine, by the node, whether a size of information, which comprises the deduplicated data, replicated to each of the plurality of distributed buffers has accumulated to become at least an object threshold; and write, by the node, the information, which comprises the deduplicated data, replicated in each of the plurality of distributed buffers to an object storage that stores data for the plurality of the nodes, in response to a determination that the size of the information replicated to each of the plurality of distributed buffers has accumulated to become at least the object threshold. 8. The computer program product of claim 7 , wherein writing the information to the object storage comprises identifying a quantity of the information which equals the object threshold, and writing the quantity of the information to the object storage. 9. The computer program product of claim 8 , wherein the quantity of the information comprises a packed object created by compressing the backup data into a compression region, and packing the compressed region with other compressed regions. 10. The computer program product of claim 7 , wherein the object threshold is based on a size of a data structure which is internally managed as a chunk in the object storage. 11. The computer program product of claim 7 , wherein the object storage responds to a determination that a data object was written and then deleted by a file system by releasing the data object without needing to perform any copy-forward process on the data object. 12. The computer program product of claim 7 , wherein the plurality of distributed buffers is implemented by a corresponding plurality of flash memory devices. 13. The computer program product of claim 7 , wherein the object storage stores data in hard disk drives. 14. A distributed and deduplicating file system that stores backup data to object storage comprising: one or more processors; and a non-transitory computer readable medium storing a plurality of instructions, which when executed, cause the one or more processors to: identify, by a node in a cluster of a plurality of nodes, backup data associated with a write operation to back up a data file, in response to receiving the write operation; identify, by the node, deduplicated data by deduplicating the backup data; replicate, by the cluster, the deduplicated data to each of a plurality of distributed buffers corresponding to the plurality of nodes; determine, by the node, whether a size of information, which comprises the deduplicated data, replicated to each of the plurality of distributed buffers has accumulated to become at least an object threshold; and write, by the node, the information, which comprises the deduplicated data, replicated in each of the plurality of distributed buffers to an object storage that stores data for the plurality of the nodes, in response to a determination that the size of the information replicated to each of the plurality of distributed buffers has accumulated to become at least the object threshold. 15. The distributed and deduplicating file system of claim 14 , wherein writing the information to the object storage comprises identifying a quantity of the information which equals the object threshold, and writing the quantity of the information to the object storage. 16. The distributed and deduplicating file system of claim 15 , wherein the quantity of the information comprises a packed object created by compressing the backup data into a compression region, and packing the compressed region with other compressed regions. 17. The distributed and deduplicating file system of claim 14 , wherein the object threshold is based on a size of a data structure which is internally managed as a chunk in the object storage. 18. The distributed and deduplicating file system of claim 14 , wherein the object storage responds to a determination that a data object was written and then deleted by a file system by releasing the data object without needing to perform any copy-forward process on the data object. 19. The distributed and deduplicating file system of claim 14 , wherein the plurality of distributed buffers is implemented by a corresponding plurality of flash memory devices. 20. The distributed and deduplicating file system of claim 14 , wherein the object storage stores data in hard disk drives.

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12436844B2 cover?
A distributed and deduplicating file system that stores backup data to object storage is described. A node, in a cluster of nodes, receives a write operation to back up a data file, and identifies backup data associated with the write operation. The node identifies deduplicated data by deduplicating the backup data. The cluster stores the deduplicated data to each of the buffers which correspon…
Who is the assignee on this patent?
Dell Products Lp
What technology area does this patent fall under?
Primary CPC classification G06F11/1453. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Oct 07 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 10 related publications on this page (citations in our corpus or others sharing the same primary CPC).