Data reduction in a cloud-based storage system
US-2024320146-A1 · Sep 26, 2024 · US
US2025284426A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2025284426-A1 |
| Application number | US-202418597830-A |
| Country | US |
| Kind code | A1 |
| Filing date | Mar 6, 2024 |
| Priority date | Mar 6, 2024 |
| Publication date | Sep 11, 2025 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Systems and methods described herein comprise a management system that, in response to obtaining from a volume of a source storage system, a fingerprint information, provide that information to candidate storage systems, which return estimates indicative of an amount of data storage each of the candidate storage systems would require to store the data. The management system then selects, among the candidate storage systems, one or more destination storage systems to transfer the data from the volume to the selected destination storage systems.
Opening claim text (preview).
What is claimed is: 1 . A method for managing transfer of data from a source storage system, the method comprising: obtaining, from one or more volumes of a source storage system, first fingerprint information; providing the first fingerprint information to candidate storage systems; receiving, from the candidate storage systems, estimates indicative of an amount of data storage each of the candidate storage systems would require to store the data once it is transferred; based on the estimates, selecting, among the candidate storage systems, one or more destination storage systems to obtain selected storage systems; and transferring the data from the one or more volumes to at least the selected storage systems. 2 . The method of claim 1 , wherein one or more of the candidate storage systems obtain an estimate by comparing the first fingerprint information with second fingerprint information that is associated with one or more of the candidate storage systems. 3 . The method of claim 2 , wherein selecting one or more destination storage systems comprises selecting a destination storage system associated with the lowest estimate among the estimates. 4 . The method of claim 2 , wherein comparing comprises comparing the first fingerprint information with a subset of the second fingerprint information to reduce a computational cost without significantly reducing an estimate accuracy. 5 . The method of claim 2 , wherein the first fingerprint information is provided to the candidate storage systems in response to determining one or more properties associated with at least a subset of the candidate storage system. 6 . The method of claim 5 , wherein the one or more properties comprise a data access pattern that is associated with at least one of read data, write data, a data length, or a time series. 7 . The method of claim 5 , wherein the one or more properties comprise a volume label associated with at least one of inventory information or account information. 8 . The method of claim 2 , wherein the second fingerprint information is processed in a tree format. 9 . The method of claim 1 , wherein the first fingerprint information comprises a compression ratio of non-duplicated data. 10 . A non-transitory computer-readable medium for storing instructions for executing a process, the instructions comprising: at a management system, receiving, from a source node, first information associated with a data match rate; providing the first information to candidate storage systems that each uses the first information to generate an estimate of an amount of data to be stored on the candidate storage system; in response to obtaining the estimates, selecting one or more destination storage systems among the candidate storage systems to reduce an amount of data to be transferred to at least one of the one or more destination storage systems; and causing the data to the transferred from the source node to the at least one destination storage system. 11 . The non-transitory computer-readable medium of claim 10 , wherein generating the estimate comprises comparing the first information to second information associated with the candidate storage systems. 12 . The non-transitory computer-readable medium of claim 11 , wherein comparing comprises comparing the first information with a subset of the second information to reduce a computational demand. 13 . The non-transitory computer-readable medium of claim 11 , wherein selecting the one or more destination storage systems comprises selecting a destination storage system associated with the lowest estimate among the estimates. 14 . The non-transitory computer-readable medium of claim 11 , wherein at least one of the first information or the second information comprises at least one of fingerprint information, a data access pattern, a data duplication rate, a volume label, or a compression ratio. 15 . The non-transitory computer-readable medium of claim 11 , wherein the first information is provided to the candidate storage systems in response to determining one or more properties associated with at least a subset of the candidate storage systems. 16 . A system for reducing a data transfer volume, the system comprising: a management system configured to receive first information associated with at least one of fingerprint information, a data access pattern, a data duplication rate, a volume label, or a compression ratio; a source storage communicatively coupled to the management system, the source storage configured to provide the first information to the management system; and a candidate storage systems communicatively coupled to the management system, each of the candidate storage systems is configured to use the first to generate an estimate of an amount of data to be stored on that candidate storage system, wherein the management system is configured to use the estimates to select one or more destination storage systems among the candidate storage systems, such that a transfer volume from the source storage to at least one of the one or more destination storage systems is reduced. 17 . The system of claim 16 , wherein the management system uses a storage device selection module to select a destination storage system that is associated with the lowest estimate among the estimates. 18 . The system of claim 16 , wherein the candidate storage systems generate the estimates by using a storage consumption estimation module to compare the first information to second information associated with the candidate storage systems. 19 . The system of claim 18 , wherein the candidate storage systems compare the first information with a subset of the second information to reduce a computational cost. 20 . The system of claim 16 , wherein management system provides the first information to the candidate storage systems in response to determining one or more properties associated with at least a subset of the candidate storage systems.
Replication mechanisms · CPC title
Saving storage space on storage systems · CPC title
De-duplication techniques · CPC title
Migration mechanisms · CPC title
Improving or facilitating administration, e.g. storage management · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.