Replication using deduplicated secondary copy data
US-10481824-B2 · Nov 19, 2019 · US
US12292802B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12292802-B2 |
| Application number | US-202418423685-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jan 26, 2024 |
| Priority date | Dec 7, 2017 |
| Publication date | May 6, 2025 |
| Grant date | May 6, 2025 |
Official abstract text for this publication.
An illustrative media agent (MA) in a data storage management system instructs a NAS file server (filer) to restore an MA-created synthesized-copy instead of larger filer-created backup copies. The synthesized-copy is designed only for the particular files to be restored and mimics, and is typically much smaller than, a filer-created backup copy. The synthesized-copy is fed to the filer on restore as a “restore data image.” When receiving a restore request for certain backed-up data files, the MA synthesizes the synthesized-copy on the fly. The MA generates a header mimicking a filer-created backup header; extracts files from filer-created backup copies arranging them within the synthesized-copy as if in filer-created backups; and instructs filer to perform a full-volume restore from the synthesized-copy. The MA serves the synthesized-copy piecemeal as available, rather than waiting to synthesize the entire synthesized-copy. The synthesized-copy is not stored at the MA.
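The piecemeal serving described in the abstract can be illustrated with a minimal sketch. It assumes a made-up text format for the copy header and per-file headers (the real filer backup format is not given in this record), and every name in it (`build_header`, `synthesize`, `serve_blocks`, `BLOCK_SIZE`) is hypothetical. It only shows the general idea: emit a header, then just the requested files, and hand out fixed-size blocks as soon as they fill rather than building the whole copy first.

```python
from typing import Dict, Iterable, Iterator, List

BLOCK_SIZE = 8  # hypothetical block size, kept tiny so the example is easy to trace


def build_header(file_names: List[str]) -> bytes:
    # Hypothetical stand-in for a header written in the filer's own backup format.
    return ("HDR:" + ",".join(file_names) + "\n").encode()


def synthesize(backup: Dict[str, bytes], wanted: List[str]) -> Iterator[bytes]:
    """Yield the synthesized copy piece by piece: header first, then each wanted file."""
    yield build_header(wanted)
    for name in wanted:
        data = backup[name]
        # Hypothetical per-file header followed by the file data.
        yield f"FILE:{name}:{len(data)}\n".encode()
        yield data


def serve_blocks(pieces: Iterable[bytes], block_size: int = BLOCK_SIZE) -> Iterator[bytes]:
    """Re-chunk pieces into fixed-size blocks, emitting each block as soon as it fills."""
    buf = bytearray()
    for piece in pieces:
        buf.extend(piece)
        while len(buf) >= block_size:
            yield bytes(buf[:block_size])
            del buf[:block_size]
    if buf:
        yield bytes(buf)  # final, possibly short, block


# Stand-in for a filer-created backup copy holding three files.
backup_copy = {"a.txt": b"alpha", "b.txt": b"bravo-bravo", "c.txt": b"charlie"}

# Collected into a list only so the result can be inspected here; a server
# answering read requests would send each block as it is produced.
blocks = list(serve_blocks(synthesize(backup_copy, ["a.txt", "c.txt"])))
image = b"".join(blocks)
```

Because both stages are generators, at most one partial block is buffered at a time, which mirrors the abstract's point that the synthesized-copy is not stored at the MA.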
Opening claim text (preview).
What is claimed is:
1. A system comprising: a computing device comprising one or more hardware processors and computer memory, and in communication with a file server; and one or more data storage devices in communication with the computing device, wherein the one or more data storage devices store one or more backup copies that were generated by the file server, and wherein the one or more backup copies comprise a plurality of backed-up data files; wherein the computing device is configured to: generate a synthesized-copy that comprises one or more first backed-up data files selected from among the plurality of backed-up data files in the one or more backup copies that were generated by the file server, wherein to generate the synthesized-copy, the computing device is further configured to: (i) configure a header for the synthesized-copy, based at least in part on information collected by the computing device when the computing device stored the one or more backup copies that were generated by the file server at the one or more data storage devices, wherein the header for the synthesized-copy is readable, by the file server, as a header for a backup copy generated by the file server, even though the synthesized-copy is not generated by the file server, and (ii) within the synthesized-copy, arrange the one or more first backed-up data files, retrieved from the one or more backup copies, into a sequence that is compatible with how the file server restores backup copies; and wherein the computing device is further configured to: cause the file server to restore, to the file server, the synthesized-copy comprising the one or more first backed-up data files, without restoring all of the plurality of backed-up data files.
2. The system of claim 1, wherein to cause the file server to restore the synthesized-copy to the file server, the computing device is configured to instruct the file server to perform a full-volume restore operation from a data storage volume at the computing device, wherein the data storage volume is identified by an identifier at the computing device and is not actually configured at the computing device.
3. The system of claim 2, wherein the computing device is further configured to: generate the synthesized-copy responsive to the full-volume restore operation of the file server directed to the data storage volume.
4. The system of claim 1, wherein the computing device is further configured to: respond to one or more read requests received from the file server to restore the one or more first backed-up data files, by transmitting data blocks of the synthesized-copy to the file server without waiting to generate the synthesized-copy in its entirety at the computing device.
5. The system of claim 1, wherein to arrange the one or more first backed-up data files into the sequence within the synthesized-copy, the computing device is further configured to: renumber one or more original inode identifiers of the one or more first backed-up data files retrieved from the one or more backup copies to fit into the sequence within the synthesized-copy.
6. The system of claim 5, wherein to cause the file server to restore the synthesized-copy, the computing device is further configured to: instruct the file server to perform a full-volume restore operation from the computing device, and wherein renumbering of the one or more original inode identifiers enables the file server to restore the one or more first backed-up data files of the synthesized-copy via the full-volume restore operation, even though at least some of the one or more first backed-up data files being restored are not contiguous within the one or more backup copies.
7. The system of claim 1, wherein the computing device is further configured to: prior to generating the synthesized-copy, instruct the file server to back up data stored on the file server, resulting in the one or more backup copies; cause the one or more backup copies to be stored, as received from the file server, at the one or more data storage devices; collect information about backed-up data files in each of the one or more backup copies received from the file server, and store the information into a data structure at the computing device, and wherein the header generated for the synthesized-copy is based on the information in the data structure.
8. The system of claim 1, wherein the computing device is further configured to: transmit data blocks of the synthesized-copy to the file server, as the data blocks of the synthesized-copy become available at the computing device and without storing the synthesized-copy in its entirety at the computing device.
9. The system of claim 1, wherein the computing device is further configured to: generate the synthesized-copy dynamically, in response to a request to restore to the file server the one or more first backed-up data files, without storing the synthesized-copy in its entirety within the system.
10. The system of claim 1, wherein the synthesized-copy is transmitted by the computing device to the file server, block-by-block, without waiting to generate the synthesized-copy in its entirety at the computing device.
11. A computer-implemented method comprising: by a computing device in a data storage management system, wherein the computing device comprises one or more hardware processors and computer memory: identifying one or more backup copies generated by a file server, wherein the one or more backup copies comprise one or more data files to be restored to the file server; instructing the file server to perform a full-volume restore operation from a data storage volume at the computing device to the file server; constructing a synthesized-copy that is to be restored by the file server in the full-volume restore operation, wherein constructing the synthesized-copy comprises: (i) generating a first header for the synthesized-copy, based at least in part on information at the computing device which was collected by the computing device when the one or more backup copies generated by the file server were stored in secondary storage, wherein the first header is in a format of a header for a backup copy generated by the file server, (ii) retrieving, from the one or more backup copies, respective file data and a respective file header for each of the one or more data files to be restored, and (iii) arranging, within the synthesized-copy, the respective file data and respective file header that were retrieved, into a sequence that is suitable for the file server to perform the full-volume restore operation; and in response to one or more read requests received from the file server during the full-volume restore operation, transmitting data blocks of the synthesized-copy to the file server as each of the data blocks of the synthesized-copy becomes available at the computing device, without storing the synthesized-copy in its entirety at the computing device.
12. The computer-implemented method of claim 11, wherein the synthesized-copy is constructed based on a request to restore the one or more data files from the one or more backup copies, and wherein the synthesized-copy is configured as a restore image to be restored by the file server from the data storage volume.
13. The computer-implemented method of claim 12, wherein the data storage volume is identified by an identifier at the computing device and is not actually configured at the computing device.
14. The computer-implemented method of claim 11, wherein constructing the synthesized-copy further comprises renumbering of original inode identifiers of the one or more data files
Replication mechanisms · CPC title
Distributed or networked storage systems, e.g. storage area networks [SAN], network attached storage [NAS] · CPC title
Management of blocks · CPC title
in relation to data integrity, e.g. data losses, bit errors · CPC title
Backup restoration techniques · CPC title