Creating file recipes for copy overwrite workloads in deduplication file systems

US12105594B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12105594-B2
Application numberUS-202217841960-A
CountryUS
Kind codeB2
Filing dateJun 16, 2022
Priority dateJun 16, 2022
Publication dateOct 1, 2024
Grant dateOct 1, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments for processing fastcopy overwrite backup workloads (FCOW) as virtual synthetic backups, by detecting whether or not a backup workload comprises an FCOW workload in which a base file is fastcopied to a target file, wherein portions of the base file are overwritten through overwrites by new data in the target file by a deduplication backup process. For FCOW workloads, the process creates a file recipe by trapping seek offsets and write byte lengths for the overwrites, and writing extent information comprising offset differences to a recipe table of the file recipe, and the file recipe facilitates file processing optimizations including differencing, replication, and tiering.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method of processing fastcopy overwrite backup workloads (FCOW) as virtual synthetic backups, comprising: detecting whether or not a backup workload comprises an FCOW workload in which a base file is fastcopied to a target file, wherein portions of the base file are overwritten through overwrites by new data in the target file by a deduplication backup process executed by a data storage server running a Data Domain File System (DDFS); and creating, for an FCOW workload file, a file recipe by trapping seek offsets and write byte lengths for the overwrites, and writing extent information comprising offset differences to a recipe table of the file recipe, wherein the file recipe facilitates file processing optimizations including differencing, replication, and tiering, wherein the base file and target file are each stored in a Merkle tree structure with content data stored in a bottom level of the tree and indexed by fingerprints, and further wherein the fastcopy process copies metadata of the base file comprising inode information and a reference to the file L6 fingerprint in a first Merkle tree to a second directory for the target file, and wherein a second Merkle tree is generated for the target file in which the new data blocks for the overwritten data are inserted over any updated old data blocks of the first Merkle tree. 2. The method of claim 1 wherein the FCOW workload comprises change block tracking (CBT) workloads, and wherein the detecting step comprises determining whether a specific extended attribute is associated with the base file. 3. The method of claim 2 wherein the extended attribute indicates that the file is a fixed size segment (FSS) file. 4. The method of claim 1 wherein the FCOW backups use one of Data Domain Boost application program interfaces (APIs) or REST APIs. 5. The method of claim 4 wherein the FCOW backup overwrites data segments at correct offsets over a network file system (NFS) protocol. 6. The method of claim 1 wherein the file recipe is equivalent to metadata that is created when a virtual synthetic file is creating using one or more synthetic application program interfaces (APIs). 7. The method of claim 6 wherein the file recipe represents a relationship between the target files and one or more prior generations of base files created by one or more previous full and incremental backup operations. 8. A computer-implemented method of creating file recipes for fastcopy overwrite backup workloads (FCOW) in a deduplication backup system, comprising: receiving a target file copied through a fastcopy overwrite operation on a base file that overwrites incremental segments deduplicated by the backup system, wherein the target file is a virtual synthetic backup file having fixed size segmented (FSS) blocks and comprising change block tracking (CBT) workloads, and wherein the base file copied as the target file has a specific extended attribute is associated therewith indicating a FSS file; opening the target file in read/write mode, wherein the overwritten segments are inserted at respective offsets; trapping seek offsets and write byte lengths for the overwritten segments; and writing extent information comprising offset differences to a file recipe that facilitates file processing optimizations including differencing, replication, and tiering, wherein the fastcopy process copies metadata of the base file comprising inode information and a reference to a file fingerprint in a first directory tree to a second directory for the target file, and wherein a second directory tree is generated for the target file in which the new data blocks for the overwritten data are inserted over any updated old data blocks of the first Merkle tree. 9. The method of claim 8 wherein the FCOW backups use one of Data Domain Boost application program interfaces (APIs) or REST APIs. 10. A computer program product, comprising a non-transitory computer-readable medium having a computer-readable program code embodied therein, the computer-readable program code adapted to be executed by one or more processors to implement a method of creating file recipes for fastcopy overwrite backup workloads (FCOW) in a deduplication backup system, comprising: receiving a target file copied through a fastcopy overwrite operation on a base file that overwrites incremental segments deduplicated by the backup system, wherein the target file is a virtual synthetic backup file having fixed size segmented (FSS) blocks and comprising change block tracking (CBT) workloads, and wherein the base file copied as the target file has a specific extended attribute is associated therewith indicating a FSS file; opening the target file in read/write mode, wherein the overwritten segments are inserted at respective offsets; trapping seek offsets and write byte lengths for the overwritten segments; and writing extent information comprising offset differences to a file recipe that facilitates file processing optimizations including differencing, replication, and tiering, wherein the fastcopy process copies metadata of the base file comprising inode information and a reference to a file fingerprint in a first directory tree to a second directory for the target file, and wherein a second directory tree is generated for the target file in which the new data blocks for the overwritten data are inserted over any updated old data blocks of the first Merkle tree. 11. The computer program product of claim 10 wherein the deduplication backup system is executed by a data storage server running a Data Domain File System (DDFS). 12. The computer program product of claim 11 wherein the FCOW backups use one of Data Domain Boost application program interfaces (APIs) or REST APIs.

Assignees

Inventors

Classifications

  • by selection of backup contents · CPC title

  • G06F16/14Primary

    Details of searching files based on file metadata · CPC title

  • using de-duplication of the data · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12105594B2 cover?
Embodiments for processing fastcopy overwrite backup workloads (FCOW) as virtual synthetic backups, by detecting whether or not a backup workload comprises an FCOW workload in which a base file is fastcopied to a target file, wherein portions of the base file are overwritten through overwrites by new data in the target file by a deduplication backup process. For FCOW workloads, the process crea…
Who is the assignee on this patent?
Dell Products Lp
What technology area does this patent fall under?
Primary CPC classification G06F16/14. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Oct 01 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).