Methods and systems to reduce latency of input/output (i/o) operations based on file system optimizations during creation of common snapshots for synchronous replicated datasets of a primary copy of data at a primary storage system to a mirror copy of the data at a cross-site secondary storage system

US2024143554A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2024143554-A1
Application numberUS-202218148705-A
CountryUS
Kind codeA1
Filing dateDec 30, 2022
Priority dateOct 28, 2022
Publication dateMay 2, 2024
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Multi-site distributed storage systems and computer-implemented methods are described for improving a resumption time of input/output (I/O) operations during a common snapshot process for storage objects. A computer-implemented method comprises performing a baseline transfer from at least one storage object of a first storage node to at least one replicated storage object of a second storage node, starting the common snapshot process including stop processing of I/O operations, performing a snapshot create operation on the primary storage site for the at least one storage object of the first storage node, resuming processing of I/O operations, and assigning a new universal unique identifier (UUID) to the at least one storage object of the second storage node after resuming processing of I/O operations with the new UUID to identify when file system contents are different than the baseline transfer.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computer-implemented method for reducing a resumption time of processing of input/output (I/O) operations during a common snapshot process performed by one or more processors of a multi-site distributed storage system with a primary storage site having a first storage node and a secondary storage site having a second storage node, the computer-implemented method comprising: establishing a synchronous replication relationship between at least one storage object of the first storage node of the primary storage site and at least one storage object of the second storage node of the secondary storage site; performing a baseline transfer from the at least one storage object of the first storage node to the at least one storage object of the second storage node; starting the common snapshot process including initiating hold state for the primary storage site to stop processing of I/O operations during a time window; performing a snapshot create operation on the primary storage site for the at least one storage object of the first storage node and sending the snapshot create operation to the secondary storage site to be performed on the at least one storage object of the second storage node of the secondary storage site; resuming processing of I/O operations and ending the hold state for the primary storage site; and assigning a new active file system (AFS) version universal unique identifier (UUID) to the at least one storage object of the second storage node after resuming processing of I/O operations with the new AFS version UUID to identify when AFS contents are different than the baseline transfer for synchronous replication between the primary storage site and the secondary storage site. 2 . The computer-implemented method of claim 1 , wherein assigning the new AFS version UUID occurs during a delete workflow to remove the synchronous replication relationship for the at least one storage object of the first storage node and the at least one storage object of the second storage node and guarantees that any subsequent update for an asynchronous replication relationship or resync transfer will detect a file system inconsistency between the baseline transfer between the primary storage site and the secondary storage site and the AFS contents. 3 . The computer-implemented method of claim 2 , further comprising: converting from the synchronous replication relationship to an asynchronous relationship from the at least one storage object of the first storage node to the at least one storage object of the second storage node; and initiating an asynchronous resynchronous workflow from the at least one storage object of the first storage node to the at least one storage object of the second storage node. 4 . The computer-implemented method of claim 3 , further comprising: detecting AFS divergence post the common snapshot process when AFS contents are different than the baseline transfer; performing a restore operation to remove file system inconsistencies due to the AFS divergence; and performing asynchronous transfers from the at least one storage object of the first storage node to the at least one storage object of the second storage node. 5 . The computer-implemented method of claim 1 , wherein the AFS UUID can be a multibit value to uniquely identify the storage object. 6 . The computer-implemented method of claim 1 , wherein assigning the new active file system (AFS) version universal unique identifier (UUID) to the at least one storage object of the second storage node after resuming processing of I/O operations reduces the resumption time of processing of input/output (I/O) operations during the common snapshot process. 7 . A distributed storage system having a primary storage site with a first storage node and a secondary storage site with a second storage node comprising: a processing resource; and a non-transitory computer-readable medium coupled to the processing resource, having stored therein instructions, which when executed by the processing resource cause the processing resource to: initiate a synchronous replication process for a file system including starting a synchronous data replication lifecycle for replicating data from a first storage object of a first storage node of the first storage site to a second storage object of a second storage node of second storage site; perform baseline and asynchronous data transfers from the first storage object to the second storage object including a baseline transfer that creates a snapshot copy of the first storage object and transfers the snapshot copy to the second storage object; asynchronously transferring new snapshot copies from the first storage object to the second storage object; and add a transient tag to a snapshot tag meta file for the first storage object of the first storage node and optionally also for the second storage object of the second storage node to grow the snapshot tag meta file during the synchronous replication process before fencing input/output (I/O) operations for the first storage object and the second storage object during a snapshot create request. 8 . The distributed storage system of claim 7 , wherein the instructions when executed by the processing resource cause the processing resource to: perform, the snapshot create request with aggregate affinity and thus fully utilize multithreading of processing resources in the file system. 9 . The distributed storage system of claim 7 , wherein adding the transient tag to a snapshot tag meta file for the first storage object of the first storage node and optionally also for the second storage object of the second storage node during the synchronous replication process before fencing I/O operations moves serial operations in the file system out of a client I/O hold window. 10 . The distributed storage system of claim 7 , wherein the instructions when executed by the processing resource cause the processing resource to: set a sync replication bit for the first storage object of the first storage node and also for the second storage object. 11 . The distributed storage system of claim 7 , wherein the instructions when executed by the processing resource cause the processing resource to: remove the transient tag from the snapshot tag meta file if a common snapshot for the snapshot create request occurs during a configurable time period such that the common snapshot is available for a subsequent resync operation. 12 . The distributed storage system of claim 7 , wherein the instructions when executed by the processing resource cause the processing resource to: store the transient tag in the snapshot tag meta file if a common snapshot for the snapshot create request did not occur during a configurable time period. 13 . The distributed storage system of claim 7 , wherein the instructions when executed by the processing resource cause the processing resource to: establish an insync state to indicate a synchronous replication for a data replication relationship between the first storage object and the second storage object. 14 . A non-transitory computer-readable storage medium embodying a set of instructions, which when executed by a processing resource cause the processing resource to: initiate a synchronous replication process for a file system including starting a synchronous data replication lifecycle for replicating data from a first storage object of a first storage node of a first storage site to a second storage object of a second storage node of a second storage site; perform baseline and asynchronous data transfers from the first storage

Assignees

Inventors

Classifications

  • in relation to data integrity, e.g. data losses, bit errors · CPC title

  • Distributed or networked storage systems, e.g. storage area networks [SAN], network attached storage [NAS] · CPC title

  • Replication mechanisms · CPC title

  • G06F16/178Primary

    Techniques for file synchronisation in file systems · CPC title

  • Details of file system snapshots on the file-level, e.g. snapshot creation, administration, deletion (error detection or correction of the data by redundancy in operations or in hardware G06F11/14, G06F11/16) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2024143554A1 cover?
Multi-site distributed storage systems and computer-implemented methods are described for improving a resumption time of input/output (I/O) operations during a common snapshot process for storage objects. A computer-implemented method comprises performing a baseline transfer from at least one storage object of a first storage node to at least one replicated storage object of a second storage no…
Who is the assignee on this patent?
Netapp Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/178. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu May 02 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).