Data replication in a storage system

US2022164109A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2022164109-A1
Application numberUS-202217668946-A
CountryUS
Kind codeA1
Filing dateFeb 10, 2022
Priority dateJun 24, 2019
Publication dateMay 26, 2022
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method of replication in a distributed storage system, performed by the distributed storage system is provided. The method includes managing a first index for data or metadata in a first storage system, the first storage system having a first partitioning scheme. The method includes managing a second index for data or metadata in a second storage system, the second storage system having a second partitioning scheme. The method includes replicating the data or metadata from the first storage system to the second storage system, translating an identifier of the data or metadata from the first storage system, and mapping the replicated data or metadata into the second partitioning scheme, via the translating of the identifier of the data or metadata from the first storage system.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method comprising: managing a first index for data in a first storage system, the first storage system having a first partitioning scheme; managing a second index for data in a second storage system, the second storage system having a second partitioning scheme differing from the first partitioning scheme; replicating the data from the first storage system to the second storage system; transforming one or more portions of an identifier of the data from the first storage system, to form a pointer into one of a plurality of partitions in the second partitioning scheme; and mapping the replicated data into the second partitioning scheme, via the transforming of the identifier of the data from the first storage system. 2 . The method of claim 1 , wherein the first partitioning scheme comprises chunking of objects, to a first data chunk size and the second partitioning scheme comprises chunking of objects, to a second, differing data chunk size. 3 . The method of claim 1 , wherein managing the first index in the first storage system comprises assigning ownership of portions of the data in the first storage system to a first plurality of authorities in the first storage system, to partition the data of the first storage system according to the first partitioning scheme and record the ownership by the first plurality of authorities in the first index, wherein the managing the second index in the first storage system comprises assigning ownership of portions of the data in the second storage system to a second plurality of authorities in the second storage system, differing in number from the first plurality of authorities. 4 . The method of claim 1 , wherein: the identifier is an object identifier and the mapping via the transformation maps objects in the first storage system, which are chunked to the first data chunk size, to chunks of the second data chunk size. 5 . The method of claim 1 , wherein the transforming comprises: performing an XOR of bits from an authority field of an identifier of the data from the first storage system with further bits of the identifier to form a pointer to one of a plurality of authorities in the second storage system, based on results of the XOR. 6 . The method of claim 1 , wherein: the first partitioning scheme comprises partitioning the data of the first storage system into a first plurality of partitions with locality for the data and associated metadata. 7 . The method of claim 1 , further comprising: streaming, in parallel, data of each of a plurality of partitions in the first storage system according to the first partitioning scheme, wherein the transforming and the mapping are performed in parallel across such streams. 8 . A tangible, non-transitory, computer-readable media having instructions thereupon which, when executed by a processor, cause the processor to perform a method comprising: managing a first index for data in a first storage system, the first storage system having a first partitioning scheme; managing a second index for data in a second storage system, the second storage system having a second partitioning scheme differing from the first partitioning scheme; replicating the data from the first storage system to the second storage system; transforming one or more portions of an identifier of the data from the first storage system, to form a pointer into one of a plurality of partitions in the second partitioning scheme; and mapping the replicated data into the second partitioning scheme, via the transforming of the identifier of the data from the first storage system. 9 . The computer-readable media of claim 8 , wherein the first partitioning scheme comprises chunking of objects, to a first data chunk size and the second partitioning scheme comprises chunking of objects, to a second, differing data chunk size. 10 . The computer-readable media of claim 8 , wherein: the first storage system has a first plurality of authorities each owning a portion of the data of the first storage system in accordance with the first partitioning scheme and wherein the second storage system has a second plurality of authorities, differing in number from the first plurality of authorities and each owning a portion of the data of the second storage system in accordance with the second partitioning scheme. 11 . The computer-readable media of claim 8 , wherein managing the first index in the first storage system comprises assigning ownership of portions of the data in the first storage system to a first plurality of authorities in the first storage system, to partition the data of the first storage system according to the first partitioning scheme and record the ownership by the first plurality of authorities in the first index. 12 . The computer-readable media of claim 8 , wherein the transforming comprises: performing an XOR of bits from an authority field of an identifier of the data from the first storage system with further bits of the identifier, the authority field identifying one of a first plurality of authorities in the first storage system and forming a pointer to one of a second plurality of authorities in the second storage system, based on results of the XOR, the second plurality of authorities differing in number from the first plurality of authorities. 13 . The computer-readable media of claim 8 , wherein the transforming preserves locality for data and associated metadata, from a first plurality of partitions in the first storage system according to the first partitioning scheme, mapping into a second plurality of partitions in the second storage system according to the second partitioning scheme. 14 . The computer-readable media of claim 8 , wherein the method further comprises: forming a plurality of streams in parallel, one stream per each of a plurality of partitions in the first storage system according to the first partitioning scheme, wherein the performing the transformation and the using the result of the transformation to map are performed in parallel across the plurality of streams. 15 . A storage system, comprising: storage memory; and one or more processors, configurable to: managing a first index for data in a first storage system, the first storage system having a first partitioning scheme; managing a second index for data in a second storage system, the second storage system having a second partitioning scheme differing from the first partitioning scheme; replicating the data from the first storage system to the second storage system; transforming one or more portions of an identifier of the data from the first storage system, to form a pointer into one of a plurality of partitions in the second partitioning scheme; and mapping the replicated data into the second partitioning scheme, via the transforming of the identifier of the data from the first storage system. 16 . The storage system of claim 15 , wherein to map into the second partitioning scheme, using the result of the transforming comprises using the pointer to map the data from the storage system into the one of the plurality of partitions in the second storage system in accordance with the second partitioning scheme. 17 . The storage system of claim 15 , wherein: the one or more processors are configurable to assign ownership of portions of the data to a first plurality of authorities in accordance with the first partitioning scheme and wherein the second storage system has a second plurality of authorities, differing in number from the first plurality of authorities, an

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2022164109A1 cover?
A method of replication in a distributed storage system, performed by the distributed storage system is provided. The method includes managing a first index for data or metadata in a first storage system, the first storage system having a first partitioning scheme. The method includes managing a second index for data or metadata in a second storage system, the second storage system having a sec…
Who is the assignee on this patent?
Pure Storage Inc
What technology area does this patent fall under?
Primary CPC classification G06F12/0246. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu May 26 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).