Systems and methods for distributive data storage

US2020409565A1 · US · A1

Patent metadata
Field               Value
Publication number  US-2020409565-A1
Application number  US-202017020096-A
Country             US
Kind code           A1
Filing date         Sep 14, 2020
Priority date       Jan 4, 2017
Publication date    Dec 31, 2020
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Some embodiments can include a system. In many embodiments, a system can comprise one or more processors and one or more non-transitory storage devices storing computing instructions configured to run on the one or more processors and perform: dividing a computer file into two or more portions; calculating a respective unique identifier for each respective portion of the two or more portions; creating two or more respective computer file portion replicas of each respective portion of the two or more portions, wherein each respective computer file portion replica of the two or more respective computer file portion replicas: can be for a corresponding portion of the two or more portions; and can comprise a same one of the respective unique identifier as the corresponding portion of the two or more portions; and facilitating storage of the two or more respective computer file portion replicas on a distributive data storage cluster. Other embodiments are disclosed herein.
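
The steps in the abstract (divide, identify, replicate, store) can be illustrated with a minimal Python sketch. This is not the patent's implementation: the fixed portion size, the use of SHA-256 as the unique identifier, the round-robin placement rule, and the in-memory dictionaries standing in for storage nodes are all assumptions made for illustration, as are the function names.

```python
import hashlib


def split_into_portions(data: bytes, portion_size: int) -> list[bytes]:
    """Divide a file's bytes into consecutive fixed-size portions (assumed scheme)."""
    if portion_size <= 0:
        raise ValueError("portion_size must be positive")
    return [data[i:i + portion_size] for i in range(0, len(data), portion_size)]


def portion_id(portion: bytes) -> str:
    """Assumed identifier: the SHA-256 hex digest of the portion's content."""
    return hashlib.sha256(portion).hexdigest()


def make_replicas(portions: list[bytes], replication_factor: int) -> list[dict]:
    """Create replication_factor replicas per portion; each carries the portion's id."""
    replicas = []
    for index, portion in enumerate(portions):
        uid = portion_id(portion)
        for copy in range(replication_factor):
            replicas.append({"index": index, "copy": copy, "id": uid, "data": portion})
    return replicas


def store(replicas: list[dict], nodes: list[dict]) -> None:
    """Hypothetical placement: copy c of portion i goes to node (i + c) mod len(nodes)."""
    for replica in replicas:
        node = nodes[(replica["index"] + replica["copy"]) % len(nodes)]
        node[replica["id"]] = replica["data"]


if __name__ == "__main__":
    file_bytes = b"abcdefghij"
    portions = split_into_portions(file_bytes, 4)
    replicas = make_replicas(portions, replication_factor=2)
    cluster = [{}, {}, {}]  # three in-memory stand-ins for storage nodes
    store(replicas, cluster)
    print(len(portions), len(replicas), [len(node) for node in cluster])
```

A content hash gives identical portions identical identifiers, so a scheme that needs a distinct identifier for every portion would have to add something else, such as the portion index; the sketch leaves that out.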

First claim

Opening claim text (preview).

We claim:

1. A system comprising: one or more processors; and one or more non-transitory storage devices storing computing instructions configured to run on the one or more processors and perform: dividing a computer file into two or more portions; calculating a respective unique identifier for each respective portion of the two or more portions; creating two or more respective computer file portion replicas of each respective portion of the two or more portions, wherein each respective computer file portion replica of the two or more respective computer file portion replicas: is for a corresponding portion of the two or more portions; and comprises a same one of the respective unique identifier as the corresponding portion of the two or more portions; and facilitating storage of the two or more respective computer file portion replicas on a distributive data storage cluster.

2. The system of claim 1, wherein each respective unique identifier comprises a respective portion hash value, a respective label, respective metadata, or a respective metadata tag.

3. The system of claim 1, wherein the one or more non-transitory storage devices storing the computing instructions are further configured to run on the one or more processors and perform: receiving, from an electronic device of a user, a request to utilize the computer file; sending a request to retrieve the computer file from the distributive data storage cluster; receiving, from the distributive data storage cluster, a set of the two or more respective computer file portion replicas, wherein the set of the two or more computer file portion replicas, when combined, comprises the computer file; and transmitting instructions allowing the electronic device of the user to utilize the computer file.

4. The system of claim 3, wherein the one or more non-transitory storage devices storing the computing instructions are further configured to run on the one or more processors and perform: calculating a respective retrieved unique identifier for each respective portion of the two or more portions associated with the set of the two or more respective computer file portion replicas, as received; and determining that the respective retrieved unique identifier for each respective portion of the two or more portions associated with the set of the two or more respective computer file portion replicas, as received, matches the respective unique identifier for each respective portion of the two or more portions, thereby confirming that the two or more respective computer file portion replicas, as received, have not been corrupted.

5. The system of claim 3, wherein the one or more non-transitory storage devices storing the computing instructions are further configured to run on the one or more processors and perform: merging the set of the two or more computer file portion replicas, as received, into the computer file.

6. The system of claim 1, wherein at least one respective computer file portion replica of the two or more respective computer file portion replicas comprises a fastest responding computer file portion replica.

7. The system of claim 1, wherein the computer file comprises an image file.

8. The system of claim 1, wherein creating the two or more respective computer file portion replicas of each respective portion of the two or more portions comprises: replicating each respective portion of the two or more portions by a respective replication factor based, at least in part, on a number of the two or more storage nodes.

9. The system of claim 8, wherein the respective replication factor is determined using an equation comprising: Q = N/2 + 1, wherein: Q comprises the number of the two or more storage nodes; and N comprises the respective replication factor.

10. The system of claim 1, wherein the respective unique identifier for each respective portion of the two or more portions is different for each respective portion of the two or more portions.

11. A method implemented via execution of computing instructions configured to run at one or more processors and configured to be stored at non-transitory computer-readable media, the method comprising: dividing a computer file into two or more portions; calculating a respective unique identifier for each respective portion of the two or more portions; creating two or more respective computer file portion replicas of each respective portion of the two or more portions, wherein each respective computer file portion replica of the two or more respective computer file portion replicas: is for a corresponding portion of the two or more portions; and comprises a same one of the respective unique identifier as the corresponding portion of the two or more portions; and facilitating storage of the two or more respective computer file portion replicas on a distributive data storage cluster.

12. The method of claim 11, wherein each respective unique identifier comprises a respective portion hash value, a respective label, respective metadata, or a respective metadata tag.

13. The method of claim 11, wherein the one or more non-transitory storage devices storing the computing instructions are further configured to run on the one or more processors and perform: receiving, from an electronic device of a user, a request to utilize the computer file; sending a request to retrieve the computer file from the distributive data storage cluster; receiving, from the distributive data storage cluster, a set of the two or more respective computer file portion replicas, wherein the set of the two or more computer file portion replicas, when combined, comprises the computer file; and transmitting instructions allowing the electronic device of the user to utilize the computer file.

14. The method of claim 13, wherein the one or more non-transitory storage devices storing the computing instructions are further configured to run on the one or more processors and perform: calculating a respective retrieved unique identifier for each respective portion of the two or more portions associated with the set of the two or more respective computer file portion replicas, as received; and determining that the respective retrieved unique identifier for each respective portion of the two or more portions associated with the set of the two or more respective computer file portion replicas, as received, matches the respective unique identifier for each respective portion of the two or more portions, thereby confirming that the two or more respective computer file portion replicas, as received, have not been corrupted.

15. The method of claim 13, wherein the one or more non-transitory storage devices storing the computing instructions are further configured to run on the one or more processors and perform: merging the set of the two or more computer file portion replicas, as received, into the computer file.

16. The method of claim 11, wherein at least one respective computer file portion replica of the two or more respective computer file portion replicas comprises a fastest responding computer file portion replica.

17. The method of claim 11, wherein the computer file comprises an imag…
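
The retrieval-side steps in the claims (recomputing each received portion's identifier, comparing it with the stored identifier, then merging the portions back into the file) can be sketched as follows. This is only an illustrative sketch: it assumes the identifier is a SHA-256 content hash (one of several identifier types the claims allow), and the function names are hypothetical rather than taken from the patent.

```python
import hashlib


def portion_id(portion: bytes) -> str:
    """Assumed identifier: the SHA-256 hex digest of the portion's content."""
    return hashlib.sha256(portion).hexdigest()


def verify_and_merge(received: list[bytes], expected_ids: list[str]) -> bytes:
    """Check each received portion against its expected id, then merge in order.

    Raises ValueError if a portion is missing or its recomputed id does not
    match, which is treated here as evidence of corruption.
    """
    if len(received) != len(expected_ids):
        raise ValueError("number of received portions does not match the manifest")
    for index, (portion, expected) in enumerate(zip(received, expected_ids)):
        if portion_id(portion) != expected:
            raise ValueError(f"portion {index} failed its integrity check")
    # Portions are assumed to arrive in file order; merging is concatenation.
    return b"".join(received)


if __name__ == "__main__":
    original = [b"abcd", b"efgh", b"ij"]
    manifest = [portion_id(p) for p in original]
    print(verify_and_merge(original, manifest))
    try:
        verify_and_merge([b"abcX", b"efgh", b"ij"], manifest)
    except ValueError as err:
        print("rejected:", err)
```

In a real cluster a failed check would more likely trigger a fetch of another replica of the same portion than an outright error; the sketch stops at detection to keep the comparison step visible.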

Assignees

Walmart Apollo LLC

Inventors

Classifications

  • G06F16/51 (Primary)

    Indexing; Data structures therefor; Storage structures

  • Replication mechanisms

  • Distributed or networked storage systems, e.g. storage area networks [SAN], network attached storage [NAS]

  • in relation to data integrity, e.g. data losses, bit errors

  • Command handling arrangements, e.g. command buffers, queues, command scheduling

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2020409565A1 cover?
Some embodiments can include a system. In many embodiments, a system can comprise one or more processors and one or more non-transitory storage devices storing computing instructions configured to run on the one or more processors and perform: dividing a computer file into two or more portions; calculating a respective unique identifier for each respective portion of the two or more portions; c…
Who is the assignee on this patent?
Walmart Apollo LLC
What technology area does this patent fall under?
Primary CPC classification G06F16/51. Mapped technology areas include Physics.
When was this patent published?
Publication date: Dec 31, 2020 (kind code A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).