Capacity exhaustion prevention for distributed storage

US10067696B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-10067696-B2
Application numberUS-201615186576-A
CountryUS
Kind codeB2
Filing dateJun 20, 2016
Priority dateDec 18, 2015
Publication dateSep 4, 2018
Grant dateSep 4, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and processes to prevent storage capacity exhaustion within distributed storage systems using chunk-based storage. The systems and processes control capacity by classifying various types of storage chunks into groups and impose used capacity thresholds on chunk groups.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for use with a storage node of a distributed storage system having a plurality of storage nodes, the storage node having a plurality of storage devices, the method comprising: maintaining statistics on storage capacity used within the storage devices; receiving a request to allocate a storage chunk having a given chunk type; using the chunk type to determine a chunk group from a plurality of chunk groups, each of the chunk groups having an assigned used capacity threshold, wherein the plurality of chunk groups include a user group associated with user data and a system group associated with system data; determining a node used capacity based on the statistics; and allocating a new storage chunk within the plurality of storage devices only if the node used capacity is less than the chunk group used capacity threshold, wherein a used capacity threshold assigned to the system group is greater than a used capacity threshold assigned to the user group such that the system can store new system data but not new user data when the node used capacity exceeds the used capacity threshold assigned to the user group. 2. The method of claim 1 further comprising: returning an error indicating insufficient storage capacity if the node used capacity is greater than the chunk group used capacity threshold. 3. The method of claim 1 wherein the used capacity threshold assigned to the system group is 100%. 4. The method of claim 1 wherein the system data includes erasure coding data and XOR data. 5. The method of claim 1 further including using hysteresis to prevent frequent switching between accepting new user data and not accepting new user data. 6. The method of claim 5 wherein the chunk groups further include a replication group associated with replication data. 7. The method of claim 6 wherein a used capacity threshold assigned to the replication group is greater than a used capacity threshold assigned to the user group such that the system can replicate data but not store new user data when the node used capacity exceeds the used capacity threshold assigned to the user group. 8. A distributed storage system, comprising: a plurality of storage nodes each having one or more storage devices and being configured to: maintain statistics on storage capacity used within the storage devices; receive a request to allocate a storage chunk having a given chunk type; use the chunk type to determine a chunk group from a plurality of chunk groups, each of the chunk groups having an assigned used capacity threshold, wherein the chunk groups include a user group associated with user data and a system group associated with system data; determine a node used capacity based on the statistics; and allocate a new storage chunk within the plurality of storage devices only if the node used capacity is less than the chunk group used capacity threshold, wherein a used capacity threshold assigned to the system group is greater than a used capacity threshold assigned to the user group such that the system can store new system data but not new user data when the node used capacity exceeds the used capacity threshold assigned to the user group. 9. The distributed storage system of claim 8 wherein each of the plurality of storage nodes is further configured to return an error indicating insufficient storage capacity if the node used capacity is greater than the chunk group used capacity threshold. 10. The distributed storage system of claim 8 wherein the used capacity threshold assigned to the system group is 100%. 11. The distributed storage system of claim 8 wherein each of the plurality of storage nodes is further configured to use hysteresis to prevent frequent switching between accepting new user data and not accepting new user data. 12. The distributed storage system of claim 11 wherein the chunk groups further include a replication group associated with replication data. 13. The distributed storage system of claim 12 wherein a used capacity threshold assigned to the replication group is greater than a used capacity threshold assigned to the user group such that the system can replicate data but not store new user data when the node used capacity exceeds the used capacity threshold assigned to the user group.

Assignees

Inventors

Classifications

  • Management of space entities, e.g. partitions, extents, pools · CPC title

  • by facilitating the interaction with a user or administrator · CPC title

  • Lifecycle management · CPC title

  • G06F3/067Primary

    Distributed or networked storage systems, e.g. storage area networks [SAN], network attached storage [NAS] · CPC title

  • G06F3/0619Primary

    in relation to data integrity, e.g. data losses, bit errors · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10067696B2 cover?
Systems and processes to prevent storage capacity exhaustion within distributed storage systems using chunk-based storage. The systems and processes control capacity by classifying various types of storage chunks into groups and impose used capacity thresholds on chunk groups.
Who is the assignee on this patent?
Emc Corp
What technology area does this patent fall under?
Primary CPC classification G06F3/067. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 04 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).