System and method for adaptive data placement within a distributed file system

US9710478B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9710478-B2
Application numberUS-201414268059-A
CountryUS
Kind codeB2
Filing dateMay 2, 2014
Priority dateMay 2, 2014
Publication dateJul 18, 2017
Grant dateJul 18, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A system and method for adaptive data placement in a distributed file system is provided. Upon creation of a new subdirectory, a first heuristic procedure is formed to determine whether the newly created subdirectory should be created locally or on a remote flexible volume. Should it be determination be made to create or store the subdirectory on a remote flexible volume, a second heuristic procedure determines which of a plurality of flexible volumes should store the newly created subdirectory.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: receiving a command to create a data container; assigning, by a remote access module, the data container to one of a plurality of categories, wherein the plurality of categories comprises data containers being created off of a root directory, data containers being created off directory that is within a root directory and data containers being created off directory that is within a root directory; calculating, by the remote access module, an urgency value for the data container; calculating, by the remote access module, a disparity value for the data container; calculating, by the remote access module, a local factor for the data container; determining, by the remote access module, whether the data container is to be stored remotely, according to one of the plurality of categories, and the local factor; and in response to determining that the data container is to be stored remotely, determining, by the remote access module, one of a plurality of remote storage containers to store the data container by calculating a recent percentage for each of the plurality of remote storage containers, calculating a target percentage for each of the plurality of remote storage containers and selecting one of the plurality of remote storage containers having a greatest difference between the calculated recent percentage and the calculated target percentage. 2. The method of claim 1 wherein the data container comprises a subdirectory. 3. The method of claim 1 wherein the storage containers comprise flexible volumes. 4. The method of claim 1 wherein, in response to determining that the data container is not to be stored remotely, creating the data container on a storage container locally connected to a node. 5. The method of claim 1 wherein the local factor is calculated using the urgency value and the disparity value. 6. The method of claim 5 wherein the local factor is calculated by subtracting from a predefined value a value formed by multiplying the urgency value by the disparity value. 7. A network device comprising: a memory containing machine readable medium comprising medium executable code having stored thereon instructions for performing a method of adaptive data placement of a data container; a processing module coupled to the memory, the processor module configured to execute the machine executable code to: receive a command to create a data container; assign the data container to one of a plurality of categories, wherein the plurality of categories comprises data containers being created off of a root directory, data containers being created off directory that is within a root directory and data containers being created off directory that is within a root directory; calculate an urgency value for the data container; calculate a disparity value for the data container; calculate a local factor for the data container; determine whether the data container is to be stored remotely, according to one of the plurality of categories, and the local factor; and in response to determining that the data container is to be stored remotely, determine, by the remote access module, one of a plurality of remote storage containers to store the data container by calculating a recent percentage for each of the plurality of remote storage containers, calculating a target percentage for each of the plurality of remote storage containers and selecting one of the plurality of remote storage containers having a greatest difference between the calculated recent percentage and the calculated target percentage. 8. The network device of claim 7 wherein the one or more remote storage containers comprise flexible volumes. 9. The network device of claim 7 wherein the data container comprises a subdirectory. 10. The network device of claim 7 wherein the processor module is further configured to execute the machine executable code to, in response to determining that the data container is not to be stored on the at least one storage containers stored on storage devices associated with the node, determine on which of the one or more remote storage containers the data container is to be created. 11. The network device of claim 10 wherein the processor module is further configured to execute the machine executable code to query each other node of the cluster via the network to obtain information to calculate the recent percentage value. 12. A non-transitory machine readable medium having stored thereon instructions for performing a method of adaptive data placement of a data container, comprising machine executable code which, when executed by at least one machine, causes the machine to: receive a command to create a data container; assign the data container to one of a plurality of categories, wherein the plurality of categories comprises data containers being created off of a root directory, data containers being created off directory that is within a root directory and data containers being created off directory that is within a root directory; calculate an urgency value for the data container; calculate a disparity value for the data container; calculate a local factor for the data container; determine whether the data container is to be stored remotely, according to one of the plurality of categories, and the local factor; and in response to determining that the data container is to be stored remotely, determine one of a plurality of remote storage containers to store the data container by calculating a recent percentage for each of the plurality of remote storage containers, calculating a target percentage for each of the plurality of remote storage containers and selecting one of the plurality of remote storage containers having a greatest difference between the calculated recent percentage and the calculated target percentage. 13. The non-transitory machine readable medium of claim 12 wherein the one or more remote storage containers comprise flexible volumes.

Assignees

Inventors

Classifications

  • using directory or table look-up (use of a directory or look-up table in file systems G06F16/13) · CPC title

  • G06F16/16Primary

    File or folder operations, e.g. details of user interfaces specifically adapted to file systems · CPC title

  • Indexing; Web crawling techniques · CPC title

  • Distributed file systems · CPC title

  • Distributed indices · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9710478B2 cover?
A system and method for adaptive data placement in a distributed file system is provided. Upon creation of a new subdirectory, a first heuristic procedure is formed to determine whether the newly created subdirectory should be created locally or on a remote flexible volume. Should it be determination be made to create or store the subdirectory on a remote flexible volume, a second heuristic pro…
Who is the assignee on this patent?
Netapp Inc
What technology area does this patent fall under?
Primary CPC classification G06F16/16. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 18 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).