What technology area does this patent fall under?

Primary CPC classification G06F16/116. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Feb 21 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Distributed file system gateway

US9575974B2 · US · B2

Patent metadata
Field	Value
Publication number	US-9575974-B2
Application number	US-201314137706-A
Country	US
Kind code	B2
Filing date	Dec 20, 2013
Priority date	Oct 23, 2013
Publication date	Feb 21, 2017
Grant date	Feb 21, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Technology is disclosed for managing data in a distributed file system (“the technology”). The technology can gather metadata information associated with the data stored within a first file system, store the metadata information in association with a data identifier within a second file system, retrieve the stored metadata information using the data identifier from within the second file system and locate and retrieve the data associated with the metadata information from within first file system.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: receiving a request, by a data node server, to access data through a first distributed file system, the request including a first data identifier associated with the data for accessing the data through the first distributed file system; wherein the received first data identifier is used by the data node server to determine that data associated with the first identifier has been evicted from the first distributed file system; identifying, utilizing the received first data identifier, a second data identifier associated with the data identified as being evicted from the first file system, the second data identifier being stored within the first distributed file system and utilized for generating a request for accessing the data evicted from the first file system and stored within a second distributed file system; retrieving, utilizing the identified second data identifier, the data from within the second distributed file system; removing any indication that the data retrieved from the second distributed file system has been evicted from the first distributed file system; converting the retrieved data from a first format of the second distributed file system to a second format of the first distributed file system; storing, by a gateway manager in the data node server, the converted data within the first distributed file system, wherein the stored data is retrieved from within the first distributed file system in response to another request for the data; and providing access to the converted retrieved data through the first distributed file system. 2. The method of claim 1 , further comprising: receiving a request to store the data within the first file system, the request to store the data including the second data identifier associated with the data; storing the second data identifier associated with the data within the first distributed file system; and storing information indicative of the data being evicted from the first distributed file system. 3. The method of claim 1 , wherein the first distributed file system and the second distributed file system are different types of file systems. 4. The method of claim 3 , wherein the first distributed file system is a Hadoop Distributed File System (HDFS), wherein the second distributed file system is a Network File System (NFS). 5. The method of claim 4 , wherein the NFS stores the data in the first format, wherein the first format is a file level format and the HDFS stores the data in the second format, wherein the second format is a block level format. 6. The method of claim 5 , wherein the first data identifier includes a data block identifier (ID), wherein the second data identifier includes a file pathname. 7. The method of claim 1 , wherein the second data identifier includes metadata information that enables the first distributed file system to locate and retrieve the requested data from the second distributed file system. 8. The method of claim 5 , wherein the second data identifier stored in the first distributed file system includes one or more sub data identifiers, each of the sub data identifiers corresponding to a portion of the data stored within the second distributed file system. 9. A system, comprising: a data node to receive a request to copy data from a first distributed file system to a second distributed file system; a gateway client to gather metadata information associated with the data stored within the first distributed file system, the metadata information including information to locate and retrieve the requested data from within the first distributed file system; wherein the gateway client marks data identifiers of data blocks as being evicted when metadata information is stored within the first distributed file system but data associated with the data identifiers is stored at the second distributed file system; a gateway manager to: store the gathered metadata information in association with a data identifier within the second distributed file system, the data identifier being used to request access to the data via the second distributed file system; a chunk store manager to convert the retrieved data from a first format to a second format; the gateway manager further to store the converted data within the second file system, wherein the stored data can be retrieved from within the second distributed file system in response to another request for the data; wherein the gateway manager in response to the another request determines that data identifier in the other request is marked as being evicted from the first distributed file system, retrieves metadata information from the first distributed file system; and generates a request for the data using the retrieved metadata information to request the data from the second distributed file system; and the data node further to send a confirmation indicating a completion of the data copy request; wherein the data node stores. 10. The system of claim 9 , further comprising: the data node to receive a request to access the data through the second distributed file system, the request including the data identifier; the gateway manager to: utilize the received data identifier to gather the metadata information, associated with the requested data, from within the second distributed file system, and provide access to the retrieved data through the second distributed file system; and the gateway client to utilize the gathered metadata information to retrieve the data from within the first distributed file system. 11. The system of claim 9 , wherein the first distributed file system and the second distributed file system are different types of file systems. 12. The system of claim 11 , wherein the first distributed file system is a Network File System (NFS), wherein the second distributed file system is a Hadoop Distributed File System (HDFS). 13. The system of claim 12 , wherein the data identifier includes a data block identifier (ID), wherein the metadata information include a file pathname. 14. A non-transitory computer readable storage medium storing computer executable instructions, comprising: instructions for receiving a request to access data through a first distributed file system, the request including a first data identifier associated with the data for accessing the data through the first distributed file system; wherein the received first data identifier is used by the data node server to determine that data associated with the first identifier has been evicted from the first distributed file system; instructions for identifying, utilizing the received first data identifier, a second data identifier associated with the data identified as being evicted from the first file system, the second data identifier being stored within the first distributed file system and utilized for generating a request for accessing the data evicted from the first file system and stored within a second distributed file system; instructions for retrieving, utilizing the identified second data identifier, the data from within the second distributed file system; instructions for removing any indication that the data retrieved from the second distributed file system has been evicted from the first distributed file system; instructions for converting the retrieved data from a first format of the second distributed file system to a second format of the first distributed file system; instructions for storing the converted data within the first distributed file system, wherein the stored data can be retrieved from within the first distributed file system in response to anothe

Assignees

Netapp Inc

Inventors

Classifications

G06F16/116Primary
Details of conversion of file system types or formats · CPC title
G06F16/1865
Transactional file systems · CPC title
G06F16/18
File system types · CPC title
G06F16/182
Distributed file systems · CPC title
G06F16/134
Distributed indices · CPC title

Patent family

Related publications grouped by family.

View patent family 52827138

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9575974B2 cover?: Technology is disclosed for managing data in a distributed file system (“the technology”). The technology can gather metadata information associated with the data stored within a first file system, store the metadata information in association with a data identifier within a second file system, retrieve the stored metadata information using the data identifier from within the second file system…
Who is the assignee on this patent?: Netapp Inc
What technology area does this patent fall under?: Primary CPC classification G06F16/116. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Feb 21 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

Replication and restoration

Genomic application data storage

Data management in distributed file systems

Distributed lock manager for file system objects in a shared file system

Frequently asked questions