What technology area does this patent fall under?

Primary CPC classification G06F16/215. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Sep 05 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Data node fencing in a distributed file system

US9753954B2 · US · B2

Patent metadata
Field	Value
Publication number	US-9753954-B2
Application number	US-201314024585-A
Country	US
Kind code	B2
Filing date	Sep 11, 2013
Priority date	Sep 14, 2012
Publication date	Sep 5, 2017
Grant date	Sep 5, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods for data node fencing in a distributed file system to prevent data inconsistencies and corruptions are disclosed. An embodiment includes implementing a protocol whereby data nodes detect a failover and determine an active name node based on transaction identifiers associated with transaction requests. The data nodes also provide to the active name node block location information and an acknowledgment. The embodiment further includes a protocol whereby a name node refrains from issuing invalidation requests to the data nodes until the name node receives acknowledgments from all data nodes that are functional.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for maintaining data correctness in a Hadoop™ based distributed cluster during a failover, in which an original name node is switched to a backup name node due to failure of the original name node, the distributed cluster having a plurality of data nodes and one or more processors, the method being performed by the one or more processors and comprising: on the backup name node: assuming an active role to become a new active name node, upon detecting that the original name node has failed; flagging all of the plurality of data nodes as untrusted; for each data node among the plurality of data nodes: queuing, instead of issuing, commands intended for a data node until the data node is flagged as trusted, and upon receiving an acknowledgement from the data node acknowledging the assumption of the active role of the backup name node, flagging the data node as trusted; and on a respective data node: receiving a first command with a first transaction number from a first name node; receiving a second command with a second transaction number from a second name node, wherein the second transaction number is greater than the first transaction number; and sending an acknowledgment of an active role to the second name node. 2. The method of claim 1 , further comprising: sending a message to the data node, wherein the message includes a most recent transaction identifier known to the backup name node assuming the active role. 3. The method of claim 1 , wherein commands on any block with replicated data on untrusted data nodes are queued. 4. The method of claim 1 , further comprising receiving a data report in addition to the acknowledgment of the active role from the data node. 5. The method of claim 4 , wherein the data report includes information regarding location of replicated data stored in the data node. 6. The method of claim 4 , wherein each data report includes a list of pending deletions. 7. A Hadoop™ based distributed cluster comprising an original name node, a backup name node, and a distributed file system having a plurality of data nodes, wherein one or more processors of the backup name node are configured to perform: assuming an active role to become a new active name node, upon detecting that the original name node has failed; flagging all of the plurality of data nodes as untrusted; for each data node among the plurality of data nodes: queuing, instead of issuing, commands intended for a data node until the data node is flagged as trusted; and upon receiving an acknowledgement from the data node acknowledging the assumption of the active role of the backup name node, flagging the data node as trusted, and wherein one or more processors of a respective data node are configured to perform; receiving a first command with a first transaction number from a first name node; receiving a second command with a second transaction number from a second name node, wherein the second transaction number is greater than the first transaction number; and sending an acknowledgment of an active role to the second name node. 8. A machine-readable storage medium having stored thereon instructions which, when executed by one or more processors, configure the processors to performs a method in a Hadoop™ based distributed cluster comprising a plurality of name nodes and a plurality of data nodes and having a distributed file system, the method comprising: on the backup name node: assuming an active role to become a new active name node, upon detecting that the original name node has failed; flagging all of the plurality of data nodes as untrusted; for each data node among the plurality of data nodes; queuing, instead of issuing, commands intended for a data node until the data node is flagged as trusted, and upon receiving an acknowledgement from the data node acknowledging the assumption of the active role of the backup name node, flagging the data node as trusted; and on a respective data node: receiving a first command with a first transaction number from a first name node; receiving a second command with a second transaction number from a second name node, wherein the second transaction number is greater than the first transaction number; and sending an acknowledgment of an active role to the second name node. 9. The cluster of claim 7 , wherein the data nodes are configured to ignore commands from other name nodes that issue commands having a transaction identifier lower than a transaction identifier associated with a command issued by the backup name node.

Assignees

Cloudera Inc

Inventors

Classifications

G06F16/1824
implemented using Network-attached Storage [NAS] architecture (distributed or networked storage systems G06F3/067; protocols for distributed storage of data in a network H04L67/1097) · CPC title
G06F11/2046
where the redundant components share persistent storage (G06F11/2043 takes precedence) · CPC title
G06F16/215Primary
Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors · CPC title
H04L41/0836
to enhance reliability, e.g. reduce downtime · CPC title
G06F11/2038
with a single idle spare processing component · CPC title

Patent family

Related publications grouped by family.

View patent family 50275524

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9753954B2 cover?: Systems and methods for data node fencing in a distributed file system to prevent data inconsistencies and corruptions are disclosed. An embodiment includes implementing a protocol whereby data nodes detect a failover and determine an active name node based on transaction identifiers associated with transaction requests. The data nodes also provide to the active name node block location informa…
Who is the assignee on this patent?: Cloudera Inc
What technology area does this patent fall under?: Primary CPC classification G06F16/215. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Sep 05 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).