Online file system consistency check for container data on a clustered filesystem

US11650975B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11650975-B2
Application numberUS-202017119009-A
CountryUS
Kind codeB2
Filing dateDec 11, 2020
Priority dateDec 11, 2020
Publication dateMay 16, 2023
Grant dateMay 16, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Online file system consistency check for container data on a clustered file system is provided via identifying inodes (index nodes) of a group of files in a clustered file system based on a cyber-resiliency for the clustered file system; grouping the inodes based on a buffer size allocated to a FSCK (File System Consistency Check) operation; passing the inodes of to the FSCK operation in a single iteration when a total size of the inodes is less than the buffer size; or when the total size of the inodes is greater than the buffer size, identifying inodes that belong to a first container and that belong to a second container; passing the inodes that belong to the first container to the FSCK operation in a first iteration; and passing, after the first iteration completes, the inodes that belong to the second container to the FSCK operation in a second iteration.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: identifying inodes of a first group of files in a clustered file system based on a cyber-resiliency for the clustered file system; grouping the inodes based on a buffer size allocated to a FSCK (File System Consistency Check) operation for the clustered file system; in response to the buffer size being greater than a total size of the inodes of the first group of files, passing the inodes of the first group of files to the FSCK operation in a single iteration; in response to the buffer size being less than a total size of the inodes of a second group of files: identifying the inodes of the second group of files that belong to a first container and that belong to a second container; passing the inodes of the second group of files that belong to the first container to the FSCK operation in a first iteration; and passing, after the first iteration completes, the inodes of the second group of files that belong to the second container to the FSCK operation in a second iteration. 2. The method of claim 1 , further comprising: identifying the second group of files in the clustered file system based on the cyber-resiliency for the clustered file system; separating inodes of the second group of files; and grouping the inodes of the second group of files based on the buffer size allocated to the FSCK operation for the clustered file system. 3. The method of claim 1 , wherein identifying the first group of files in the clustered file system identifies the files that are important to the cyber-resiliency of the clustered file system further comprises: monitoring file operations captured; analyzing frequency patterns for how often the files are generated or updated by a given container; analyzing access patterns of how many other containers, besides the given container, access the files; and analyzing operational patterns for what types of operations are performed on the files by the other containers. 4. The method of claim 1 , wherein identifying the first group of files in the clustered file system identifies the files that are important to the cyber-resiliency of the clustered file system further comprises: analyzing a time of last access for a given file; and marking as the given file as one of passive or active based on whether the time of last access is within a threshold. 5. The method of claim 1 , further comprising: selecting a number of Erasure Code (EC) fragments to validate for each of the files passed to the FSCK operation; and scanning the files based on priority levels of the files. 6. The method of claim 5 , wherein the number of EC fragments are dynamically selected based on system load and resources allocated to the FSCK operation. 7. The method of claim 1 , wherein the first group of files consists of files assigned a priority level above a priority threshold for cyber-resiliency for the clustered file system. 8. A system, comprising: a processor; and a memory storing instructions that when executed by the processor enable performance of an operation that includes: identifying inodes of a first group of files in a clustered file system based on a cyber-resiliency for the clustered file system; grouping the inodes based on a buffer size allocated to a FSCK (File System Consistency Check) operation for the clustered file system; in response to the buffer size being greater than a total size of the inodes of the first group of files, passing the inodes of the first group of files to the FSCK operation in a single iteration; in response to the buffer size being less than a total size of the inodes of a second group of files: identifying the inodes of the second group of files that belong to a first container and that belong to a second container; passing the inodes of the second group of files that belong to the first container to the FSCK operation in a first iteration; and passing, after the first iteration completes, the inodes of the second group of files that belong to the second container to the FSCK operation in a second iteration. 9. The system of claim 8 , wherein the operation further includes: identifying the second group of files in the clustered file system based on the cyber-resiliency for the clustered file system; separating inodes of the second group of files; and grouping the inodes of the second group of files based on the buffer size allocated to the FSCK operation for the clustered file system. 10. The system of claim 8 , wherein identifying the first group of files in the clustered file system identifies the files that are important to the cyber-resiliency of the clustered file system further comprises: monitoring file operations captured; analyzing frequency patterns for how often the files are generated or updated by a given container; analyzing access patterns of how many other containers, besides the given container, access the files; and analyzing operational patterns for what types of operations are performed on the files by the other containers. 11. The system of claim 8 , wherein identifying the first group of files in the clustered file system identifies the files that are important to the cyber-resiliency of the clustered file system further comprises: analyzing a time of last access for a given file; and marking as the given file as one of passive or active based on whether the time of last access is within a threshold. 12. The system of claim 8 , further comprising: selecting a number of Erasure Code (EC) fragments to validate for each of the files passed to the FSCK operation; and scanning the files based on priority levels of the files. 13. The system of claim 12 , wherein the number of EC fragments are dynamically selected based on system load and resources allocated to the FSCK operation. 14. The system of claim 8 , wherein the first group of files consists of files assigned a priority level above a priority threshold for cyber-resiliency for the clustered file system. 15. A computer-readable storage medium including instructions that when executed by a processor enable performance of operations including: identifying inodes of a first group of files in a clustered file system based on a cyber-resiliency for the clustered file system; grouping the inodes based on a buffer size allocated to a FSCK (File System Consistency Check) operation for the clustered file system; and in response to the buffer size being greater than a total size of the inodes of the first group of files, passing the inodes of the first group of files to the FSCK operation in a single iteration; in response to the buffer size being less than a total size of the inodes of a second group of files: identifying the inodes of the second group of files that belong to a first container and that belong to a second container; passing the inodes of the second group of files that belong to the first container to the FSCK operation in a first iteration; and passing, after the first iteration completes, the inodes of the second group of files that belong to the second container to the FSCK operation in a second iteration. 16. The computer-readable storage medium of claim 15 , wherein the operations further include: identifying the second group of files in the clustered file system based on the cyber-resiliency for the clustered file system; separating inodes of the second group of files; and grouping the inodes of the second group of files based on the buffer size allocated to the FSCK operation for the clustered file system. 17. The computer-readable storag

Assignees

Inventors

Classifications

  • G06F16/182Primary

    Distributed file systems · CPC title

  • Ensuring data consistency and integrity · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11650975B2 cover?
Online file system consistency check for container data on a clustered file system is provided via identifying inodes (index nodes) of a group of files in a clustered file system based on a cyber-resiliency for the clustered file system; grouping the inodes based on a buffer size allocated to a FSCK (File System Consistency Check) operation; passing the inodes of to the FSCK operation in a sing…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F16/182. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 16 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 10 related publications on this page (citations in our corpus or others sharing the same primary CPC).