Systems and methods for backup and restore of container-based persistent volumes

US12032453B2 · US · B2

Patent metadata
Publication number: US-12032453-B2
Application number: US-201916599752-A
Country: US
Kind code: B2
Filing date: Oct 11, 2019
Priority date: Oct 11, 2019
Publication date: Jul 9, 2024
Grant date: Jul 9, 2024

Abstract

Official abstract text for this publication.

Methods and systems for backing up and restoring data on a worker node are described. According to some embodiments, the method includes using a job controller to spin up one or more backup jobs in the worker node. The method further includes utilizing, by the backup jobs, mount propagation to access one or more persistent volumes of a pod on the worker node. The method further includes performing, by the backup jobs, backup tasks on the persistent volumes. The method further includes applying a set of rules used by a scheduler to determine a location of the pod in order to ensure the backup jobs are scheduled on a same worker node. The method further includes storing, by the backup jobs, backup artifacts generated by the backup tasks, wherein the backup artifacts include deduplicated data.
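The scheduling and mount-propagation steps in the abstract can be sketched as a Kubernetes Job manifest built in Python. This is a minimal illustration, not the patent's implementation: the function name `build_backup_job`, the label key `example.com/backup-node`, the container image, and the mount path are all hypothetical. The manifest field names (`nodeAffinity`, `mountPropagation`, `completions`, `hostPath`) are standard Kubernetes API fields.

```python
import json


def build_backup_job(pod_name: str, node_label_value: str, host_volume_dir: str) -> dict:
    """Return a batch/v1 Job manifest (as a plain dict) for a backup job.

    All names and values here are illustrative assumptions.
    """
    return {
        "apiVersion": "batch/v1",
        "kind": "Job",
        "metadata": {"name": f"backup-{pod_name}"},
        "spec": {
            # Number of successful completions the Job must reach.
            "completions": 1,
            "template": {
                "spec": {
                    "restartPolicy": "Never",
                    # Node affinity: only schedule on nodes carrying the
                    # (hypothetical) custom label, i.e. the pod's node.
                    "affinity": {
                        "nodeAffinity": {
                            "requiredDuringSchedulingIgnoredDuringExecution": {
                                "nodeSelectorTerms": [{
                                    "matchExpressions": [{
                                        "key": "example.com/backup-node",
                                        "operator": "In",
                                        "values": [node_label_value],
                                    }]
                                }]
                            }
                        }
                    },
                    "containers": [{
                        "name": "backup",
                        "image": "example.com/backup-agent:latest",  # hypothetical
                        "volumeMounts": [{
                            "name": "pod-volumes",
                            "mountPath": "/host-volumes",
                            # Mounts made on the host under this path become
                            # visible inside the backup container.
                            "mountPropagation": "HostToContainer",
                        }],
                    }],
                    "volumes": [{
                        "name": "pod-volumes",
                        "hostPath": {"path": host_volume_dir},
                    }],
                }
            },
        },
    }


if __name__ == "__main__":
    # The host directory is an assumed example path, not taken from the patent.
    print(json.dumps(build_backup_job("db", "node-1", "/var/lib/kubelet/pods"), indent=2))
```

`HostToContainer` is used here because `Bidirectional` additionally requires a privileged container; either value is named in the dependent claims.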

First claim

Opening claim text (preview).

What is claimed is:

1. A computer-implemented method of backing up data on a worker node among a plurality of worker nodes, comprising: using a job controller to spin up one or more backup jobs in the worker node; utilizing, by the backup jobs, mount propagation to access one or more persistent volumes of a pod, among a plurality of pods, on the worker node; performing, by the backup jobs, backup tasks on the persistent volumes of the pod; applying a set of rules used by a scheduler to determine a location of the pod to ensure the backup jobs are scheduled on a same worker node where the pod is running, wherein the set of rules is defined based on custom labels on the plurality of worker nodes and label sectors specified in the plurality of pods; and storing, by the backup jobs, backup artifacts generated by the backup tasks, wherein the backup artifacts include deduplicated data, wherein, prior to performing the backup tasks on the persistent volumes of the pod, the method further includes: using pre-hooks to freeze a file system of the worker node, wherein the persistent volumes are mapped on the file system of the worker node; and cleaning up the backup jobs based on reaching a specified number of successful completions.

2. The method of claim 1, further comprising: after performing the backup tasks on the persistent volumes of the pod, unfreezing the file system of the worker node.

3. The method of claim 2, further comprising: collecting metadata associated with the backup tasks performed on the persistent volumes.

4. The method of claim 1, further comprising: prior to storing the backup artifacts generated by the backup tasks, sending the backup artifacts to a data server to store the backup artifacts.

5. The method of claim 1, wherein a value of the mount propagation is HostToContainer or Bidirectional.

6. The method of claim 1, wherein the job controller is a kube-controller-manager.

7. The method of claim 1, wherein the set of rules used by the scheduler is node affinity.

8. The method of claim 1, wherein the backup artifacts further include logging information, configuration information across containers consisted in the pod, and storing secrets.

9. A computer-implemented method of restoring data on a worker node among a plurality of worker nodes, comprising: using a job controller to spin up one or more restore jobs in the worker node; utilizing, by the restore jobs, mount propagation to access one or more persistent volumes of a pod, among a plurality of pods, on the worker node; applying a set of rules used by a scheduler to determine a location of the pod to ensure the restore jobs are scheduled on a same worker node where the pod is running, wherein the set of rules is defined based on custom labels on the plurality of worker nodes and label sectors specified in the plurality of pods; retrieving, by the restore jobs, backup artifacts; and performing, by the restore jobs, restore tasks on the persistent volumes of the pod based on the retrieved backup artifacts, wherein prior to performing the restore tasks on the persistent volumes of the pod, the method includes: using pre-hooks to freeze a file system of the worker node, wherein the persistent volumes are mapped on the file system of the worker node; and cleaning up the restore jobs based on reaching a specified number of successful completions.

10. The method of claim 9, further comprising: after performing the restore tasks on the persistent volumes of the pod, unfreezing the file system of the worker node.

11. The method of claim 10, further comprising: collecting metadata associated with the restore tasks performed on the persistent volumes.

12. The method of claim 9, wherein the backup artifacts are retrieved from a data server.

13. The method of claim 9, wherein a value of the mount propagation is HostToContainer or Bidirectional.

14. The method of claim 9, wherein the job controller is a kube-controller-manager.

15. The method of claim 9, wherein the set of rules used by the scheduler is node affinity.

16. The method of claim 9, wherein the backup artifacts include deduplicated data, logging information, configuration information across containers consisted in the pod, and storing secrets associated with the persistent volumes.

17. A non-transitory machine-readable medium having instructions stored therein, which when executed by a processor, cause the processor to perform operations, the operations comprising: using a job controller to spin up one or more backup jobs in a worker node among a plurality of worker nodes; utilizing, by the backup jobs, mount propagation to access one or more persistent volumes of a pod, among a plurality of pods, on the worker node; performing, by the backup jobs, backup tasks on the persistent volumes of the pod; applying a set of rules used by a scheduler to determine a location of the pod to ensure the backup jobs are scheduled on a same worker node where the pod is running, wherein the set of rules is defined based on custom labels on the plurality of worker nodes and label sectors specified in the plurality of pods; and storing, by the backup jobs, backup artifacts generated by the backup tasks, wherein the backup artifacts include deduplicated data, wherein, prior to performing the backup tasks on the persistent volumes of the pod, the operations further comprising: using pre-hooks to freeze a file system of the worker node, wherein the persistent volumes are mapped on the file system of the worker node; and cleaning up the backup jobs based on reaching a specified number of successful completions.

18. A data processing system, comprising: a processor; and a memory coupled to the processor to store instructions, which when executed by the processor, cause the processor to perform operations, the operations including: using a job controller to spin up one or more backup jobs in a worker node among a plurality of worker nodes; utilizing, by the backup jobs, mount propagation to access one or more persistent volumes of a pod, among a plurality of pods, on the worker node; performing, by the backup jobs, backup tasks on the persistent volumes of the pod; applying a set of rules used by a scheduler to determine a location of the pod to ensure the backup jobs are scheduled on a same worker node where the pod is running, wherein the set of rules is defined based on custom labels on the plurality of worker nodes and label sectors specified in the plurality of pods; and storing, by the backup jobs, backup artifacts generated by the backup tasks, wherein the backup artifacts include deduplicated data, wherein, prior to performing the backup tasks on the persistent volumes of the pod, the operations further comprising: using pre-hooks to freeze a file system of the worker node, wherein the persistent volumes are mapped on the file system of the worker node; and cleaning up the backup jobs based on reaching a specified number of successful completions.

19. A non-transitory machine-readable medium having instructions stored therein, which when executed by a processor, cause the processor to perform operations, the operations comprising: using a job controller to spin up one or more restore jobs in a worker node among a plurality of worker nodes; utilizing, by the restore jobs, mount propagation to access one or more persistent volumes of a pod, among a plurality of pods, on the worker node; applying a set of rules used by a scheduler to determine a location of the pod to ensure
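The claimed ordering (freeze the file system, run the backup, then unfreeze) and the notion of deduplicated backup artifacts can be illustrated with a small Python sketch. Everything here is an assumption made for illustration: the claims do not say how deduplication is performed, so fixed-size chunking with SHA-256 content addressing is used as one common approach, and the `frozen` helper and `ChunkStore` class are hypothetical names. The freeze and unfreeze hooks are passed in as callables; on Linux they might wrap the util-linux `fsfreeze` command, but that is not stated in the source.

```python
import hashlib
from contextlib import contextmanager
from typing import Callable, Dict, Iterator, List


@contextmanager
def frozen(freeze: Callable[[], None], unfreeze: Callable[[], None]) -> Iterator[None]:
    """Run the pre-hook (freeze) before the body and always unfreeze afterwards."""
    freeze()
    try:
        yield
    finally:
        # Unfreeze even if the backup task raises, so the node is not left frozen.
        unfreeze()


class ChunkStore:
    """In-memory content-addressed store: identical chunks are kept only once."""

    def __init__(self) -> None:
        self.chunks: Dict[str, bytes] = {}

    def put(self, data: bytes, chunk_size: int = 4096) -> List[str]:
        """Split data into fixed-size chunks and return the list of chunk digests."""
        recipe: List[str] = []
        for offset in range(0, len(data), chunk_size):
            chunk = data[offset:offset + chunk_size]
            digest = hashlib.sha256(chunk).hexdigest()
            # setdefault stores the chunk only if this digest is new.
            self.chunks.setdefault(digest, chunk)
            recipe.append(digest)
        return recipe

    def get(self, recipe: List[str]) -> bytes:
        """Rebuild the original bytes from a list of chunk digests (restore path)."""
        return b"".join(self.chunks[digest] for digest in recipe)
```

A backup task would call `store.put(...)` inside `with frozen(freeze_hook, unfreeze_hook):`, and a restore task would call `store.get(recipe)` with the stored digest list.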

Assignees

EMC IP Holding Co LLC

Classifications

  • for networked environments

  • Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors

  • Using snapshots, i.e. a logical point-in-time copy of the data

  • using de-duplication of the data

  • Backup restoration techniques

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12032453B2 cover?
Methods and systems for backing up and restoring data on a worker node are described. According to some embodiments, the method includes using a job controller to spin up one or more backup jobs in the worker node. The method further includes utilizing, by the backup jobs, mount propagation to access one or more persistent volumes of a pod on the worker node. The method further includes perform…
Who is the assignee on this patent?
EMC IP Holding Co LLC
What technology area does this patent fall under?
Primary CPC classification G06F11/1469. Mapped technology areas include Physics.
When was this patent published?
Publication date Jul 9, 2024 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).