Systems, methods, and computer program products for scheduling processing to achieve space savings

US9280571B2 · US · B2

Patent metadata
Publication number: US-9280571-B2
Application number: US-201414460032-A
Country: US
Kind code: B2
Filing date: Aug 14, 2014
Priority date: Mar 30, 2012
Publication date: Mar 8, 2016
Grant date: Mar 8, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method performed in a system that has a plurality of volumes stored to storage hardware, the method including generating, for each of the volumes, a respective space saving potential iteratively over time and scheduling space saving operations among the plurality of volumes by analyzing each of the volumes for space saving potential and assigning priority of resources based at least in part on space saving potential.
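
The scheduling idea in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `Volume` class, the `saving_potential` field, the `slots` resource budget, and the sample values are all hypothetical names and numbers chosen for illustration. The sketch simply ranks volumes by an already-estimated space saving potential and gives the available processing slots to the highest-ranked ones.

```python
from dataclasses import dataclass


@dataclass
class Volume:
    name: str
    # Hypothetical estimate in [0, 1]: fraction of the volume's data that a
    # space saving operation is expected to reclaim.
    saving_potential: float


def schedule(volumes, slots):
    """Return the names of the volumes to process, highest potential first.

    `slots` is an assumed resource budget: how many volumes can be
    processed in one scheduling round.
    """
    ranked = sorted(volumes, key=lambda v: v.saving_potential, reverse=True)
    return [v.name for v in ranked[:slots]]


# Illustrative values only; they do not come from the patent.
volumes = [
    Volume("vol_a", 0.10),
    Volume("vol_b", 0.45),
    Volume("vol_c", 0.30),
]
order = schedule(volumes, slots=2)
print(order)
```

Because the abstract describes the potential as generated iteratively over time, a real scheduler would presumably refresh `saving_potential` before each round; the sketch treats it as a given input.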

First claim

Opening claim text (preview).

What is claimed is:

1. A method for data deduplication, the method comprising: determining a cumulative space savings value obtained for previous deduplication operations on a data object; determining an average space savings for the data object by: identifying data received since a previous deduplication operation; performing a deduplication operation on the identified data; calculating a percent storage space reduction for the identified data based on the deduplication operation; and determining an average space savings value based on the cumulative space savings value weighted by a non-deduplicated size of the data object prior to the previous deduplication operation, and further based on the percent storage space reduction for the identified data weighted by a non-duplicated amount of the identified data; and assigning a deduplication priority to the data object based on the average space savings value.

2. The method of claim 1, wherein the deduplication priority of the data object is processed in association with deduplication priorities of other data objects, said method further comprising performing a subsequent deduplication operation on the other data objects according to the deduplication priority of each of the data objects.

3. The method of claim 2, wherein the deduplication priority of each of the data objects is further based on a size of a change log of the respective data object.

4. The method of claim 2, wherein the deduplication priority of each of the data objects is further based on an overflow of a change log of the respective data object.

5. The method of claim 2, wherein the deduplication priority of each of the data objects is further based a performance metric of a deduplication system.

6. The method of claim 2, further comprising performing logging changes to one of the data objects in a change log based on the deduplication priority of the one data object.

7. The method of claim 1, further comprising selecting the data object for deduplication from among a plurality of data objects based on the deduplication priority of the data object.

8. An apparatus comprising: a non-transitory, tangible computer readable storage medium storing a computer program for data duplication, wherein the computer program has instructions that, when executed by a computer processor, carry out: determining a cumulative space savings value obtained for previous deduplication operations on a data object; determining an average space savings for the data object by: identifying data received since a previous deduplication operation; performing a deduplication operation on the identified data; calculating a percent storage space reduction for the identified data based on the deduplication operation; and determining the average space savings value based on the cumulative space savings value weighted by the non-deduplicated size of the data object prior to the last deduplication operation, and further based on the percent storage space reduction for the identified data weighted by the non-duplicated amount of the identified data; and assigning a deduplication priority to the data object based on the average space savings value.

9. The apparatus of claim 8, wherein the computer program has further instructions that carry out maintaining a change log of the data object based on the deduplication priority.

10. The apparatus of claim 8, wherein the computer program has further instructions that carry out assigning the deduplication priority to the data object further based on a size of a change log for the data object.

11. The apparatus of claim 8, wherein the computer program has further instructions that carry out assigning the deduplication priority to the data object further based on an overflow of a change log of the data object.

12. The apparatus of claim 8, wherein the computer program has further instructions that carry out selecting the data object for deduplication from among a plurality of data objects based on the deduplication priority of the data object.

13. A storage system comprising: a storage device including a non-volatile storage medium and operable to store a data object thereupon; and a deduplication manager coupled to the storage device and operable to transact data with the storage device, wherein the deduplication manager is further operable to: determine a cumulative space savings value obtained for previous deduplication operations on the data object; determine an average space savings for the data object by: identifying data received since a previous deduplication operation; performing a deduplication operation on the identified data; calculating a percent storage space reduction for the identified data based on the deduplication operation; and determining an average space savings value based on the cumulative space savings value weighted by a non-deduplicated size of the data object prior to the previous deduplication operation, and further based on the percent storage space reduction for the identified data weighted by a non-duplicated amount of the identified data; and assign a deduplication priority to the data object based on the average space savings value.

14. The storage system of claim 13, wherein the deduplication manager is further operable to schedule the data object for deduplication further based on a size of a change log for the data object.

15. The storage system of claim 13, wherein the deduplication manager is further operable to schedule the data object for deduplication further based on an overflow of a change log for the data object.

16. The storage system of claim 13, wherein the deduplication manager is further operable to schedule the data object for deduplication further based on a performance metric of the storage system.

17. The storage system of claim 13, wherein the deduplication manager is further operable to log changes to the data object to a change log based on the deduplication priority.

18. The storage system of claim 15, wherein the deduplication manager is further operable to selecting the data object for deduplication from among a plurality of data objects based on the deduplication priority of the data object.
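
The weighted average recited in claim 1 can be made concrete with a small worked sketch. The claim text does not give an explicit formula, so the one below is an assumed reading: the cumulative savings is weighted by the object's prior non-deduplicated size, the new percent reduction is weighted by the amount of newly received data, and the sum is normalized by the total size. All function names, parameter names, and sample figures are hypothetical.

```python
import math


def average_space_savings(cumulative_savings, prior_size, new_savings, new_size):
    """Size-weighted average of old and new savings fractions.

    cumulative_savings: savings fraction from earlier deduplication runs
    prior_size:         non-deduplicated size before the previous run
    new_savings:        fraction saved on data received since that run
    new_size:           non-deduplicated amount of that new data
    Normalizing by the total size is an assumption, not claim language.
    """
    total = prior_size + new_size
    if total == 0:
        return 0.0
    return (cumulative_savings * prior_size + new_savings * new_size) / total


def rank_by_priority(averages):
    """Order data object names so the highest average savings comes first."""
    return sorted(averages, key=averages.get, reverse=True)


# Illustrative numbers only: 800 GB previously saved 40 %, and 200 GB of new
# data saved 10 %, so the average is (0.40*800 + 0.10*200) / 1000.
avg_a = average_space_savings(0.40, 800, 0.10, 200)
avg_b = average_space_savings(0.20, 500, 0.60, 500)
ranking = rank_by_priority({"object_a": avg_a, "object_b": avg_b})
print(round(avg_a, 2), round(avg_b, 2), ranking)
```

Under this reading, a large object with a long history of modest savings is not displaced by a single small batch of highly compressible data, because each term counts in proportion to the amount of data it describes.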

Assignees

Inventors

Classifications

  • Physics · mapped topic

  • Redundancy elimination performed by the file system (error detection or correction of the data by redundancy in operations G06F11/14) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9280571B2 cover?
A method performed in a system that has a plurality of volumes stored to storage hardware, the method including generating, for each of the volumes, a respective space saving potential iteratively over time and scheduling space saving operations among the plurality of volumes by analyzing each of the volumes for space saving potential and assigning priority of resources based at least in part o…
Who is the assignee on this patent?
NetApp Inc
What technology area does this patent fall under?
Primary CPC classification G06F17/30303. Mapped technology areas include Physics.
When was this patent published?
Publication date Mar 8, 2016 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).