US9965182B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9965182-B2 |
| Application number | US-201514919204-A |
| Country | US |
| Kind code | B2 |
| Filing date | Oct 21, 2015 |
| Priority date | Oct 21, 2015 |
| Publication date | May 8, 2018 |
| Grant date | May 8, 2018 |
Official abstract text for this publication.
Various embodiments for optimizing deduplication in a computing storage environment by a processor. Links between data regions are intelligently formed, based on up-to-date popularity statistics, including a number of times a particular one of the data regions was a target for a potential link with another one of the data regions.
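The popularity statistic named in the abstract, a count of how often a region was the target of a potential link, can be illustrated with a minimal sketch. The class and method names below (`PopularityStats`, `record_potential_link`, `most_popular`) are hypothetical and do not come from the patent; the sketch only shows one way such a counter might be kept.

```python
from collections import Counter


class PopularityStats:
    """Hypothetical counter of how often each data region was a link target."""

    def __init__(self):
        # Maps region id -> number of times it was the target of a potential link.
        self._target_hits = Counter()

    def record_potential_link(self, referrer_id, target_id):
        # Only the target's count grows; the referrer is merely the requester.
        self._target_hits[target_id] += 1

    def popularity(self, region_id):
        # Counter returns 0 for regions that were never targeted.
        return self._target_hits[region_id]

    def most_popular(self, n=1):
        # The n most frequently targeted regions, as (region_id, count) pairs.
        return self._target_hits.most_common(n)


stats = PopularityStats()
for referrer, target in [("r1", "A"), ("r2", "A"), ("r3", "B")]:
    stats.record_potential_link(referrer, target)
```

Here region `A` was targeted twice and `B` once, so `A` would be the preferred candidate when a new link is formed.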
Opening claim text (preview).
The invention claimed is:
1. A method for optimizing deduplication in a computing storage environment by a processor, comprising: intelligently forming links between data regions based on up-to-date popularity statistics, including a number of times a particular one of the data regions was a target for a potential link with another one of the data regions; managing, by an owner, a popularity index incorporating the popularity statistics, in one of a memory-only location and persistent memory location, wherein, over time, the popularity index is subjected to an aging mechanism pursuant to a decaying algorithm; creating, by a potential referrer one of the data regions to an owner one of the data regions, one of the intelligently formed links by searching the popularity index; deciding, by the owner one of the data regions, to accept the one of the intelligently formed links based on at least one of a plurality of predetermined factors; wherein deciding based on the at least one of the plurality of predetermined factors includes at least one of: considering a popularity metric of the owner one of the data regions, and considering at least one self-data management characteristic of the owner one of the data regions; if the one of the intelligently formed links is accepted by the owner one of the data regions, creating, by the potential referrer one of the data regions, the one of the intelligently formed links; and if the one of the intelligently formed links is rejected by the owner one of the data regions, writing data.
2. The method of claim 1, further including initializing a system-wide parameter describing a minimum popularity value per region to indicate a popular owner of a corresponding data region, wherein the minimum popularity value is confirmed when testing the computing storage environment.
3. A system for optimizing deduplication in a computing storage environment, comprising: at least one processor, operational in the computing storage environment, wherein the at least one processor intelligently forms links between data regions based on up-to-date popularity statistics, including a number of times a particular one of the data regions was a target for a potential link with another one of the data regions; manages, by an owner, a popularity index incorporating the popularity statistics, in one of a memory-only location and persistent memory location, wherein, over time, the popularity index is subjected to an aging mechanism pursuant to a decaying algorithm; creates, by a potential referrer one of the data regions to an owner one of the data regions, one of the intelligently formed links by searching the popularity index; decides, by the owner one of the data regions, to accept the one of the intelligently formed links based on at least one of a plurality of predetermined factors; wherein deciding based on the at least one of the plurality of predetermined factors includes at least one of: considering a popularity metric of the owner one of the data regions, and considering at least one self-data management characteristic of the owner one of the data regions; if the one of the intelligently formed links is accepted by the owner one of the data regions, creates, by the potential referrer one of the data regions, the one of the intelligently formed links; and if the one of the intelligently formed links is rejected by the owner one of the data regions, writes data.
4. The system of claim 3, wherein the at least one processor initializes a system-wide parameter describing a minimum popularity value per region to indicate a popular owner of a corresponding data region, further wherein the minimum popularity value is confirmed when testing the computing storage environment.
5. A computer program product for optimizing deduplication in a computing storage environment by a processor, the computer program product comprising a non-transitory computer-readable storage medium having computer-readable program code portions stored therein, the computer-readable program code portions comprising: a first executable portion that intelligently forms links between data regions based on up-to-date popularity statistics, including a number of times a particular one of the data regions was a target for a potential link with another one of the data regions; a second executable portion that manages, by an owner, a popularity index incorporating the popularity statistics, in one of a memory-only location and persistent memory location, wherein, over time, the popularity index is subjected to an aging mechanism pursuant to a decaying algorithm; a third executable portion that creates, by a potential referrer one of the data regions to an owner one of the data regions, one of the intelligently formed links by searching the popularity index; a fourth executable portion that decides, by the owner one of the data regions, to accept the one of the intelligently formed links based on at least one of a plurality of predetermined factors; wherein deciding based on the at least one of the plurality of predetermined factors includes at least one of: considering a popularity metric of the owner one of the data regions, and considering at least one self-data management characteristic of the owner one of the data regions; a fifth executable portion that, if the one of the intelligently formed links is accepted by the owner one of the data regions, creates, by the potential referrer one of the data regions, the one of the intelligently formed links; and a sixth executable portion that, if the one of the intelligently formed links is rejected by the owner one of the data regions, writes data.
6. The computer program product of claim 5, further including a seventh executable portion that initializes a system-wide parameter describing a minimum popularity value per region to indicate a popular owner of a corresponding data region, further wherein the minimum popularity value is confirmed when testing the computing storage environment.
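The accept-or-write flow recited in the independent claims can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented implementation: the claims only say "a decaying algorithm" and "predetermined factors", so the multiplicative decay, the `MIN_POPULARITY` and `DECAY_FACTOR` values, the `max_links` capacity limit, and every name in the code are hypothetical. The index search is reduced here to already having the owner region in hand.

```python
from dataclasses import dataclass, field

MIN_POPULARITY = 2.0  # hypothetical system-wide minimum popularity per region
DECAY_FACTOR = 0.5    # hypothetical; the claims do not specify the decay form


@dataclass
class OwnerRegion:
    region_id: str
    popularity: float = 0.0
    max_links: int = 4  # hypothetical self-data-management characteristic
    links: list = field(default_factory=list)

    def note_potential_link(self):
        # Being targeted for a potential link raises the owner's popularity.
        self.popularity += 1.0

    def age(self):
        # Assumed aging step: multiplicative decay of the popularity value.
        self.popularity *= DECAY_FACTOR

    def accepts_link(self):
        # Assumed policy: accept only when popular enough and not at capacity.
        return (self.popularity >= MIN_POPULARITY
                and len(self.links) < self.max_links)


def deduplicate(referrer_id, owner, storage, data):
    """Try to link the referrer to the owner; otherwise write the data."""
    owner.note_potential_link()
    if owner.accepts_link():
        owner.links.append(referrer_id)  # link accepted: no new copy stored
        return "linked"
    storage[referrer_id] = data          # link rejected: data is written
    return "written"


owner = OwnerRegion("A")
storage = {}
results = [deduplicate(f"r{i}", owner, storage, b"x") for i in range(3)]
```

In this run the first request is rejected because the owner has not yet reached the assumed minimum popularity, so the data is written; the next two requests are accepted as links. Calling `owner.age()` afterwards lowers the popularity again, which models how stale popularity could stop attracting new links.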
based on file chunks · CPC title
Saving storage space on storage systems · CPC title
Distributed or networked storage systems, e.g. storage area networks [SAN], network attached storage [NAS] · CPC title
De-duplication techniques · CPC title
Digital input from, or digital output to, record carriers {, e.g. RAID, emulated record carriers or networked record carriers} · CPC title