Data restructuring of deduplicated data

US9934232B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9934232-B2
Application numberUS-201414303708-A
CountryUS
Kind codeB2
Filing dateJun 13, 2014
Priority dateJun 13, 2014
Publication dateApr 3, 2018
Grant dateApr 3, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Various embodiments for enhancing storage of deduplicated data in a computing storage environment. Analytics are applied to at least one data storage characteristic observed in the computing storage environment to restructure the deduplicated data in a more sequential manner so as to enhance performance of the computing storage environment.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method for enhancing storage of deduplicated data by a processor device in a computing storage environment, comprising: applying analytics to at least one data storage characteristic observed in the computing storage environment to restructure the deduplicated data in a sequential manner so as to enhance performance of the computing storage environment; and monitoring a type of storage device associated with the deduplicated data, wherein the analytics are withheld from application when the type of storage device is determined to be a solid-state device; wherein: the at least one data storage characteristic observed in the computing storage environment is at least a read pattern, a data movement characteristic, a block move count characteristic, a read and write throughput characteristic, and a latency characteristic, and restructuring the deduplicated data in the sequential manner is performed by applying the analytics to the read pattern, data movement characteristic, block move count characteristic, read and write throughput characteristic, and latency characteristic, each applied in at least one of a predetermined order, each according at least one of a predetermined weight, and each compared against at least one predetermined threshold to obtain an optimal placement of the deduplicated data in a rearranged, sequential, state; and further wherein one of the at least one predetermined thresholds comprises prohibiting a particular block of the deduplicated data from being restructured when the particular block of deduplicated data has been previously moved in the computing storage environment within a predefined timeframe. 2. The method of claim 1 , further including restructuring the deduplicated data in the sequential manner using a background process. 3. The method of claim 1 , wherein the at least one data storage characteristic observed in the computing storage environment is a read pattern, and further including performing the restructuring of the deduplicated data in the sequential manner based on analytics applied to the read pattern. 4. The method of claim 1 , wherein the at least one data storage characteristic observed in the computing storage environment is a data movement characteristic, and further including performing the restructuring of the deduplicated data in the sequential manner based on analytics applied to the data movement characteristic. 5. The method of claim 1 , wherein the at least one data storage characteristic observed in the computing storage environment is a block move count characteristic, and further including performing the restructuring of the deduplicated data in the sequential manner based on analytics applied to the block move count characteristic. 6. The method of claim 1 , wherein the at least one data storage characteristic observed in the computing storage environment is at least one of a read and write throughput and latency characteristic, and further including performing the restructuring of the deduplicated data in the sequential manner based on analytics applied to the read and write throughput and latency characteristics. 7. A system for enhancing storage of deduplicated data by a processor device in a computing storage environment, comprising: a processor device that applies analytics to at least one data storage characteristic observed in the computing storage environment to restructure the deduplicated data in a sequential manner so as to enhance performance of the computing storage environment; and monitors a type of storage device associated with the deduplicated data, wherein the analytics are withheld from application when the type of storage device is determined to be a solid-state device; wherein: the at least one data storage characteristic observed in the computing storage environment is at least a read pattern, a data movement characteristic, a block move count characteristic, a read and write throughput characteristic, and a latency characteristic, and restructuring the deduplicated data in the sequential manner is performed by applying the analytics to the read pattern, data movement characteristic, block move count characteristic, read and write throughput characteristic, and latency characteristic, each applied in at least one of a predetermined order, each according at least one of a predetermined weight, and each compared against at least one predetermined threshold to obtain an optimal placement of the deduplicated data in a rearranged, sequential, state; and further wherein one of the at least one predetermined thresholds comprises prohibiting a particular block of the deduplicated data from being restructured when the particular block of deduplicated data has been previously moved in the computing storage environment within a predefined timeframe. 8. The system of claim 7 , wherein the processor restructures the deduplicated data in the sequential manner using a background process. 9. The system of claim 7 , wherein the at least one data storage characteristic observed in the computing storage environment is a read pattern, and further wherein the processor performs the restructuring of the deduplicated data in the sequential manner based on analytics applied to the read pattern. 10. The system of claim 7 , wherein the at least one data storage characteristic observed in the computing storage environment is a data movement characteristic, and further wherein the processor performs the restructuring of the deduplicated data in the sequential manner based on analytics applied to the data movement characteristic. 11. The system of claim 7 , wherein the at least one data storage characteristic observed in the computing storage environment is a block move count characteristic, and further wherein the processor performs the restructuring of the deduplicated data in the sequential manner based on analytics applied to the block move count characteristic. 12. The system of claim 7 , wherein the at least one data storage characteristic observed in the computing storage environment is at least one of a read and write throughput and latency characteristic, and further wherein the processor performs the restructuring of the deduplicated data in the sequential manner based on analytics applied to the read and write throughput and latency characteristics. 13. A computer program product for performing video deduplication by a processor device, the computer program product comprising a non-transitory computer-readable storage medium having computer-readable program code portions stored therein, the computer-readable program code portions comprising: a first executable portion that applies analytics to at least one data storage characteristic observed in the computing storage environment to restructure the deduplicated data in a sequential manner so as to enhance performance of the computing storage environment; and monitors a type of storage device associated with the deduplicated data, wherein the analytics are withheld from application when the type of storage device is determined to be a solid-state device; wherein: the at least one data storage characteristic observed in the computing storage environment is at least a read pattern, a data movement characteristic, a block move count characteristic, a read and write throughput characteristic, and a latency characteristic, and restructuring the deduplicated data in the sequential manner is performed by applying the analytics to the read pattern, data movement characteristic, block move count characteristic, read and write throughput characteristic, and latency characteristic, each applied in at least one of a predetermined order

Assignees

Inventors

Classifications

  • Physics · mapped topic

  • Physics · mapped topic

  • Details of de-fragmentation performed by the file system (saving storage space on storage systems G06F3/0608; management of blocks in storage devices G06F3/064) · CPC title

  • based on file chunks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9934232B2 cover?
Various embodiments for enhancing storage of deduplicated data in a computing storage environment. Analytics are applied to at least one data storage characteristic observed in the computing storage environment to restructure the deduplicated data in a more sequential manner so as to enhance performance of the computing storage environment.
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F17/30135. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 03 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).