Eliminating duplicate data by sharing file system extents

US9483487B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9483487-B2
Application numberUS-201414456914-A
CountryUS
Kind codeB2
Filing dateAug 11, 2014
Priority dateNov 30, 2009
Publication dateNov 1, 2016
Grant dateNov 1, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A hardware and/or software facility to enable emulated storage devices to share data stored on physical storage resources of a storage system. The facility may be implemented on a virtual tape library (VTL) system configured to back up data sets that have a high level of redundancy on multiple virtual tapes. The facility organizes all or a portion of the physical storage resources according to a common store data layout. By enabling emulated storage devices to share data stored on physical storage resources, the facility enables deduplication across the emulated storage devices irrespective of the emulated storage device to which the data is or was originally written, thereby eliminating duplicate data on the physical storage resources and improving the storage consumption of the emulated storage devices on the physical storage resources.

First claim

Opening claim text (preview).

We claim: 1. A physical storage device comprising: a first data set written to a first emulated storage device, wherein the first data set includes a first section of data; and a second data set written to a second emulated storage device, wherein the second data set includes the first section of data, and wherein the first section of data is written to a storage location on the physical storage device that is shared by the first and second emulated storage devices, wherein the physical storage device is organized according to a common store data layout, and wherein the storage location corresponds to a portion of a common store extent, said common store extend including a plurality of common store units. 2. The physical storage device of claim 1 wherein the common store extent is referenced by an offset in a common store unit, an uncompressed length of the common store extent, and a number of common store units included in the common store extent. 3. The physical storage device of claim 1 wherein each of the plurality of common store units includes a fixed number of contiguous data blocks. 4. The physical storage device of claim 1 wherein each of the plurality of common store units includes a variable number of contiguous data blocks. 5. The physical storage device of claim 1 wherein each of the plurality of common store units includes a header, one or more record compression units of compressed data, and boundary information mapping the beginning of each record compression unit to an offset in compressed storage of the physical storage device. 6. The physical storage device of claim 1 wherein the emulated storage devices are virtual tapes of a virtual tape library. 7. The physical storage device of claim 1 wherein a deduplication technique is applied to the physical storage device to eliminate duplicate data stored thereon irrespective of the emulated storage device to which the duplicate data was originally written. 8. A physical storage device comprising: a first data set written to a first emulated storage device, wherein the first data set includes a first section of data; and a second data set written to a second emulated storage device, wherein the second data set includes the first section of data, and wherein the first section of data is written to a storage location on the physical storage device that is shared by the first and second emulated storage devices, wherein the physical storage device is organized according to a common store data layout, and wherein the storage location corresponds to a portion of a common store extent that is referenced in whole by the first emulated storage device, and referenced in part by the second emulated storage device. 9. The physical storage device of claim 8 , wherein the emulated storage devices are virtual tapes of a virtual tape library. 10. The physical storage device of claim 8 , wherein a deduplication technique is applied to the physical storage device to eliminate duplicate data stored thereon irrespective of the emulated storage device to which the duplicate data was originally written. 11. One or more computer-readable memories storing computer-executable instructions for implementing a method for deduplication at a storage server, comprising: instructions for storing a first data set at a first emulated storage device, wherein the first data set includes a first section of data; and instructions for storing a second data set at a second emulated storage device, wherein the second data set includes the first section of data, and wherein the first section of data is stored at a storage location on the physical storage device that is shared by the first and second emulated storage devices, wherein the physical storage device is organized according to a common store data layout, and wherein the storage location corresponds to a portion of a common store extent, the common store extent including a plurality of common store units. 12. The one or more computer-readable memories of claim 11 , wherein each of the plurality of common store units includes a header, one or more record compression units of compressed data, and boundary information mapping the beginning of each record compression unit to an offset in compressed storage of the physical storage device. 13. The one or more computer-readable memories of claim 11 , wherein the common store extent is referenced by an offset in a common store unit, an uncompressed length of the common store extent, and a number of common store units included in the common store extent. 14. The one or more computer-readable memories of claim 11 , further comprising: instructions for applying a deduplication technique to the physical storage device to eliminate duplicate data stored thereon irrespective of the emulated storage device to which the duplicate data was originally written. 15. One or more computer-readable memories storing computer-executable instructions for implementing a method for deduplication at a storage server, comprising: instructions for storing a first data set at a first emulated storage device, wherein the first data set includes a first section of data; and instructions for storing a second data set at a second emulated storage device, wherein the second data set includes the first section of data, and wherein the first section of data is stored at a storage location on the physical storage device that is shared by the first and second emulated storage devices, wherein the physical storage device is organized according to a common store data layout, and wherein the storage location corresponds to a portion of a common store extent that is referenced in whole by the first emulated storage device, and referenced in part by the second emulated storage device.

Assignees

Inventors

Classifications

  • G06F3/0641Primary

    De-duplication techniques · CPC title

  • Physics · mapped topic

  • in relation to data integrity, e.g. data losses, bit errors · CPC title

  • Physics · mapped topic

  • Libraries, e.g. tape libraries, jukebox · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9483487B2 cover?
A hardware and/or software facility to enable emulated storage devices to share data stored on physical storage resources of a storage system. The facility may be implemented on a virtual tape library (VTL) system configured to back up data sets that have a high level of redundancy on multiple virtual tapes. The facility organizes all or a portion of the physical storage resources according to …
Who is the assignee on this patent?
Netapp Inc
What technology area does this patent fall under?
Primary CPC classification G06F3/0641. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 01 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).