Managing data sets of a storage system
US-9086811-B2 · Jul 21, 2015 · US
US9690508B1 · US · B1
| Field | Value |
|---|---|
| Publication number | US-9690508-B1 |
| Application number | US-201615277126-A |
| Country | US |
| Kind code | B1 |
| Filing date | Sep 27, 2016 |
| Priority date | Sep 27, 2016 |
| Publication date | Jun 27, 2017 |
| Grant date | Jun 27, 2017 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A method for anonymizing a data set dump includes detecting an error in an original data set and generating a copy of the original data set. Like the original data set, the copy contains an index and a plurality of members. The method reads the index to locate members within the copy that are reachable by the index. The method then converts the copy to a scrubbed copy by overwriting customer data within the members, while retaining the index, structure of the members, and quantity of data within the data set. In certain embodiments, the method further locates lost members within the copy that are not referenced by the index, and overwrites customer data within the lost members. The scrubbed copy may then be transmitted to a technician for examination since all potentially sensitive/confidential data has been removed. A corresponding system and computer program product are also disclosed.
Opening claim text (preview).
The invention claimed is: 1. A method for anonymizing a data set dump, the method comprising: detecting an error in an original data set; generating a copy of the original data set, the copy comprising an index and a plurality of members; reading the index to locate members within the copy that are reachable by the index; and converting the copy to a scrubbed copy by overwriting customer data within the members, while retaining the index, structure of the members, and quantity of data within the data set. 2. The method of claim 1 , further comprising transmitting the scrubbed copy to a technician for examination. 3. The method of claim 1 , further comprising finding lost members within the copy that are not referenced by the index. 4. The method of claim 3 , wherein converting the copy to the scrubbed copy comprises overwriting customer data within the lost members. 5. The method of claim 1 , wherein the original data set is a Partitioned Data Set Extended (PDSE) data set. 6. The method of claim 1 , wherein overwriting the customer data comprises overwriting the customer data with random data. 7. The method of claim 1 , wherein overwriting the customer data comprises, for each member, determining a relative page number of a beginning of a member's linear space, and overwriting a linked list of pages from the beginning of the member's linear space. 8. A computer program product for anonymizing a data set dump, the computer program product comprising a computer-readable storage medium having computer-usable program code embodied therein, the computer-usable program code configured to perform the following when executed by at least one processor: detect an error in an original data set; generate a copy of the original data set, the copy comprising an index and a plurality of members; read the index to locate members within the copy that are reachable by the index; and convert the copy to a scrubbed copy by overwriting customer data within the members, while retaining the index, structure of the members, and quantity of data within the data set. 9. The computer program product of claim 8 , wherein the computer-usable program code is further configured to transmit the scrubbed copy to a technician for examination. 10. The computer program product of claim 8 , wherein the computer-usable program code is further configured to find lost members within the copy that are not referenced by the index. 11. The computer program product of claim 10 , wherein converting the copy to the scrubbed copy comprises overwriting customer data within the lost members. 12. The computer program product of claim 8 , wherein the original data set is a Partitioned Data Set Extended (PDSE) data set. 13. The computer program product of claim 8 , wherein overwriting the customer data comprises overwriting the customer data with random data. 14. The computer program product of claim 8 , wherein overwriting the customer data comprises, for each member, determining a relative page number of a beginning of a member's linear space, and overwriting a linked list of pages from the beginning of the member's linear space. 15. A system for anonymizing a data set dump, the system comprising: at least one processor; at least one memory device operably coupled to the at least one processor and storing instructions for execution on the at least one processor, the instructions causing the at least one processor to: detect an error in an original data set; generate a copy of the original data set, the copy comprising an index and a plurality of members; read the index to locate members within the copy that are reachable by the index; and convert the copy to a scrubbed copy by overwriting customer data within the members, while retaining the index, structure of the members, and quantity of data within the data set. 16. The system of claim 15 , wherein the instructions further cause the at least one processor to transmit the scrubbed copy to a technician for examination. 17. The system of claim 15 , wherein the instructions further cause the at least one processor to find lost members within the copy that are not referenced by the index. 18. The system of claim 17 , wherein converting the copy to the scrubbed copy comprises overwriting customer data within the lost members. 19. The system of claim 15 , wherein the original data set is a Partitioned Data Set Extended (PDSE) data set. 20. The system of claim 15 , wherein overwriting the customer data comprises, for each member, determining a relative page number of a beginning of a member's linear space, and overwriting a linked list of pages from the beginning of the member's linear space.
Dumping, i.e. gathering error/state information after a fault for later diagnosis · CPC title
Distributed or networked storage systems, e.g. storage area networks [SAN], network attached storage [NAS] · CPC title
in relation to data integrity, e.g. data losses, bit errors · CPC title
in a storage system, e.g. in a DASD or network based storage system (drivers for digital recording or reproducing units G06F3/06; circuits for error detection or correction within digital recording or reproducing units G11B20/18; for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS], H04L67/1097) · CPC title
Replication mechanisms · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.