PDSE physical dump anonymizer

US9690508B1 · US · B1

Patent metadata
FieldValue
Publication numberUS-9690508-B1
Application numberUS-201615277126-A
CountryUS
Kind codeB1
Filing dateSep 27, 2016
Priority dateSep 27, 2016
Publication dateJun 27, 2017
Grant dateJun 27, 2017

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for anonymizing a data set dump includes detecting an error in an original data set and generating a copy of the original data set. Like the original data set, the copy contains an index and a plurality of members. The method reads the index to locate members within the copy that are reachable by the index. The method then converts the copy to a scrubbed copy by overwriting customer data within the members, while retaining the index, structure of the members, and quantity of data within the data set. In certain embodiments, the method further locates lost members within the copy that are not referenced by the index, and overwrites customer data within the lost members. The scrubbed copy may then be transmitted to a technician for examination since all potentially sensitive/confidential data has been removed. A corresponding system and computer program product are also disclosed.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method for anonymizing a data set dump, the method comprising: detecting an error in an original data set; generating a copy of the original data set, the copy comprising an index and a plurality of members; reading the index to locate members within the copy that are reachable by the index; and converting the copy to a scrubbed copy by overwriting customer data within the members, while retaining the index, structure of the members, and quantity of data within the data set. 2. The method of claim 1 , further comprising transmitting the scrubbed copy to a technician for examination. 3. The method of claim 1 , further comprising finding lost members within the copy that are not referenced by the index. 4. The method of claim 3 , wherein converting the copy to the scrubbed copy comprises overwriting customer data within the lost members. 5. The method of claim 1 , wherein the original data set is a Partitioned Data Set Extended (PDSE) data set. 6. The method of claim 1 , wherein overwriting the customer data comprises overwriting the customer data with random data. 7. The method of claim 1 , wherein overwriting the customer data comprises, for each member, determining a relative page number of a beginning of a member's linear space, and overwriting a linked list of pages from the beginning of the member's linear space. 8. A computer program product for anonymizing a data set dump, the computer program product comprising a computer-readable storage medium having computer-usable program code embodied therein, the computer-usable program code configured to perform the following when executed by at least one processor: detect an error in an original data set; generate a copy of the original data set, the copy comprising an index and a plurality of members; read the index to locate members within the copy that are reachable by the index; and convert the copy to a scrubbed copy by overwriting customer data within the members, while retaining the index, structure of the members, and quantity of data within the data set. 9. The computer program product of claim 8 , wherein the computer-usable program code is further configured to transmit the scrubbed copy to a technician for examination. 10. The computer program product of claim 8 , wherein the computer-usable program code is further configured to find lost members within the copy that are not referenced by the index. 11. The computer program product of claim 10 , wherein converting the copy to the scrubbed copy comprises overwriting customer data within the lost members. 12. The computer program product of claim 8 , wherein the original data set is a Partitioned Data Set Extended (PDSE) data set. 13. The computer program product of claim 8 , wherein overwriting the customer data comprises overwriting the customer data with random data. 14. The computer program product of claim 8 , wherein overwriting the customer data comprises, for each member, determining a relative page number of a beginning of a member's linear space, and overwriting a linked list of pages from the beginning of the member's linear space. 15. A system for anonymizing a data set dump, the system comprising: at least one processor; at least one memory device operably coupled to the at least one processor and storing instructions for execution on the at least one processor, the instructions causing the at least one processor to: detect an error in an original data set; generate a copy of the original data set, the copy comprising an index and a plurality of members; read the index to locate members within the copy that are reachable by the index; and convert the copy to a scrubbed copy by overwriting customer data within the members, while retaining the index, structure of the members, and quantity of data within the data set. 16. The system of claim 15 , wherein the instructions further cause the at least one processor to transmit the scrubbed copy to a technician for examination. 17. The system of claim 15 , wherein the instructions further cause the at least one processor to find lost members within the copy that are not referenced by the index. 18. The system of claim 17 , wherein converting the copy to the scrubbed copy comprises overwriting customer data within the lost members. 19. The system of claim 15 , wherein the original data set is a Partitioned Data Set Extended (PDSE) data set. 20. The system of claim 15 , wherein overwriting the customer data comprises, for each member, determining a relative page number of a beginning of a member's linear space, and overwriting a linked list of pages from the beginning of the member's linear space.

Assignees

Inventors

Classifications

  • Dumping, i.e. gathering error/state information after a fault for later diagnosis · CPC title

  • Distributed or networked storage systems, e.g. storage area networks [SAN], network attached storage [NAS] · CPC title

  • G06F3/0619Primary

    in relation to data integrity, e.g. data losses, bit errors · CPC title

  • in a storage system, e.g. in a DASD or network based storage system (drivers for digital recording or reproducing units G06F3/06; circuits for error detection or correction within digital recording or reproducing units G11B20/18; for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS], H04L67/1097) · CPC title

  • Replication mechanisms · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9690508B1 cover?
A method for anonymizing a data set dump includes detecting an error in an original data set and generating a copy of the original data set. Like the original data set, the copy contains an index and a plurality of members. The method reads the index to locate members within the copy that are reachable by the index. The method then converts the copy to a scrubbed copy by overwriting customer da…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F3/0619. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 27 2017 00:00:00 GMT+0000 (Coordinated Universal Time) (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).