System and method for pick-and-drop sampling
US-2015379066-A1 · Dec 31, 2015 · US
US10169394B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10169394-B2 |
| Application number | US-201414297128-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jun 5, 2014 |
| Priority date | Jun 5, 2014 |
| Publication date | Jan 1, 2019 |
| Grant date | Jan 1, 2019 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A method, system, and computer program product for managing data sets of a storage facility is disclosed. The method, system, and computer program product include determining, by analyzing a first data set, that the first data set includes a first record having padded data. To identify the padded data, the method, system, and computer program product include comparing at least a portion of the first record of the first data set with a second record of a second data set. Next, the method, system, and computer program product include removing, from the first record of the first data set, the padded data.
Opening claim text (preview).
What is claimed is: 1. A computer-implemented method for managing data sets of a storage system by repairing a data set that includes padded data such that the data set can be read by an application that cannot read a data set including padded data, the method comprising: determining, by analyzing a first data set, that the first data set includes a first record having padded data, wherein padded data represents data added to a record to make a variable length record a fixed length record; comparing, to identify the padded data, at least a portion of the first record of the first data set with a second record of a second data set; and removing, from the first record of the first data set, the padded data identified in response to comparing at least the portion of the first record of the first data set with the second record of the second data set wherein the removing includes: deleting a segment of the first record matching a mask derived from a character pattern, and updating a record length for the first record; loading the first record, without padded data, into a temporary file with the first data set; storing an original file, with padded data, as a retained file with the first data set; storing the temporary file, without padded data, with a name of the original file; and accessing the temporary file by the application, wherein the application could not read the first record with padded data. 2. The method of claim 1 , wherein determining, by analyzing the first data set, that the first data set includes the first record having padded data includes: determining the first record is a fixed length record. 3. The method of claim 2 , further comprising: determining the first record is expected to be a variable length record. 4. The method of claim 1 , wherein determining, by analyzing the first data set, that the first data set includes the first record having padded data includes: determining the first record has been converted to a fixed length record from a variable length record. 5. The method of claim 1 , wherein determining, by analyzing the first data set, that the first data set includes the first record having padded data includes: determining the first data set is without a backup data set; and scanning at least the portion of the first record to resolve a character pattern. 6. The method of claim 5 , wherein comparing, to identify the padded data, at least the portion of the first record of the first data set with the second record of the second data set includes: comparing the character pattern of the first record of the first data set with the second record of the second data set, wherein the second data set is the first data set and the first record is different from the second record. 7. The method of claim 6 , wherein: scanning at least the portion of the first record for the character pattern includes scanning from a back end of the first record toward a front end of the first record until the character pattern stops; and comparing, to identify the padded data, at least the portion of the first record of the first data set with the second record of the second data set includes storing the character pattern and determining a mask derived from the character pattern matches at least a segment of a subsequent record of the first data set. 8. The method of claim 1 , wherein determining, by analyzing the first data set, that the first data set includes the first record having padded data includes: determining the second data set backs-up the first data set; and determining that both the first data set and the second data set include a type of record that is keyed. 9. The method of claim 8 , wherein comparing, to identify the padded data, at least the portion of the first record of the first data set with the second record of the second data set includes: searching, using a key from the second data set, the first data set for the key; and determining the key in the second record matches a like key in the first record. 10. The method of claim 9 , wherein comparing, to identify the padded data, at least the portion of the first record of the first data set with the second record of the second data set includes: scanning from a back end of the first record toward a front end of the first record to resolve a character pattern configured to identify the padded data as a segment which mismatches the second record; and storing a mask derived from the character pattern to identify the padded data. 11. The method of claim 1 , wherein determining, by analyzing the first data set, that the first data set includes the first record having padded data includes: determining the second data set backs-up the first data set; determining that both the first data set and the second data set include a type of record that is non-keyed; and scanning at least the portion of the first record to resolve a segment other than a character pattern. 12. The method of claim 11 , wherein comparing, to identify the padded data, at least the portion of the first record of the first data set with the second record of the second data set includes: searching, using the segment from the first record of the first data set, the second data set for the segment; and determining the segment in the first record matches a like segment in the second record. 13. The method of claim 12 , wherein comparing, to identify the padded data, at least the portion of the first record of the first data set with the second record of the second data set includes: scanning at least the portion of the first record to resolve the character pattern; and determining, to identify the padded data, a mask derived from the character pattern represents a feature in which the first record mismatches the second record. 14. A system for managing data sets in a storage facility, comprising: a remote device; and a host device, at least one of the remote device and the host device including a managing module, the managing module comprising: a determining module to determine, by analyzing a first data set, that the first data set includes a first record having padded data, wherein padded data represents data added to a record to make a variable length record a fixed length record; a comparing module to compare, to identify the padded data, at least a portion of the first record of the first data set with a second record of a second data set; and a removing module to remove, from the first record of the first data set, the padded data identified in response to comparing at least the portion of the first record of the first data set with the second record of the second data set; wherein removing includes; deleting a segment of the first record matching a mask derived from a character pattern, and updating record length for the first record; loading the first record, without padded data, into a temporary file with the first data set; storing an original file, with padded data, as a retained file with the first data set; storing the temporary file, without padded data, with a name of the original file; and accessing the temporary file by the application, wherein the application could not read the first record with padded data. 15. A computer program product comprising a computer readable storage medium having a computer readable program stored therein, wherein the computer readable program, when executed on a first computing device, causes the first computing device to: determine, by analyzing a first data set, that the first data set includes a first record having padded data, wherein the padded data was added to the f
Single storage device · CPC title
Physics · mapped topic
Physics · mapped topic
Saving storage space on storage systems · CPC title
Physics · mapped topic
Related publications grouped by family.
Answers are generated from the same data shown on this page.