Dynamic column synopsis for analytical databases

US10678784B2 · US · B2

Patent metadata
Field               Value
Publication number  US-10678784-B2
Application number  US-201815861746-A
Country             US
Kind code           B2
Filing date         Jan 4, 2018
Priority date       May 18, 2016
Publication date    Jun 9, 2020
Grant date          Jun 9, 2020

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method, computer program product, and system for processing attribute value information for a data set. The method, computer program product, and system includes receiving a first data query on the data set. The first data query includes a condition on at least one attribute of the data set. While processing the first data query, data blocks containing records of the data set may be scanned. The data blocks contain first data blocks that are full. Attribute value information may be generated for the at least one attribute for the first data blocks. The attribute value information may be stored and a second data query involving a condition on at least one of the at least one attribute may be processed using the stored attribute value information.
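
The flow in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the block layout, the `BLOCK_CAPACITY` constant, the `range_query` function, and the choice of minimum/maximum values as the attribute value information are all assumptions made for illustration. The first query scans every block and records a (min, max) pair for each full block as a by-product; the second query uses those stored pairs to skip blocks that cannot contain a match.

```python
from typing import Dict, List, Tuple

BLOCK_CAPACITY = 4  # hypothetical: number of records in a "full" block

Record = Dict[str, int]
Synopsis = Dict[int, Tuple[int, int]]  # block index -> (min, max) of one attribute


def range_query(blocks: List[List[Record]], attr: str, lo: int, hi: int,
                synopsis: Synopsis) -> Tuple[List[Record], int]:
    """Return records with lo <= record[attr] <= hi and the number of blocks scanned."""
    matches: List[Record] = []
    scanned = 0
    for idx, block in enumerate(blocks):
        bounds = synopsis.get(idx)
        if bounds is not None and (bounds[1] < lo or bounds[0] > hi):
            continue  # stored min/max show that no record here can match
        scanned += 1
        matches.extend(rec for rec in block if lo <= rec[attr] <= hi)
        # Generate attribute value information as a by-product of the scan,
        # but only for full blocks that do not have it yet.
        if bounds is None and len(block) == BLOCK_CAPACITY:
            values = [rec[attr] for rec in block]
            synopsis[idx] = (min(values), max(values))
    return matches, scanned


blocks = [
    [{"price": p} for p in (1, 2, 3, 4)],
    [{"price": p} for p in (10, 11, 12, 13)],
    [{"price": p} for p in (20, 21)],  # not full, so no synopsis is built
]
synopsis: Synopsis = {}
first, scanned_first = range_query(blocks, "price", 0, 100, synopsis)
second, scanned_second = range_query(blocks, "price", 11, 12, synopsis)
print(scanned_first, scanned_second)  # 3 2
```

In this sketch the second query still scans the partially filled block because no synopsis exists for it, which mirrors the abstract's restriction to full blocks.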

First claim

Opening claim text (preview).

What is claimed is:

1. A computer program product for processing attribute value information for a data set, the computer program product comprising: one or more computer-readable storage medium and program instructions store on the one or more computer-readable storage media, the program instructions comprising: program instruction to receive a first data query on the data set, the first data query comprising a condition on an attribute of the data set; in response to processing the first data query, program instruction to scan data blocks containing records of the data set, the data blocks containing one or more data blocks that are full; program instruction to generate attribute value information for the attribute for each of the one or more data blocks; and program instruction to store the attribute value information for each of the one or more data blocks; program instructions to assign a status parameter to each data block of the one or more data blocks, the status parameter having a first value, a second value or a third value, the first value indicating a first state of the data block which identifies the associated data block has an associated attribute value information, the second value indicating a second state of the data block which identifies the associated data block does not have an associated attribute value information, the third value indicating a third state of the data block which identifies the associated data block does not have an associated attribute value information and the associated attribute value information is not to be generated; processing the first data query on the data set using the stored attribute value information; and program instruction to process a second data query on the data set using the stored attribute value information, the second data query comprising a second condition on the attribute.

2. The computer program product of claim 1, wherein program instruction to process the second data query comprises: program instruction to regenerate the attribute value information of the first data block in response to determining that a first data block of the one or more data blocks is modified.

3. The computer program product of claim 1, wherein program instruction to process the second data query further comprises: program instruction to compare the value of the attribute of a record to be inserted into a second data block of the one or more data blocks with the attribute value information of the second data block, and program instructions to assign a status parameter of the second data block to the second value.

4. The computer program product of claim 1, wherein program instruction to process the second data query further comprises: program instruction to delete attribute value information of a fourth data block of the one or more data blocks and program instructions to assign a status parameter of the fourth data block to the second value.

5. The computer program product of claim 1, wherein program instruction to process the second data query further comprises: program instruction to generate attribute value information of a fifth data block of the one or more data blocks.

6. The computer program product of claim 1, wherein processing the second data query further comprises: program instruction to write a record into a sixth data block of the one or more data blocks; and program instruction to set a status parameter of the sixth data block to the second value.

7. The computer program product of claim 1, further comprising: program instruction to provide a first storage device and a second storage device, the first storage device having higher data access performance than the second storage device; program instruction to determine an access frequency to an eighth data block of the one or more data blocks; and program instruction to store attribute value information of the eighth data block in the first storage device based on the determined access frequency of the eighth data block.

8. The computer program product of claim 1, further comprising: program instruction to determine an access frequency to a ninth data block of the one or more data blocks; and program instruction to delete attribute value information of the ninth data block dependent upon the determined access frequency being smaller than a predetermined frequency threshold.

9. The computer program product of claim 8, further comprising: program instruction to assign a counter to the ninth data block; program instruction to increment the counter upon receiving a data query for accessing the ninth data block; and program instruction to regenerate the attribute value information for the ninth data block when the counter exceeds a minimum number of accesses.

10. The computer program product of claim 1, the attribute value information comprising: a minimum attribute value information associated with each data block of the one or more data blocks; and a maximum attribute value information associated with each data block of the set of data blocks.

11. The computer program product of claim 1, wherein the second data query comprises a read request.

12. The computer program product of claim 1, further comprising: the status parameter having a fourth value indicating a fourth state of the data block which identifies the associated data block requires an updating of the associated attribute value information.

13. The computer program product of claim 1, wherein program instruction to process the first data query further comprises: program instruction to lock the attribute value information of each block of the one or more data blocks; and program instruction to generate attribute value information comprises reading a lock status of each block of the one or more data blocks.

14. The computer program product of claim 13, wherein: program instruction to process the second data query occurs prior to a completion of program instruction to process the first data query; and program instruction to prevent access to a lock status of each data block of the one or more data blocks.

15. The computer program product of claim 13, further comprising: program instruction to unlock the attribute value information of each block of the one or more data blocks upon completion of program instruction to process of the first data query.

16. A computer program product for processing attribute value information for a data set, the computer program product comprising: one or more computer-readable storage medium and program instructions store on the one or more computer-readable storage media, the program instructions comprising: receiving a first data query on the data set, the first data query comprising a condition on an attribute of the data set; in response to program instructions to process the first data query, program instructions to scan a status parameter of each data block of the one or more data blocks data blocks of the data set; in response to a status parameter having a first value, program instructions to process the first data query using attribute value information of the associated data block; in response to a status parameter having a second value, program instructions to scan data blocks of the associated data block, program instructions to generate attribute value information, program instructions to store the attribute value information, program instructions to assign a first value to the status parameter, and program instructions to process the first data query using attribute value information of the associated data block; and in
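
The per-block status parameter described in the claims can be pictured as a small state machine. The Python sketch below is only an illustration under stated assumptions: the names `Status`, `Block`, and `query` are hypothetical, min/max bounds stand in for the attribute value information, the fullness check of claim 1 is omitted for brevity, and leaving a third-state block untouched on write is a choice made here, not something the claim text specifies.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List, Optional, Tuple


class Status(Enum):
    HAS_INFO = 1  # first value: attribute value information exists
    NO_INFO = 2   # second value: none stored, may be generated
    NEVER = 3     # third value: none stored, and not to be generated


@dataclass
class Block:
    values: List[int] = field(default_factory=list)
    status: Status = Status.NO_INFO
    bounds: Optional[Tuple[int, int]] = None  # (min, max), assumed form of the information

    def write(self, value: int) -> None:
        """Append a record; stored information becomes stale (compare claim 6)."""
        self.values.append(value)
        if self.status is Status.HAS_INFO:
            self.status, self.bounds = Status.NO_INFO, None
        # Assumption: a NEVER block stays NEVER after a write.


def query(blocks: List[Block], lo: int, hi: int) -> List[int]:
    """Range query that checks each block's status first (compare claim 16)."""
    out: List[int] = []
    for b in blocks:
        if b.status is Status.NO_INFO and b.values:
            # Second state: scan, generate, store, then move to the first state.
            b.bounds = (min(b.values), max(b.values))
            b.status = Status.HAS_INFO
        if b.status is Status.HAS_INFO and b.bounds is not None:
            if b.bounds[1] < lo or b.bounds[0] > hi:
                continue  # the stored bounds rule this block out
        out.extend(v for v in b.values if lo <= v <= hi)
    return out


blocks = [Block([1, 2, 3]), Block([50, 60]), Block([7, 8], status=Status.NEVER)]
before = query(blocks, 55, 100)  # builds bounds for the first two blocks
blocks[0].write(70)              # invalidates the first block's bounds
after = query(blocks, 55, 100)   # regenerates them on the next query
print(before, after)  # [60] [70, 60]
```

A block in the third state is always scanned and never gains bounds, which is one way to read "not to be generated"; an access counter as in claims 8 and 9 could be added as a further field on the hypothetical `Block` class.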

Assignees

IBM

Inventors

Classifications

  • Iterative querying; Query formulation based on the results of a preceding query · CPC title

  • Column-oriented storage; Management thereof · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US10678784B2 cover?
A method, computer program product, and system for processing attribute value information for a data set. The method, computer program product, and system includes receiving a first data query on the data set. The first data query includes a condition on at least one attribute of the data set. While processing the first data query, data blocks containing records of the data set may be scanned. …
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F16/2425. Mapped technology areas include Physics.
When was this patent published?
Publication date Jun 9, 2020 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).