Hierarchical data compression and computation
US-2016204798-A1 · Jul 14, 2016 · US
US9514148B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9514148-B2 |
| Application number | US-201615077239-A |
| Country | US |
| Kind code | B2 |
| Filing date | Mar 22, 2016 |
| Priority date | Sep 30, 2014 |
| Publication date | Dec 6, 2016 |
| Grant date | Dec 6, 2016 |
Official abstract text for this publication.
According to embodiments of the present invention, machines, systems, methods and computer program products for hierarchical compression of data are presented comprising creating a compression hierarchy of compression nodes, wherein each compression node is associated with a compression operation to produce compressed data. An output of any of the compression nodes may be compressed by another compression node or the same compression node. A path of one or more compression nodes is determined through said compression hierarchy based upon compression statistics to compress data, and the data is compressed by the compression nodes of the path. Various computational techniques are presented herein for manipulating the compression hierarchy to defer or reduce computation during query evaluation.
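The path selection described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the patent: the two node types (`delta` and `rle`), the fixed list `CANDIDATE_PATHS`, and the crude `bits_needed` size estimate that stands in for the "compression statistics". The sketch only shows the idea that one node's output can feed another node, and that the path with the best estimated compression is chosen.

```python
from itertools import groupby

def rle_encode(values):
    """Split a sequence into run-length fields: (values, lengths)."""
    vals, lens = [], []
    for v, grp in groupby(values):
        vals.append(v)
        lens.append(sum(1 for _ in grp))
    return vals, lens

def delta_encode(values):
    """Keep the first value, then store successive differences."""
    if not values:
        return []
    return [values[0]] + [b - a for a, b in zip(values, values[1:])]

def bits_needed(values):
    """Hypothetical statistic: every entry stored at the width of the widest one."""
    if not values:
        return 0
    width = max(max(abs(v) for v in values).bit_length(), 1)
    if any(v < 0 for v in values):
        width += 1  # one extra bit for the sign
    return width * len(values)

def path_cost(path, data):
    """Estimate the compressed size in bits when data flows through the path."""
    cost = 0
    current = list(data)
    for node in path:
        if node == "delta":
            current = delta_encode(current)
        elif node == "rle":
            # The lengths field is costed here; the values field flows onward.
            current, lengths = rle_encode(current)
            cost += bits_needed(lengths)
        else:
            raise ValueError(f"unknown node: {node}")
    return cost + bits_needed(current)

# Assumed candidate paths; a real hierarchy could be deeper or recursive.
CANDIDATE_PATHS = [(), ("rle",), ("delta",), ("delta", "rle")]

def choose_path(data):
    """Pick the candidate path with the smallest estimated size."""
    return min(CANDIDATE_PATHS, key=lambda p: path_cost(p, data))
```

Under this cost model, a steadily increasing column such as `list(range(1000, 1100))` selects `("delta", "rle")`, because delta encoding turns it into one long run of ones, while a constant column selects `("rle",)`.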
Opening claim text (preview).
What is claimed is:
1. A method of hierarchical compression comprising:
creating a compression hierarchy of compression nodes, wherein each compression node is associated with a compression operation to produce compressed data, and wherein creating the compression hierarchy of compression nodes comprises: creating a run length encoding compression node in the compression hierarchy;
determining a path of one or more compression nodes through said compression hierarchy, based upon compression statistics, to compress data; and
compressing the data by the compression nodes of the path, wherein compressing the data comprises: performing a filtering operation on the data using the run length encoding compression node, wherein the filtering operation is performed in part by applying a bitmask to a values field of the run length encoding compression node.
2. The method of claim 1, further comprising selecting a path of the compression hierarchy having a measure of compression higher than any of the other paths.
3. The method of claim 1, wherein the compression nodes are configured to compress one or more data types selected from a group consisting of: integer, character and double.
4. The method of claim 1, wherein each compression node is selected from a group consisting of: a bitshaved compression node, a dictionary compression node, a run length encoding compression node, a character-based compression node and a delta compression node.
5. The method of claim 1, wherein compressing the data further comprises: performing part of a join operation on the data using the run length encoding compression node, wherein the join operation is performed by applying input data to a values field of the run length encoding compression node and by applying repeat counts to a lengths field of the run length encoding compression node.
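One possible reading of the filtering and join operations in claims 1 and 5 can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: the `RLENode` class, the function names, and the choice to represent the bitmask as a Python list of booleans (one entry per run) are all hypothetical. The point of the sketch is that a predicate is evaluated once per run on the values field rather than once per row, and that a join result can be expressed by pairing input values with repeat counts.

```python
from dataclasses import dataclass
from itertools import groupby

@dataclass
class RLENode:
    values: list   # one entry per run
    lengths: list  # repeat count of each run

def rle_compress(data):
    """Build the values and lengths fields from raw rows."""
    values, lengths = [], []
    for v, grp in groupby(data):
        values.append(v)
        lengths.append(sum(1 for _ in grp))
    return RLENode(values, lengths)

def rle_decompress(node):
    """Expand the runs back into individual rows."""
    out = []
    for v, n in zip(node.values, node.lengths):
        out.extend([v] * n)
    return out

def filter_runs(node, predicate):
    """Evaluate the predicate once per run, then apply the resulting mask."""
    mask = [bool(predicate(v)) for v in node.values]  # one bit per run
    kept_values = [v for v, keep in zip(node.values, mask) if keep]
    kept_lengths = [n for n, keep in zip(node.lengths, mask) if keep]
    return RLENode(kept_values, kept_lengths), mask

def join_as_rle(input_rows, repeat_counts):
    """Express a join result: each input row repeated by its match count."""
    if len(input_rows) != len(repeat_counts):
        raise ValueError("one repeat count is required per input row")
    return RLENode(list(input_rows), list(repeat_counts))
```

For example, filtering `rle_compress([5, 5, 5, 2, 2, 9, 9, 9, 9])` with `lambda v: v >= 5` tests three runs instead of nine rows, and `join_as_rle(["a", "b", "c"], [2, 0, 1])` decompresses to `["a", "a", "c"]` without materializing the repeated rows until they are needed.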
Conversion to or from run-length codes, i.e. by representing the number of consecutive digits, or groups of digits, of the same kind by a code word and a digit indicative of that kind · CPC title
Physics · mapped topic
Encoder aspects · CPC title