Hierarchical data compression and computation
US-2016197622-A1 · Jul 7, 2016 · US
US9836473B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9836473-B2 |
| Application number | US-201414505730-A |
| Country | US |
| Kind code | B2 |
| Filing date | Oct 3, 2014 |
| Priority date | Oct 3, 2014 |
| Publication date | Dec 5, 2017 |
| Grant date | Dec 5, 2017 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
According to embodiments of the present invention, machines, systems, methods and computer program products for hardware acceleration are presented. A plurality of computational nodes for processing data is provided, each node performing a corresponding operation for data received at that node. A metric module is used to determine a compression benefit metric pertaining to performance of the corresponding operations of one or more computational nodes with recompressed data. An accelerator module recompresses data for processing by the one or more computational nodes based on the compression benefit metric indicating a benefit gained by using the recompressed data. A distribution function may be used to distribute data among a plurality of nodes.
Opening claim text (preview).
What is claimed is: 1. A data processing system comprising: a first processor implementing a plurality of computational nodes, each node performing a corresponding operation in a data flow for data received at that node; and a second special-purpose processor, wherein the first processor and second special-purpose processor are configured to: determine, via the first processor, a compression benefit metric pertaining to performance of the corresponding operations of one or more of said plurality of computational nodes in the data flow with recompressed data, wherein the first processor is configured to determine a priority for recompressing data based on the compression benefit metric; recompress the data according to the priority, via the second special-purpose processor, for processing by the one or more computational nodes in the data flow based on the compression benefit metric indicating a benefit in processing performance gained by using the recompressed data for computational operations; and perform the corresponding operation using the recompressed data, via the first processor, to provide the benefit in processing performance. 2. The data processing system of claim 1 , wherein the compression benefit metric is determined using a measure of data compression preserved by one or more destination computational nodes. 3. The data processing system of claim 1 , wherein the compression benefit metric is determined using a measure of an estimated reduction in a size of the data resulting from recompression. 4. The data processing system of claim 1 , wherein the compression benefit metric is determined using a measure of an estimated computational benefit from a destination node performing an operation on recompressed data. 5. The data processing system of claim 1 , wherein the compression benefit metric is determined using a measure of a computational benefit of a destination node performing an operation on data in a particular compressed form. 6. The data processing system of claim 1 , wherein one or more of said plurality of computational nodes process data without recompression in response to data awaiting recompression by the second special-purpose processor and being unavailable for processing. 7. The data processing system of claim 1 , wherein the second special-purpose processor is configured to: decompress compressed data; and compress the decompressed data and produce recompressed data. 8. The data processing system of claim 1 , wherein the second special-purpose processor includes a Field Programmable Gate Array (FPGA) or an Application Specific Integrated Circuit (ASIC) to recompress the data. 9. The data processing system of claim 1 , further including a plurality of processors interconnected by a network to process data in parallel, wherein each processor is configured to: apply a distribution function to distribute data among the plurality of processors; compress data prior to transmission to the plurality of processors; and compress data received from the plurality of processors. 10. A computer program product for processing data using a plurality of computational nodes, each node performing a corresponding operation in a data flow for data received at that node and implemented by a first processor and utilizing a second special-purpose processor, wherein the computer program product comprises a computer readable storage medium having computer readable program code embodied therewith, the computer readable program code, when executed by the first processor and second special-purpose processor, causing the first processor and second special-purpose processor to: determine, via the first processor, a compression benefit metric pertaining to performance of the corresponding operations of one or more of said plurality of computational nodes in the data flow with recompressed data, wherein determining a compression benefit metric further determines a priority for recompressing data based on the compression benefit metric; recompress the data according to the priority, via the second special-purpose processor, for processing by the one or more computational nodes in the data flow based on the compression benefit metric indicating a benefit in processing performance gained by using the recompressed data for computational operations; and perform the corresponding operation using the recompressed data, via the first processor, to provide the benefit in processing performance. 11. The computer program product of claim 10 , wherein the computer readable program code is configured to cause the first processor to: determine the compression benefit metric using one or more of the following: (a) a measure of data compression preserved by one or more destination computational nodes; (b) a measure of an estimated reduction in a size of the data resulting from recompression; (c) a measure of an estimated computational benefit from a destination node performing an operation on recompressed data; and (d) a measure of a computational benefit of a destination node performing an operation on data in a particular compressed form. 12. The computer program product of claim 10 , wherein the computer readable program code is configured to cause the first processor and second special-purpose processor to: apply a distribution function to distribute data among a plurality of processors interconnected by a network to process data in parallel; compress data prior to transmission to the plurality of processors; and compress data received from the plurality of processors. 13. The computer program product of claim 10 , wherein the computer readable program code is configured to cause one or more of said plurality of computational nodes to process data without recompression in response to data awaiting recompression by the second special-purpose processor and being unavailable for processing. 14. The computer program product of claim 10 , wherein the computer readable program code is configured to cause the second special-purpose processor to: decompress compressed data; and compress the decompressed data and produce recompressed data. 15. The computer program product of claim 10 , wherein the second special-purpose processor includes a Field Programmable Gate Array (FPGA) or an Application Specific Integrated Circuit (ASIC) to recompress the data.
Physics · mapped topic
Physics · mapped topic
Physics · mapped topic
Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors · CPC title
Ensuring data consistency and integrity · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.