Hierarchical data skipping using data sketches

US11954085B1 · US · B1

Patent metadata
Field: Value
Publication number: US-11954085-B1
Application number: US-202217934440-A
Country: US
Kind code: B1
Filing date: Sep 22, 2022
Priority date: Sep 22, 2022
Publication date: Apr 9, 2024
Grant date: Apr 9, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computer implemented method performs data skipping in a hierarchically organized computing system. A group of processor units determines leaf node data sketches for data in leaf nodes in the hierarchically organized computing system. The leaf node data sketches summarize attributes of data in the leaf nodes. The group of processor units aggregates the leaf node data sketches at intermediate nodes in the hierarchically organized computing system to form aggregated data sketches at the intermediate nodes and retains data sketches received at the intermediate nodes from a group of child nodes to form retained data sketches. The retained data sketches are one of leaf node data sketches and the aggregated data sketches. The group of processor units searches the data using the retained data sketches and the data skipping within the hierarchically organized computing system in response to queries made to the intermediate nodes in the hierarchically organized computing system.
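The flow described in the abstract can be illustrated with a minimal sketch. It rests on assumptions that are not in the patent text: the data sketch is a simple min/max range over one numeric attribute, queries are closed ranges, and the names `RangeSketch`, `Node`, `build_sketch`, and `query` are hypothetical. Leaves summarize their data, intermediate nodes retain the sketch received from each child and merge them into an aggregated sketch, and a query descends only into children whose retained sketch could match.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class RangeSketch:
    """Hypothetical data sketch: min and max of one numeric attribute."""
    lo: float
    hi: float

    def merge(self, other: "RangeSketch") -> "RangeSketch":
        # Aggregating two sketches yields the range that covers both.
        return RangeSketch(min(self.lo, other.lo), max(self.hi, other.hi))

    def may_match(self, q_lo: float, q_hi: float) -> bool:
        # True when the query range overlaps the summarized range.
        return self.lo <= q_hi and q_lo <= self.hi


@dataclass
class Node:
    name: str
    data: list = field(default_factory=list)      # populated on leaves only in this sketch
    children: list = field(default_factory=list)
    retained: dict = field(default_factory=dict)  # child name -> sketch received from that child

    def build_sketch(self) -> RangeSketch:
        if not self.children:
            # Leaf node: summarize local data.
            return RangeSketch(min(self.data), max(self.data))
        aggregated = None
        for child in self.children:
            sketch = child.build_sketch()
            self.retained[child.name] = sketch    # retain what the child sent
            aggregated = sketch if aggregated is None else aggregated.merge(sketch)
        return aggregated

    def query(self, q_lo: float, q_hi: float, visited: list) -> list:
        if not self.children:
            visited.append(self.name)
            return [v for v in self.data if q_lo <= v <= q_hi]
        results = []
        for child in self.children:
            # Data skipping: a child whose retained sketch cannot match is not queried.
            if self.retained[child.name].may_match(q_lo, q_hi):
                results.extend(child.query(q_lo, q_hi, visited))
        return results


leaves = [Node("a", [1, 5]), Node("b", [10, 20]), Node("c", [30, 42]), Node("d", [50, 60])]
root = Node("root", children=[Node("m1", children=leaves[:2]), Node("m2", children=leaves[2:])])
root.build_sketch()

visited = []
hits = root.query(15, 35, visited)
print(hits, visited)
```

In this example only leaves `b` and `c` are queried; `a` and `d` are skipped because their retained ranges do not overlap the query range. A min/max range is only one possible summary; the abstract does not say which kind of sketch is used.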

First claim

Opening claim text (preview).

What is claimed is:

1. A computer implemented method for data skipping in a hierarchically organized computing system, the computer implemented method comprising: determining, by a group of processor units, leaf node data sketches for data in leaf nodes in the hierarchically organized computing system, wherein the leaf node data sketches summarize attributes of the data in the leaf nodes; aggregating, by the group of processor units, the leaf node data sketches at intermediate nodes in the hierarchically organized computing system to form aggregated data sketches at the intermediate nodes; retaining, by the group of processor units, data sketches received at the intermediate nodes from a group of child nodes to form retained data sketches, wherein the retained data sketches are one of leaf node data sketches and the aggregated data sketches; searching, by the group of processor units, the data using the retained data sketches and the data skipping within the hierarchically organized computing system in response to queries made to the intermediate nodes in the hierarchically organized computing system; wherein the leaf nodes in the hierarchically organized computing system are client devices and the intermediate nodes in the hierarchically organized computing system are server computers, and further comprising: updating, by the group of processor units, the leaf node data sketches from the data in the leaf nodes in response to an event to form updated leaf node data sketches; and aggregating, by the group of processor units, the updated leaf node data sketches at the intermediate nodes in the hierarchically organized computing system.

2. The computer implemented method of claim 1, wherein the event is one of a periodic event and a non-periodic event.

3. The computer implemented method of claim 1, wherein updating the leaf node data sketches from the data in the leaf nodes comprises: determining, by the group of processor units, the updated leaf node data sketches from the data in changed leaf nodes, which are a subset of the leaf nodes; replacing prior corresponding leaf node data sketches with the updated leaf node data sketches; and aggregating, by the group of processor units, the updated leaf node data sketches at the intermediate nodes in the hierarchically organized computing system to form updated aggregated data sketches in response to replacing the prior corresponding leaf node data sketches with the updated leaf node data sketches.

4. The computer implemented method of claim 1, wherein an intermediate node in the intermediate nodes is a data generator and the method further comprising: generating an intermediate data sketch for the data in the intermediate node; and aggregating child data sketches received from child nodes of the intermediate node and the intermediate data sketch generated by the intermediate node to form an aggregated data sketch.

5. The computer implemented method of claim 1 further comprising: receiving, by the group of processor units, a query at an intermediate node in the intermediate nodes, comparing, by the group of processor units, the query to the group of retained data sketches at the intermediate node; sending, by the group of processor units, the query to a set of child nodes in the group of child nodes based on a match of the query to a set of retained data sketches for the set of child nodes, wherein a child node in the group of child nodes is skipped when a retained data sketch received from the child node does not have a match to an associated retained data sketch at the intermediate node; aggregating, by the group of processor units, a group of results received in response to the query to form an aggregated result; and returning, by the group of processor units, the aggregated result to a requestor.

6. The computer implemented method of claim 5 further comprising: receiving, at a leaf node in the set of child nodes, the query sent from the intermediate node through a hierarchy in the hierarchically organized computing system; running the query on the data in the leaf node; and returning a result to the intermediate node through the hierarchy in the hierarchically organized computing system.

7. A computer system comprising: a group of processor units, wherein the group of processor units executes program instructions to: determine leaf node data sketches for data in leaf nodes in a hierarchically organized computing system, wherein the leaf node data sketches summarize attributes of the data in the leaf nodes; aggregate the leaf node data sketches at intermediate nodes in the hierarchically organized computing system to form aggregated data sketches at the intermediate nodes; retain data sketches received at the intermediate nodes from a group of child nodes to form retained data sketches, wherein the retained data sketches are one of leaf node data sketches and the aggregated data sketches; search the data using the retained data sketches and the data skipping within the hierarchically organized computing system in response to queries made to the intermediate nodes in the hierarchically organized computing system; wherein the leaf nodes in the hierarchically organized computing system are client devices and the intermediate nodes in the hierarchically organized computing system are server computers, and wherein the group of processor units executes program instructions to: update the leaf node data sketches from the data in the leaf nodes in response to an event to form updated leaf node data sketches; and aggregate the updated leaf node data sketches at the intermediate nodes in the hierarchically organized computing system.

8. The computer system of claim 7, wherein the event is one of a periodic event and a non-periodic event.

9. The computer system of claim 7, wherein as part of updating the leaf node data sketches from the data in the leaf nodes, the group of processor units executes program instructions to: determine the updated leaf node data sketches from the data in changed leaf nodes, which are a subset of the leaf nodes; replacing prior corresponding leaf node data sketches with the updated leaf node data sketches; and aggregate the updated leaf node data sketches at the intermediate nodes in the hierarchically organized computing system to form updated aggregated data sketches in response to replacing the prior corresponding leaf node data sketches with the updated leaf node data sketches.

10. The computer system of claim 7, wherein an intermediate node in the intermediate nodes is a data generator and the group of processor units executes program instructions to: generate an intermediate data sketch for the data in the intermediate node; and aggregate child data sketches received from child nodes of the intermediate node and the intermediate data sketch generated by the intermediate node to form an aggregate data sketch.

11. The computer system of claim 7, wherein the group of processor units executes program instructions to: receive a query at an intermediate node in the intermediate nodes, compare the query to the group of retained data sketches at the intermediate node; send the query to a set of child nodes in the group of child nodes based on a match of the query to a set of retained data sketches for the set of child nodes, wherein a child node in the group of child nodes is skipped when a retained data sketch received from the child node does not have a match to an associated retained data sketch at the intermediate node; aggregate a group of results received in response to the query to form an aggregated result; and returning, by the group of processor units, the aggregated resu
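The update steps in claims 1, 3, and 4 can be sketched as follows. This is an illustration under assumptions, not the claimed implementation: a sketch is a hypothetical `(min, max)` tuple over one numeric attribute, `Node.refresh` is an invented name, and every node that is refreshed is assumed to hold local data or at least one retained child sketch. When a leaf changes, only that leaf recomputes its sketch; each ancestor replaces the prior sketch it retained for that child and re-aggregates, including a sketch of its own data when the intermediate node is itself a data generator.

```python
def sketch_of(values):
    """Hypothetical sketch: (min, max) of one numeric attribute."""
    return (min(values), max(values))


def merge(sketches):
    """Aggregate several (min, max) sketches into one covering range."""
    sketches = list(sketches)
    return (min(s[0] for s in sketches), max(s[1] for s in sketches))


class Node:
    def __init__(self, name, parent=None, data=()):
        self.name = name
        self.parent = parent
        self.data = list(data)  # local data: leaves, or intermediate nodes that generate data
        self.retained = {}      # child name -> sketch last received from that child
        self.sketch = None      # this node's current (aggregated) sketch

    def refresh(self):
        """Recompute this node's sketch and propagate it toward the root."""
        parts = list(self.retained.values())
        if self.data:
            # Include a sketch of locally generated data alongside child sketches.
            parts.append(sketch_of(self.data))
        self.sketch = merge(parts)
        if self.parent is not None:
            # Replace the prior sketch retained for this child, then re-aggregate upward.
            self.parent.retained[self.name] = self.sketch
            self.parent.refresh()


# Two client devices under one server that also generates data of its own.
root = Node("root")
server = Node("server", parent=root, data=[100])
phone = Node("phone", parent=server, data=[3, 7])
laptop = Node("laptop", parent=server, data=[20, 25])
phone.refresh()
laptop.refresh()
before = root.sketch

# An event changes data on one leaf only; the unchanged leaf is not recomputed.
phone.data.append(250)
phone.refresh()
after = root.sketch
print(before, after)
```

Recomputing only along the path from the changed leaf to the root keeps the cost of an update proportional to the depth of the hierarchy; whether the event is periodic or non-periodic (claim 2) only affects when `refresh` would be called.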

Assignees

IBM

Inventors

Classifications

  • Trees, e.g. B+trees · CPC title

  • Aggregation; Duplicate elimination · CPC title

  • Hierarchical databases, e.g. IMS, LDAP data stores or Lotus Notes · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11954085B1 cover?
A computer implemented method performs data skipping in a hierarchically organized computing system. A group of processor units determines leaf node data sketches for data in leaf nodes in the hierarchically organized computing system. The leaf node data sketches summarize attributes of data in the leaf nodes. The group of processor units aggregates the leaf node data sketches at intermediate n…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F16/2246. Mapped technology areas include Physics.
When was this patent published?
Publication date Apr 9, 2024 (B1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 11 related publications on this page (citations in our corpus or others sharing the same primary CPC).