Aggregating and summarizing sequences of hierarchical records

US9928271B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9928271-B2
Application numberUS-201615194818-A
CountryUS
Kind codeB2
Filing dateJun 28, 2016
Priority dateJun 25, 2015
Publication dateMar 27, 2018
Grant dateMar 27, 2018

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Sequences of hierarchical records are aggregated and summarized. A capture log that includes a plurality of operations of a workload is received. A first data structure that models transaction types as sequences of nodes is created. The nodes identify operations in the workload. A present operation and a transaction identifier are read from the capture log. The transaction identifier is dissociated from a first node that identifies a prior operation. The transaction identifier is associated with a second node that identifies the present operation. In a second data structure that associates nodes with transaction identifiers, the first node is dissociated from the transaction identifier and the second node is associated with the transaction identifier. A summary of the workload is generated based, at least in part, on the first and second data structures. The summary includes signatures of transaction types and counts of instances of the transaction types.

First claim

Opening claim text (preview).

What is claimed is: 1. A method comprising: receiving, by one or more computer processors, a capture log of a workload, wherein the workload includes a sequence of operations that are associated with a plurality of transactions, and wherein the plurality of transactions are instances of transaction types; identifying, by one or more computer processors, a first data structure that models, at least in part, the transaction types as sequences of nodes, wherein each node identifies a respective operation in the sequence of operations in the workload, and wherein the first data structure is stored in one or more primary memories such that an out-of-memory condition is not triggered; reading, by one or more computer processors, a task of an uncommitted transaction from the capture log, wherein a present operation and a transaction identifier are associated with the task; dissociating, by one or more computer processors, the transaction identifier from a first node of the first data structure, wherein the first node identifies a prior operation of the uncommitted transaction; associating, by one or more computer processors, the transaction identifier with a second node of the first data structure, wherein the second node identifies the present operation of the uncommitted transaction; dissociating, by one or more computer processors, in a second data structure, the first node from the transaction identifier, wherein the first node was associated with the transaction identifier in the second data structure based, at least in part, on the prior association between the first node and the transaction identifier in the first data structure, and wherein the second data structure is stored in the one or more primary memories such that the out-of-memory condition is not triggered; associating, by one or more computer processors, in the second data structure, the second node with the transaction identifier, wherein the present operation represents a most-recently-identified operation and, from among the sequences of nodes, each transaction identifier in the second data structure is associated with only a node representing a respective most-recently-identified operation based, at least in part, on associations between each transaction identifier and respective nodes in the sequences of nodes of the first data structure; and generating, by one or more computer processors, based, at least in part, on the first and the second data structures, a summary of the workload that includes signatures of the transaction types and a count of the instances of each of the transaction types. 2. The method of claim 1 , wherein the first data structure is a hierarchical data structure that includes one or more trees and wherein creating the first data structure comprises: associating, by one or more computer processors, a first plurality of operations of the transaction types with root nodes of one or more trees that represent the signatures of the transaction types; and associating, by one or more computer processors, a second plurality of operations of the transaction types with descendant nodes of the root nodes. 3. The method of claim 2 , wherein the signatures of the transaction types are determined by traversing the trees. 4. The method of claim 3 , wherein the trees are traversed by a walk selected from the group consisting of a pre-order walk, a post-order walk, an in-order walk, and a level-order walk. 5. The method of claim 1 , wherein the second data structure is a hash table. 6. The method of claim 1 , wherein the first data structure and the second data structure are stored completely in memory. 7. The method of claim 1 , wherein a memory footprint of the first data structure increases linearly, and wherein the memory footprint is based, at least in part, on (i) a count of operations in the workload, (ii) a count of instances of transaction types, and (iii) a fraction of transaction types.

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9928271B2 cover?
Sequences of hierarchical records are aggregated and summarized. A capture log that includes a plurality of operations of a workload is received. A first data structure that models transaction types as sequences of nodes is created. The nodes identify operations in the workload. A present operation and a transaction identifier are read from the capture log. The transaction identifier is dissoci…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F17/30377. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 27 2018 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).