Detection of data flow bottlenecks and disruptions based on operator timing profiles in a parallel processing environment

US9424160B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-9424160-B2
Application numberUS-201514671102-A
CountryUS
Kind codeB2
Filing dateMar 27, 2015
Priority dateMar 18, 2014
Publication dateAug 23, 2016
Grant dateAug 23, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Data flow disruptions over a series of data processing operators can be detected by a computer system that generates a profile for data flow at an operator. The profile can include data input, processing, and output wait times. Using the profile, the system can detect potential flow disruptions. If the potential disruption satisfies a rule, it is considered a data flow disruption and a recommendation associated with the satisfied rule is identified. The recommendation and the operator identity is displayed.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer implemented method for detecting data flow disruptions over a series of data processing operators that are each configured to receive and store data in an input record block, process data from the input record block, store results of the processing in an output record block, and output data from the output record block to a next processing operator in the series, the method comprising: generating, for a particular processing operator in the series of data processing operators, a processing operator timing profile that includes: an input wait time based upon a period of time that a particular data set is stored in a particular input data record, an operator processing time based upon a period of time between a start of processing of the particular data set by the particular processing operator and a completion of the processing of the particular data set by the particular processing operator, and an output wait time based upon a period of time that the particular data set is stored in a particular output data record block; detecting, from the processing operator timing profile, a potential flow disruption condition; determining that the processing operator timing profile satisfies at least one rule from a set of flow disruption rules that are each associated with at least one corresponding recommendation; identifying, based on the at least one rule, a corresponding recommendation; and displaying, in response to identifying the corresponding recommendation, an identity of the particular processing operator and the corresponding recommendation, wherein the series of data processing operators are part of a system of operators working in a parallel processing environment, wherein the parallel processing environment comprises a conductor, a plurality of section leaders, and a plurality of players, and wherein a section leader of the plurality of section leaders is configured to create a record block I/O monitoring thread to implement the generating, for the particular processing operator in the series of processing operators, the processing operator timing profile in the parallel processing environment. 2. The computer implemented method of claim 1 , wherein the displaying the identity of the particular processing operator and the corresponding recommendation occurs via a user interface, the user interface part of a job execution design recommendation of a performance management system. 3. The computer implemented method of claim 1 , further comprising, in response to the identifying the corresponding recommendation, implementing the recommendation, based on settings in a performance management system. 4. A computer system for detecting data flow disruptions over a series of data processing operators that are each configured to receive and store data in an input record block, process data from the input record block, store results of the processing in an output record block, and output data from the output record block to a next processing operator in the series, the system comprising: at least one processor circuit configured to: generate, for a particular processing operator in the series of data processing operators, a processing operator timing profile that includes: an input wait time based upon a period of time that a particular data set is stored in a particular input data record, an operator processing time based upon a period of time between a start of processing of the particular data set by the particular processing operator and a completion of the processing of the particular data set by the particular processing operator, and an output wait time based upon a period of time that the particular data set is stored in a particular output data record block; detect, from the processing operator timing profile, a potential flow disruption condition; determine that the processing operator timing profile satisfies at least one rule from a set of flow disruption rules that are each associated with at least one corresponding recommendation; identify, based on the at least one rule, a corresponding recommendation; and display, in response to identifying the corresponding recommendation, an identity of the particular processing operator and the corresponding recommendation, wherein the series of data processing operators are part of a system of operators working in a parallel processing environment, wherein the parallel processing environment comprises a conductor, a plurality of section leaders, and a plurality of players, and wherein a section leader of the plurality of section leaders is configured to create a record block I/O monitoring thread to implement the generating, for the particular processing operator in the series of processing operators, the processing operator timing profile in the parallel processing environment. 5. The computer system of claim 4 , wherein the at least one processor circuit is configured to display the identity of the particular processing operator and the corresponding recommendation via a user interface, the user interface part of a job execution design recommendation of a performance management system. 6. The computer system of claim 4 , wherein the at least one processor circuit is further configured implement the recommendation, in response to the identifying the corresponding recommendation, based on settings in a performance management system. 7. A computer program product for detecting data flow disruptions over a series of data processing operators that are each configured to receive and store data in an input record block, process data from the input record block, store results of the processing in an output record block, and output data from the output record block to a next processing operator in the series, the computer program product comprising a computer readable storage medium having program instructions embodied therewith, wherein the computer readable storage medium is not a transitory signal per se, the program instructions executable by a computer processing circuit to cause the circuit to perform the method comprising: generating, for a particular processing operator in the series of data processing operators, a processing operator timing profile that includes: an input wait time based upon a period of time that a particular data set is stored in a particular input data record, an operator processing time based upon a period of time between a start of processing of the particular data set by the particular processing operator and a completion of the processing of the particular data set by the particular processing operator, and an output wait time based upon a period of time that the particular data set is stored in a particular output data record block; detecting, from the processing operator timing profile, a potential flow disruption condition; determining that the processing operator timing profile satisfies at least one rule from a set of flow disruption rules that are each associated with at least one corresponding recommendation; identifying, based on the at least one rule, a corresponding recommendation; and displaying, in response to identifying the corresponding recommendation, an identity of the particular processing operator and the corresponding recommendation, wherein the series of data processing operators are part of a system of operators working in a parallel processing environment, wherein the parallel processing environment comprises a conductor, a plurality of section leaders, and a plurality of players, and wherein a section leader of the plurality of section leaders is configured to create a record block I/O monitoring thread to implement the generating, for the particular processing operator in the series of processing operators, the processing operator

Assignees

Inventors

Classifications

  • by assessing time · CPC title

  • Error detection; Error correction; Monitoring (error detection, correction or monitoring in information storage based on relative movement between record carrier and transducer G11B20/18; monitoring, i.e. supervising the progress of recording or reproducing G11B27/36; in static stores G11C29/00) · CPC title

  • Techniques for rebalancing the load in a distributed system · CPC title

  • Visualisation of programs or trace data · CPC title

  • for I/O devices · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9424160B2 cover?
Data flow disruptions over a series of data processing operators can be detected by a computer system that generates a profile for data flow at an operator. The profile can include data input, processing, and output wait times. Using the profile, the system can detect potential flow disruptions. If the potential disruption satisfies a rule, it is considered a data flow disruption and a recommen…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F11/3419. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Aug 23 2016 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).