Apparatus and method for identifying performance bottlenecks in pipeline parallel processing environment
US-2015193368-A1 · Jul 9, 2015 · US
US9424160B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9424160-B2 |
| Application number | US-201514671102-A |
| Country | US |
| Kind code | B2 |
| Filing date | Mar 27, 2015 |
| Priority date | Mar 18, 2014 |
| Publication date | Aug 23, 2016 |
| Grant date | Aug 23, 2016 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Data flow disruptions over a series of data processing operators can be detected by a computer system that generates a profile for data flow at an operator. The profile can include data input, processing, and output wait times. Using the profile, the system can detect potential flow disruptions. If the potential disruption satisfies a rule, it is considered a data flow disruption and a recommendation associated with the satisfied rule is identified. The recommendation and the operator identity is displayed.
Opening claim text (preview).
What is claimed is: 1. A computer implemented method for detecting data flow disruptions over a series of data processing operators that are each configured to receive and store data in an input record block, process data from the input record block, store results of the processing in an output record block, and output data from the output record block to a next processing operator in the series, the method comprising: generating, for a particular processing operator in the series of data processing operators, a processing operator timing profile that includes: an input wait time based upon a period of time that a particular data set is stored in a particular input data record, an operator processing time based upon a period of time between a start of processing of the particular data set by the particular processing operator and a completion of the processing of the particular data set by the particular processing operator, and an output wait time based upon a period of time that the particular data set is stored in a particular output data record block; detecting, from the processing operator timing profile, a potential flow disruption condition; determining that the processing operator timing profile satisfies at least one rule from a set of flow disruption rules that are each associated with at least one corresponding recommendation; identifying, based on the at least one rule, a corresponding recommendation; and displaying, in response to identifying the corresponding recommendation, an identity of the particular processing operator and the corresponding recommendation, wherein the series of data processing operators are part of a system of operators working in a parallel processing environment, wherein the parallel processing environment comprises a conductor, a plurality of section leaders, and a plurality of players, and wherein a section leader of the plurality of section leaders is configured to create a record block I/O monitoring thread to implement the generating, for the particular processing operator in the series of processing operators, the processing operator timing profile in the parallel processing environment. 2. The computer implemented method of claim 1 , wherein the displaying the identity of the particular processing operator and the corresponding recommendation occurs via a user interface, the user interface part of a job execution design recommendation of a performance management system. 3. The computer implemented method of claim 1 , further comprising, in response to the identifying the corresponding recommendation, implementing the recommendation, based on settings in a performance management system. 4. A computer system for detecting data flow disruptions over a series of data processing operators that are each configured to receive and store data in an input record block, process data from the input record block, store results of the processing in an output record block, and output data from the output record block to a next processing operator in the series, the system comprising: at least one processor circuit configured to: generate, for a particular processing operator in the series of data processing operators, a processing operator timing profile that includes: an input wait time based upon a period of time that a particular data set is stored in a particular input data record, an operator processing time based upon a period of time between a start of processing of the particular data set by the particular processing operator and a completion of the processing of the particular data set by the particular processing operator, and an output wait time based upon a period of time that the particular data set is stored in a particular output data record block; detect, from the processing operator timing profile, a potential flow disruption condition; determine that the processing operator timing profile satisfies at least one rule from a set of flow disruption rules that are each associated with at least one corresponding recommendation; identify, based on the at least one rule, a corresponding recommendation; and display, in response to identifying the corresponding recommendation, an identity of the particular processing operator and the corresponding recommendation, wherein the series of data processing operators are part of a system of operators working in a parallel processing environment, wherein the parallel processing environment comprises a conductor, a plurality of section leaders, and a plurality of players, and wherein a section leader of the plurality of section leaders is configured to create a record block I/O monitoring thread to implement the generating, for the particular processing operator in the series of processing operators, the processing operator timing profile in the parallel processing environment. 5. The computer system of claim 4 , wherein the at least one processor circuit is configured to display the identity of the particular processing operator and the corresponding recommendation via a user interface, the user interface part of a job execution design recommendation of a performance management system. 6. The computer system of claim 4 , wherein the at least one processor circuit is further configured implement the recommendation, in response to the identifying the corresponding recommendation, based on settings in a performance management system. 7. A computer program product for detecting data flow disruptions over a series of data processing operators that are each configured to receive and store data in an input record block, process data from the input record block, store results of the processing in an output record block, and output data from the output record block to a next processing operator in the series, the computer program product comprising a computer readable storage medium having program instructions embodied therewith, wherein the computer readable storage medium is not a transitory signal per se, the program instructions executable by a computer processing circuit to cause the circuit to perform the method comprising: generating, for a particular processing operator in the series of data processing operators, a processing operator timing profile that includes: an input wait time based upon a period of time that a particular data set is stored in a particular input data record, an operator processing time based upon a period of time between a start of processing of the particular data set by the particular processing operator and a completion of the processing of the particular data set by the particular processing operator, and an output wait time based upon a period of time that the particular data set is stored in a particular output data record block; detecting, from the processing operator timing profile, a potential flow disruption condition; determining that the processing operator timing profile satisfies at least one rule from a set of flow disruption rules that are each associated with at least one corresponding recommendation; identifying, based on the at least one rule, a corresponding recommendation; and displaying, in response to identifying the corresponding recommendation, an identity of the particular processing operator and the corresponding recommendation, wherein the series of data processing operators are part of a system of operators working in a parallel processing environment, wherein the parallel processing environment comprises a conductor, a plurality of section leaders, and a plurality of players, and wherein a section leader of the plurality of section leaders is configured to create a record block I/O monitoring thread to implement the generating, for the particular processing operator in the series of processing operators, the processing operator
by assessing time · CPC title
Error detection; Error correction; Monitoring (error detection, correction or monitoring in information storage based on relative movement between record carrier and transducer G11B20/18; monitoring, i.e. supervising the progress of recording or reproducing G11B27/36; in static stores G11C29/00) · CPC title
Techniques for rebalancing the load in a distributed system · CPC title
Visualisation of programs or trace data · CPC title
for I/O devices · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.