Parallel execution of stateful black box operators

US12293216B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12293216-B2
Application numberUS-202117554802-A
CountryUS
Kind codeB2
Filing dateDec 17, 2021
Priority dateDec 17, 2021
Publication dateMay 6, 2025
Grant dateMay 6, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Some embodiments provide a system and method to receive, as an input, configuration properties of a group of operators of a data pipeline, the data pipeline including a specified multiplicity greater than one (1); generate, as an output, a configuration for two new operators, including a first new operator and a second new operator; and automatically insert the first new operator and the second new operator into a deployment of the data pipeline, the first new operator being inserted before a number of replicas of the group of operators of the data pipeline corresponding to the specified multiplicity and the second new operator being inserted after the number of replicas of the group of operators of the data pipeline corresponding to the specified multiplicity.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method, the method comprising: receiving, as an input, configuration properties of a group of operators of a data pipeline, the data pipeline including a specified multiplicity greater than 1; generating, as an output, a configuration for two new operators, including a first new operator that transforms a dataset output of a preceding operator according to the configuration properties of the group of operators of the data pipeline and dispatches the transformed dataset to the replicas of the group of operators of the data pipeline and a second new operator that collects results of the replicas of the group of operators of the data pipeline and computes, based on the configuration properties of the group of operators of the data pipeline and an order preserving requirement, a result dataset that it provides to a succeeding operator; and automatically inserting the first new operator and the second new operator into a deployment of the data pipeline, the first new operator being inserted before a number of replicas of the group of operators of the data pipeline corresponding to the specified multiplicity and the second new operator being inserted after the number of replicas of the group of operators of the data pipeline corresponding to the specified multiplicity. 2. The method of claim 1 , wherein the configuration properties specify properties defining a parallel computation of an execution of the group of operators of the data pipeline. 3. The method of claim 2 , wherein the configuration properties comprise: characteristics of a restructure configuration of an input dataset into another dataset; characteristics of a transduction configuration of an input dataset with respect to an incremental state; and characteristics of an aggregation configuration of results obtained for each element of the input dataset. 4. The method of claim 1 , further comprising receiving a predefined indication of a value for at least one of the configuration properties. 5. The method of claim 1 , further comprising executing of the group of operators of the deployed data pipeline having the first new operator and the second new operator inserted therein, wherein the group of operators are executed in parallel. 6. The method of claim 1 , wherein an execution of the group of operators of the deployed data pipeline having the first new operator and the second new operator inserted therein generates a result equivalent to a result generated using a deployment of the data pipeline without any replicas of the group of operators of the data pipeline. 7. A system comprising: a memory storing processor-executable program code; and a processing unit to execute the processor-executable program code to cause the system to: receive, as an input, configuration properties of a group of operators of a data pipeline, the data pipeline including a specified multiplicity greater than 1; generate, as an output, a configuration for two new operators, including a first new operator that transforms a dataset output of a preceding operator according to the configuration properties of the group of operators of the data pipeline and dispatches the transformed dataset to the replicas of the group of operators of the data pipeline and a second new operator that collects results of the replicas of the group of operators of the data pipeline and computes, based on the configuration properties of the group of operators of the data pipeline and an order preserving requirement, a result dataset that it provides to a succeeding operator, and automatically insert the first new operator and the second new operator into a deployment of the data pipeline, the first new operator being inserted before a number of replicas of the group of operators of the data pipeline corresponding to the specified multiplicity and the second new operator being inserted after the number of replicas of the group of operators of the data pipeline corresponding to the specified multiplicity. 8. The system of claim 7 , wherein the configuration properties specify properties defining a parallel computation of an execution of the group of operators of the data pipeline. 9. The system of claim 8 , wherein the configuration properties comprise: characteristics of a restructure configuration mapping of an input dataset into another dataset; characteristics of a transduction configuration mapping of an input dataset with respect to an incremental state; and characteristics of an aggregation configuration mapping of results obtained for each element of the input dataset. 10. The system of claim 7 , further comprising receiving a predefined indication of a value for at least one of the configuration properties. 11. The system of claim 7 , further comprising executing of the group of operators of the deployed data pipeline having the first new operator and the second new operator inserted therein, wherein the group of operators are executed in parallel. 12. The system of claim 7 , wherein an execution of the group of operators of the deployed data pipeline having the first new operator and the second new operator inserted therein generates a result equivalent to a result generated using a deployment of the data pipeline without any replicas of the group of operators of the data pipeline. 13. A non-transitory, computer-readable medium storing program code, the program code executable by a computer processor to cause the processor to perform a method comprising: receiving, as an input, configuration properties of a group of operators of a data pipeline, the data pipeline including a specified multiplicity greater than 1; generating, as an output, a configuration for two new operators, including a first new operator that transforms a dataset output of a preceding operator according to the configuration properties of the group of operators of the data pipeline and dispatches the transformed dataset to the replicas of the group of operators of the data pipeline and a second new operator that collects results of the replicas of the group of operators of the data pipeline and computes, based on the configuration properties of the group of operators of the data pipeline and an order preserving requirement, a result dataset that it provides to a succeeding operator; and automatically inserting the first new operator and the second new operator into a deployment of the data pipeline, the first new operator being inserted before a number of replicas of the group of operators of the data pipeline corresponding to the specified multiplicity and the second new operator being inserted after the number of replicas of the group of operators of the data pipeline corresponding to the specified multiplicity. 14. The medium of claim 13 , wherein the configuration properties specify properties defining a parallel computation of an execution of the group of operators of the data pipeline. 15. The medium of claim 14 , wherein the configuration properties comprise: characteristics of a restructure configuration of an input dataset into another dataset; characteristics of a transduction configuration of an input dataset with respect to an incremental state; and characteristics of an aggregation configuration of results obtained for each element of the input dataset. 16. The medium of claim 13 , further comprising receiving a predefined indication of a value for at least one of the configuration properties. 17. The medium of claim 13 , further comprising executing of the group of operators of the deployed data pipeline having the f

Assignees

Inventors

Classifications

  • Program synchronisation; Mutual exclusion, e.g. by means of semaphores · CPC title

  • G06F9/4843Primary

    by program, e.g. task dispatcher, supervisor, operating system · CPC title

  • G06F9/46Primary

    Multiprogramming arrangements · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12293216B2 cover?
Some embodiments provide a system and method to receive, as an input, configuration properties of a group of operators of a data pipeline, the data pipeline including a specified multiplicity greater than one (1); generate, as an output, a configuration for two new operators, including a first new operator and a second new operator; and automatically insert the first new operator and the second…
Who is the assignee on this patent?
Sap Se
What technology area does this patent fall under?
Primary CPC classification G06F9/4843. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue May 06 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).