Computer-implemented system and method for trustless zero-knowledge contingent payment
US-2024249280-A1 · Jul 25, 2024 · US
US10467236B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10467236-B2 |
| Application number | US-201414500330-A |
| Country | US |
| Kind code | B2 |
| Filing date | Sep 29, 2014 |
| Priority date | Sep 29, 2014 |
| Publication date | Nov 5, 2019 |
| Grant date | Nov 5, 2019 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
In each iteration of the process of mining association rules from transaction data by a cluster of computing systems, each mapper node in the cluster receives a split of the transaction data. Each mapper node scans the split to count an absolute support value of each candidate itemset for current search level(s), and passes the candidate itemsets and their support values to reducer nodes in the cluster. The number of reducer nodes will be determined adaptively based on the number of the candidate itemsets and the number of maximum available resource nodes in the cluster. Each reducer node combines the absolute support value of each candidate itemset, and finds frequent itemsets among them using a minimum support threshold. For each frequent itemset it finds, the reducer node creates association rule(s) satisfying a minimum confidence threshold, and exports all discovered frequent itemsets and association rules to a file system for storage.
Opening claim text (preview).
What is claimed is: 1. A computer program product for mining association rules from transaction data by a cluster of computing systems, the computer program product comprising a computer readable storage medium having program instructions embodied therewith, wherein the computer readable storage medium is not a transitory signal per se, the program instructions executable by a processor to cause the processor to perform a method comprising: receiving a split of a plurality of splits of the transaction data by each mapper node of a plurality of mapper nodes in the cluster; for each mapper node, scanning the split to count an absolute support value of each candidate itemset for one or more search levels, wherein the number of candidate itemsets varies at different search levels, wherein each search level is associated with a different rule size of a plurality of rule sizes, wherein each rule size of the plurality of rule sizes is a threshold number of candidate itemsets to be searched at a given search level, and wherein each successive search level of the different search levels is associated with a larger rule size than a previous search level of the different search levels; and passing the candidate itemsets and the absolute support value of each candidate itemset to a plurality of reducer nodes in the cluster; and for each reducer node, grouping the candidate itemsets and the absolute support value of each candidate itemset into a plurality of groups, wherein a number of the groups in the plurality of groups is based on a number of the candidate itemsets and a maximum number of reduce tasks that can be run in parallel in the cluster, wherein the number of reduce tasks varies at the different search levels to balance workloads in the cluster; combining the absolute support value of each candidate itemset received from each mapper node; finding frequent itemsets for each search level of the one or more search levels based on the threshold number of candidate itemsets for each search level, wherein the frequent itemsets comprise the candidate itemsets with the combined absolute support value satisfying a minimum support threshold; for each frequent itemset for the one or more search levels, creating by each reducer note one or more association rules, wherein the creating of the one or more association rules comprises: computing a confidence value for each potential association rule of a plurality of potential association rules, wherein each confidence value is based on at least two candidate itemsets being found together in one or more transactions; and creating the one or more association rules based on those potential rules whose respective confidence values satisfy the minimum confidence threshold; and exporting each frequent itemset for each of the one or more search levels and each created association rule to a file system for storage. 2. The computer program product of claim 1 , wherein for each reducer node: receiving the frequent itemsets and the absolute support values for the frequent itemsets of one or more previous search levels, wherein, for each frequent itemset for the one or more search levels, the creating of the one or more association rules satisfying the minimum confidence threshold comprises: calculating a confidence of the rule using the frequent itemsets and the absolute support values of the one or more previous search levels, and determining whether the calculated confidence of the rule satisfies the minimum confidence threshold. 3. The computer program product of claim 1 , wherein a number of the reducer nodes driven in a map-reduce job is set equal to the number of groups. 4. The computer program product of claim 1 , wherein the combining of the absolute support value of each candidate itemset received from each mapper node comprises: merging the absolute support values of the candidate itemsets in the group to form global support values for the candidate itemsets in the group, wherein the finding of the frequent itemsets comprising the candidate itemsets with the combined absolute support value satisfying the minimum support threshold comprises: pruning infrequent itemsets in the group, the infrequent itemsets comprising the global support value less than the minimum support threshold. 5. The computer program product of claim 1 , wherein the method further comprises: generating the candidate itemsets for a next search level from the frequent itemsets of the one or more search levels; and passing the candidate itemsets for the next search level to the plurality of mapper nodes. 6. The computer program product of claim 5 , wherein the method further comprises: determining whether the candidate itemsets at the next search level could be generated; determining whether a maximum rule size has been reached; and in response to determining that the candidate itemsets at the next search level could not generated, or that the maximum rule size has been reached, ending the mining of the association rules from the transaction data. 7. A system; comprising: a processor; and a computer readable storage medium having program instructions embodied therewith, wherein the computer readable storage medium is not a transitory signal per se, the program instructions executable by a processor to cause the processor to perform a method comprising: receiving a split of a plurality of splits of the transaction data by each mapper node of a plurality of mapper nodes in the cluster; for each mapper node, scanning the split to count an absolute support value of each candidate itemset for one or more search levels, wherein the number of candidate itemsets varies at different search levels, wherein each search level is associated with a different rule size of a plurality of rule sizes, wherein each rule size of the plurality of rule sizes is a threshold number of candidate itemsets to be searched at a given search level, and wherein each successive search level of the different search levels is associated with a larger rule size than a previous search level of the different search levels; and passing the candidate itemsets and the absolute support value of each candidate itemset to a plurality of reducer nodes in the cluster; and for each reducer node, grouping the candidate itemsets and the absolute support value of each candidate itemset into a plurality of groups, wherein a number of the groups in the plurality of groups is based on a number of the candidate itemsets and a maximum number of reduce tasks that can be run in parallel in the cluster, wherein the number of reduce tasks varies at the different search levels to balance workloads in the cluster; combining the absolute support value of each candidate itemset received from each mapper node; finding frequent itemsets for each search level of the one or more search levels based on the threshold number of candidate itemsets for each search level, wherein the frequent itemsets comprise the candidate itemsets with the combined absolute support value satisfying a minimum support threshold; for each frequent itemset for the one or more search levels, creating by each reducer note one or more association rules, wherein the creating of the one or more association rules comprises: computing a confidence value for each potential association rule of a plurality of potential association rules, wherein each confidence value is based on at least two candidate itemsets being found together in one or more transactions; and creating the one or more association rules based on those potential rules whose respective confidence values satisfy the minimum confidence threshold; and exporting each frequent itemset for each of the one or more search levels and each created association rule to
File search processing · CPC title
Query processing support for facilitating data mining operations in structured databases · CPC title
Schema design and management · CPC title
Data mining · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.