Multi-party prediction using feature contribution values

US11848915B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11848915-B2
Application numberUS-202017106253-A
CountryUS
Kind codeB2
Filing dateNov 30, 2020
Priority dateNov 30, 2020
Publication dateDec 19, 2023
Grant dateDec 19, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Techniques are provided for multi-party prediction using feature contribution values. One method comprises obtaining a first set of feature contribution values associated with respective ones of a plurality of machine learning models, wherein each machine learning model is trained using training data of a different party and each feature contribution value indicates a contribution by a corresponding feature to a prediction generated by the associated machine learning model; training an aggregate machine learning model using the obtained first sets of feature contribution values; receiving a second set of feature contribution values generated by applying data of at least one party to at least one machine learning model; and applying the second set of feature contribution values to the trained aggregate machine learning model to obtain a global prediction. Each feature contribution value may correspond to a masked feature, and the feature contribution values may not expose the source data of one party to another party.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: obtaining a first set of feature contribution values associated with respective ones of a plurality of machine learning models, wherein each machine learning model is trained using training data of a different party and wherein each feature contribution value indicates a contribution by a corresponding feature to a prediction generated by the associated machine learning model; training an aggregate machine learning model by applying the obtained first sets of feature contribution values to the aggregate machine learning model; receiving at least one second set of feature contribution values generated by applying data of at least one party to one or more of the machine learning models; and applying the at least one second set of feature contribution values to the trained aggregate machine learning model to obtain a global prediction related to at least some of the data of one or more of the different parties; wherein the method is performed by at least one processing device comprising a processor coupled to a memory. 2. The method of claim 1 , wherein each of the feature contribution values corresponds to a masked feature. 3. The method of claim 1 , wherein each first set of obtained feature contribution values does not expose the source data of a respective party to a different party. 4. The method of claim 1 , wherein the feature contribution values comprise SHAP values. 5. The method of claim 1 , wherein the training of the aggregate machine learning model further employs at least one label generated by at least one machine learning model. 6. The method of claim 1 , wherein a training data set of a first party is not shared with another party. 7. The method of claim 1 , further comprising employing one or more data access controls to prevent data of a first party from being accessed by another party. 8. The method of claim 1 , wherein the different parties comprise one or more of different data owners, different users and different entities. 9. An apparatus comprising: at least one processing device comprising a processor coupled to a memory; the at least one processing device being configured to implement the following steps: obtaining a first set of feature contribution values associated with respective ones of a plurality of machine learning models, wherein each machine learning model is trained using training data of a different party and wherein each feature contribution value indicates a contribution by a corresponding feature to a prediction generated by the associated machine learning model; training an aggregate machine learning model by applying the obtained first sets of feature contribution values to the aggregate machine learning model; receiving at least one second set of feature contribution values generated by applying data of at least one party to one or more of the machine learning models; and applying the at least one second set of feature contribution values to the trained aggregate machine learning model to obtain a global prediction related to at least some of the data of one or more of the different parties. 10. The apparatus of claim 9 , wherein each first set of obtained feature contribution values does not expose the source data of a respective party to a different party. 11. The apparatus of claim 9 , wherein the feature contribution values comprise SHAP values. 12. The apparatus of claim 9 , wherein the training of the aggregate machine learning model further employs at least one label generated by at least one machine learning model. 13. The apparatus of claim 9 , further comprising employing one or more data access controls to prevent data of a first party from being accessed by another party. 14. The apparatus of claim 9 , wherein the different parties comprise one or more of different data owners, different users and different entities. 15. A non-transitory processor-readable storage medium having stored therein program code of one or more software programs, wherein the program code when executed by at least one processing device causes the at least one processing device to perform the following steps: obtaining a first set of feature contribution values associated with respective ones of a plurality of machine learning models, wherein each machine learning model is trained using training data of a different party and wherein each feature contribution value indicates a contribution by a corresponding feature to a prediction generated by the associated machine learning model; training an aggregate machine learning model by applying the obtained first sets of feature contribution values to the aggregate machine learning model; receiving at least one second set of feature contribution values generated by applying data of at least one party to one or more of the machine learning models; and applying the at least one second set of feature contribution values to the trained aggregate machine learning model to obtain a global prediction related to at least some of the data of one or more of the different parties. 16. The non-transitory processor-readable storage medium of claim 15 , wherein each first set of obtained feature contribution values does not expose the source data of a respective party to a different party. 17. The non-transitory processor-readable storage medium of claim 15 , wherein the feature contribution values comprise SHAP values. 18. The non-transitory processor-readable storage medium of claim 15 , wherein the training of the aggregate machine learning model further employs at least one label generated by at least one machine learning model. 19. The non-transitory processor-readable storage medium of claim 15 , further comprising employing one or more data access controls to prevent data of a first party from being accessed by another party. 20. The non-transitory processor-readable storage medium of claim 15 , wherein the different parties comprise one or more of different data owners, different users and different entities.

Assignees

Inventors

Classifications

  • wherein the identity of one or more communicating identities is hidden (cryptographic mechanisms or cryptographic arrangements for anonymous credentials or for identity based cryptographic systems H04L9/00) · CPC title

  • Inference or reasoning models · CPC title

  • Machine learning · CPC title

  • involving homomorphic encryption · CPC title

  • Ensemble learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11848915B2 cover?
Techniques are provided for multi-party prediction using feature contribution values. One method comprises obtaining a first set of feature contribution values associated with respective ones of a plurality of machine learning models, wherein each machine learning model is trained using training data of a different party and each feature contribution value indicates a contribution by a correspo…
Who is the assignee on this patent?
Emc Ip Holding Co Llc
What technology area does this patent fall under?
Primary CPC classification H04L63/0407. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Dec 19 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).