What technology area does this patent fall under?

Primary CPC classification G06N3/084. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Nov 14 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Verifiable deep learning training service

US11816575B2 · US · B2

Patent metadata
Field	Value
Publication number	US-11816575-B2
Application number	US-201816124657-A
Country	US
Kind code	B2
Filing date	Sep 7, 2018
Priority date	Sep 7, 2018
Publication date	Nov 14, 2023
Grant date	Nov 14, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Deep learning training service framework mechanisms are provided. The mechanisms receive encrypted training datasets for training a deep learning model, execute a FrontNet subnet model of the deep learning model in a trusted execution environment, and execute a BackNet subnet model of the deep learning model external to the trusted execution environment. The mechanisms decrypt, within the trusted execution environment, the encrypted training datasets and train the FrontNet subnet model and BackNet subnet model of the deep learning model based on the decrypted training datasets. The FrontNet subnet model is trained within the trusted execution environment and provides intermediate representations to the BackNet subnet model which is trained external to the trusted execution environment using the intermediate representations. The mechanisms release a trained deep learning model comprising a trained FrontNet subnet model and a trained BackNet subnet model, to the one or more client computing devices.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, in a data processing system, comprising: executing, by a deep learning training service framework, a FrontNet subnet model of a deep learning model in a trusted execution environment of the deep learning training service framework; executing, by the deep learning training service framework, a BackNet subnet model of the deep learning model in the deep learning training service framework external to the trusted execution environment, wherein the FrontNet subnet model comprises a first predetermined number of consecutive layers of the deep learning model from an input layer of the deep learning model to an intermediate layer, and wherein the BackNet subnet model comprises a second predetermined number of consecutive layers of the deep learning model from a layer, subsequent to the intermediate layer, to an output layer of the deep learning model; decrypting, by a security module executing within the trusted execution environment, one or more encrypted training datasets; training, by training logic of the deep learning training service framework, the FrontNet subnet model and BackNet subnet model of the deep learning model based on the decrypted training datasets, wherein the FrontNet subnet model is trained within the trusted execution environment and provides intermediate representations to the BackNet subnet model which is trained external to the trusted execution environment using the intermediate representations; releasing, by the deep learning training service framework, a trained deep learning model comprising a trained FrontNet subnet model and a trained BackNet subnet model, to one or more client computing devices; and generating, by a fingerprint generation module executing within the trusted execution environment, one or more first fingerprint data structures for the one or more training datasets, wherein each first fingerprint data structure comprises a fingerprint that is a normalized feature embedding of a penultimate layer of the BackNet subnet model. 2. The method of claim 1 , wherein: the one or more client computing devices comprises a plurality of computing devices associated with a plurality of different training dataset providers, and wherein the security module executing within the trusted execution environment prevents training dataset providers from accessing training datasets provided by other training dataset providers, and each of the training datasets provided by the different training dataset providers are used during the training of the FrontNet subnet model and the BackNet subnet model of the deep learning model to generate the trained deep learning model, and wherein the same trained deep learning model is released to each of the different training dataset providers with the FrontNet subnet model encrypted with an encryption key specific to the training dataset provider. 3. The method of claim 1 , further comprising: receiving, by the deep learning training service framework, the one or more encrypted training datasets from the one or more client computers; and in response to receiving the one or more encrypted training datasets: prior to decrypting the one or more encrypted training datasets, authenticating, by the security module, training dataset providers of the one or more training dataset providers; and discarding, by the security module, any training datasets from training dataset providers that do not pass the authentication from further use during training of the FrontNet subnet model and BackNet subnet model of the deep learning model. 4. The method of claim 3 , further comprising, in response to receiving the one or more encrypted training datasets: verifying, by the security module, the integrity of the one or more training datasets; and discarding, by the security module, any training datasets that do not pass the verification from further use during training of the FrontNet subnet model and BackNet subnet model of the deep learning model. 5. The method of claim 1 , further comprising: generating, by a fingerprint generation module executing within the trusted execution environment, one or more first fingerprint data structures for the one or more training datasets, wherein each first fingerprint data structure comprises a fingerprint that is a feature embedding of a selected layer of the deep learning model; and storing, by the fingerprint generation module, the generated one or more first fingerprint data structures in an evidence storage. 6. The method of claim 5 , further comprising: processing, by the trained deep learning model, new input data to generate an output result and a second fingerprint data structure corresponding to the new input data; receiving, from a client device of the one or more client devices, a query comprising the second fingerprint data structure; searching, by a query module executing in the deep learning training service framework, the evidence storage for a first fingerprint data structure similar to the second fingerprint data structure based on a distance function that measures a similarity between embeddings of the fingerprint data structures in the evidence storage and the second fingerprint data structure, and identifying the first fingerprint data structure as a fingerprint data structure in the evidence storage having a smallest distance; and identifying, by the query module, a training dataset, of the one or more training datasets, and a corresponding training dataset provider based on an entry in the evidence storage corresponding to the first fingerprint data structure. 7. The method of claim 6 , further comprising: performing at least one of a debugging operation or a root cause analysis on the trained deep learning model based on the identified training dataset and identified corresponding training dataset provider. 8. The method of claim 1 , further comprising: negotiating, with a plurality of the training dataset providers, a customized partitioning point hyperparameter for the training dataset provider prior to training a corresponding instance of the deep learning model, wherein the customized partitioning point defines one of a last intermediate layer of the FrontNet subnet model or a first layer of the BackNet subnet model, and wherein the partitioning point hyperparameter is different for at least two of the training dataset providers; and configuring, for each instance of the deep learning model corresponding to each of the one or more training dataset providers, the first predetermined number of consecutive layers of the FrontNet subnet model and the second predetermined number of consecutive layers of the BackNet subnet model based on the customized partitioning point hyperparameter for the training dataset provider. 9. The method of claim 5 , wherein generating one or more first fingerprint data structures for the one or more training datasets comprises generating, for each training data instance in the one or more training datasets, a first fingerprint data structure comprising a tuple data structure that specifies a fingerprint, a class label of the training data instance used to train the deep learning model, a data source identifier, and a hash digest of the training data instance. 10. The method of claim 9 , wherein the fingerprint in the tuple data structure is a normalized feature embedding of a penultimate layer of the BackNet subnet model. 11. The method of claim 1 , further comprising: processing a query comprising a second fingerprint data structure based on the one or more first fingerprint data structures to identify a training dataset corresponding to a first fingerprint data structure based on a similarity evaluation of the fingerpr

Assignees

Inventors

Classifications

G06N3/09
Supervised learning · CPC title
G06N3/098
Distributed learning, e.g. federated learning · CPC title
G06N3/0499
Feedforward networks · CPC title
G06N3/084Primary
Backpropagation, e.g. using gradient descent · CPC title
G06F21/602
Providing cryptographic facilities or services · CPC title

Patent family

Related publications grouped by family.

View patent family 69719565

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11816575B2 cover?: Deep learning training service framework mechanisms are provided. The mechanisms receive encrypted training datasets for training a deep learning model, execute a FrontNet subnet model of the deep learning model in a trusted execution environment, and execute a BackNet subnet model of the deep learning model external to the trusted execution environment. The mechanisms decrypt, within the trust…
Who is the assignee on this patent?: IBM
What technology area does this patent fall under?: Primary CPC classification G06N3/084. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Nov 14 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

Methods and apparatus for distributed use of a machine learning model

Privacy-preserving machine learning

Secure Training of Multi-Party Deep Neural Network

Accelerating deep neural network training with inconsistent stochastic gradient descent

Deep neural network processing on hardware accelerators with stacked memory

Frequently asked questions