Interpretability framework for differentially private deep learning

US2025036811A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2025036811-A1
Application numberUS-202418904462-A
CountryUS
Kind codeA1
Filing dateOct 2, 2024
Priority dateOct 30, 2020
Publication dateJan 30, 2025
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Data is received that specifies a bound for an adversarial posterior belief p c that corresponds to a likelihood to re-identify data points from the dataset based on a differentially private function output. Privacy parameters ε, δ are then calculated based on the received data that govern a differential privacy (DP) algorithm to be applied to a function to be evaluated over a dataset. The calculating is based on a ratio of probabilities distributions of different observations, which are bound by the posterior belief p c as applied to a dataset. The calculated privacy parameters are then used to apply the DP algorithm to the function over the dataset. Related apparatus, systems, techniques and articles are also described.

First claim

Opening claim text (preview).

What is claimed is: 1 . A system for training a machine learning model comprising: at least one data processor; memory storing instructions which, when executed by the at least one data processor, result in operations comprising: receiving a dataset; receiving at least one first user-generated privacy parameter which governs a differential privacy (DP) algorithm to be applied to a function evaluated over the received dataset; calculating, based on the received at least one first user-generated privacy parameter, at least one second privacy parameter based on a ratio or overlap of probabilities of distributions of different observations; applying, using the at least one second privacy parameter, the DP algorithm to the function over the received dataset to result in an anonymized function output; and anonymously training at least one machine learning model using the dataset after application of the DP algorithm to the function over the received dataset which, when deployed, is configured to classify input data. 2 . The system of claim 1 , wherein the operations further comprise: deploying the trained at least one machine learning model; receiving, by the deployed trained at least one machine learning model, input data. 3 . The system of claim 2 , wherein the operations further comprise: providing, by the deployed trained at least one machine learning model based on the input data, a classification. 4 . The system of claim 1 , wherein: the at least one first user-generated privacy parameter comprises a bound for an adversarial posterior belief p c that corresponds to a likelihood to re-identify data points from the dataset based on a differentially private function output; and the calculated at least one second privacy parameter comprises privacy parameters ε, δ; and the calculating is based on a conditional probability of distributions of different datasets given a differential private function output which are bound by the posterior belief p c as applied to the dataset. 5 . The system of claim 1 , wherein the at least one first user-generated privacy parameter comprises privacy parameters ε, δ; the calculated at least one second privacy parameter comprises an expected membership advantage p a that corresponds to a probability of an adversary successfully identifying a member in the dataset; and the calculating is based on a conditional probability of different possible datasets. 6 . The system of claim 1 , wherein the at least one first user-generated privacy parameter comprises privacy parameters ε, δ; the calculated at least one second privacy parameter comprises an adversarial posterior belief bound p c that corresponds to a likelihood to re-identify data points from the dataset based on a differentially private output. 7 . The system of claim 6 , wherein the calculating is based on a conditional probability of different possible datasets. 8 . A method comprising: receiving a dataset; receiving at least one first user-generated privacy parameter which governs a differential privacy (DP) algorithm to be applied to a function evaluated over the received dataset; calculating, based on the received at least one first user-generated privacy parameter, at least one second privacy parameter based on a ratio or overlap of probabilities of distributions of different observations; applying, using the at least one second privacy parameter, the DP algorithm to the function over the received dataset to result in an anonymized function output; and anonymously training at least one machine learning model using the dataset after application of the DP algorithm to the function over the received dataset which, when deployed, is configured to classify input data. 9 . The method of claim 8 , further comprising: deploying the trained at least one machine learning model; receiving, by the deployed trained at least one machine learning model, input data. 10 . The method of claim 9 , further comprising: providing, by the deployed trained at least one machine learning model based on the input data, a classification. 11 . The method of claim 8 , wherein: the at least one first user-generated privacy parameter comprises a bound for an adversarial posterior belief p c that corresponds to a likelihood to re-identify data points from the dataset based on a differentially private function output; and the calculated at least one second privacy parameter comprises privacy parameters ε, δ; and the calculating is based on a conditional probability of distributions of different datasets given a differential private function output which are bound by the posterior belief p c as applied to the dataset. 12 . The method of claim 8 , wherein the at least one first user-generated privacy parameter comprises privacy parameters ε, δ; the calculated at least one second privacy parameter comprises an expected membership advantage p a that corresponds to a probability of an adversary successfully identifying a member in the dataset; and the calculating is based on a conditional probability of different possible datasets. 13 . The method of claim 8 , wherein the at least one first user-generated privacy parameter comprises privacy parameters ε, δ; the calculated at least one second privacy parameter comprises an adversarial posterior belief bound p c that corresponds to a likelihood to re-identify data points from the dataset based on a differentially private output. 14 . The method of claim 13 , wherein the calculating is based on a conditional probability of different possible datasets. 15 . A non-transitory machine-readable storage medium having embodied thereon instructions executable by one or more machines to perform operations comprising: receiving a dataset; receiving at least one first user-generated privacy parameter which governs a differential privacy (DP) algorithm to be applied to a function evaluated over the received dataset; calculating, based on the received at least one first user-generated privacy parameter, at least one second privacy parameter based on a ratio or overlap of probabilities of distributions of different observations; applying, using the at least one second privacy parameter, the DP algorithm to the function over the received dataset to result in an anonymized function output; and anonymously training at least one machine learning model using the dataset after application of the DP algorithm to the function over the received dataset which, when deployed, is configured to classify input data. 16 . The non-transitory machine-readable storage medium of claim 15 , wherein the operations further comprise: deploying the trained at least one machine learning model; receiving, by the deployed trained at least one machine learning model, input data. 17 . The non-transitory machine-readable storage medium of claim 16 , wherein the operations further comprise: providing, by the deployed trained at least one machine learning model based on the input data, a classification. 18 . The non-transitory machine-readable storage medium of claim 15 , wherein: the at least one first user-generated privacy parameter comprises a bound for an adversarial posterior belief p c that corresponds to a likelihood to re-identify data points from the dataset based on a differentially private function output; and the calculated at least one second privacy parameter comprises privacy parameters ε, δ; and the calculating is based on a conditional probability of distributions of different

Assignees

Inventors

Classifications

  • Supervised learning · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • characterised by the process organisation or structure, e.g. boosting cascade · CPC title

  • for evaluating statistical data {, e.g. average values, frequency distributions, probability functions, regression analysis (forecasting specially adapted for a specific administrative, business or logistic context G06Q10/04)} · CPC title

  • Machine learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2025036811A1 cover?
Data is received that specifies a bound for an adversarial posterior belief p c that corresponds to a likelihood to re-identify data points from the dataset based on a differentially private function output. Privacy parameters ε, δ are then calculated based on the received data that govern a differential privacy (DP) algorithm to be applied to a function to be evaluated over a dataset. The cal…
Who is the assignee on this patent?
Sap Se
What technology area does this patent fall under?
Primary CPC classification G06F21/6254. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Jan 30 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).