Generating unsupervised adversarial examples for machine learning

US12524677B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12524677-B2
Application numberUS-202117157077-A
CountryUS
Kind codeB2
Filing dateJan 25, 2021
Priority dateJan 25, 2021
Publication dateJan 13, 2026
Grant dateJan 13, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A trained machine learning model and a training dataset used to train the trained machine learning model can be received. Based on the training dataset, unsupervised adversarial examples can be generated. Robustness of the trained machine learning model can be determined using the generated unsupervised adversarial examples. The training dataset can be augmented with the generated unsupervised adversarial examples. The trained machine learning model can be retrained using the augmented training dataset.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computer-implemented method comprising: receiving a trained machine learning model and a training dataset used to train the trained machine learning model; based on the training dataset and a loss function of the trained machine learning model, generating images representing adversarial examples, the adversarial examples being perturbed samples of the training dataset, the generating creating an adversarial example which is least similar to an original sample in the training dataset, and also satisfying an adversarial criterion that a loss associated with the adversarial example is less than that of the original sample in the training dataset; determining robustness of the trained machine learning model using the generated adversarial examples; augmenting the training dataset with the generated images representing adversarial examples; retraining the trained machine learning model using the augmented training dataset; performing by the retrained machine learning model, image classification; and displaying the original sample, a generated image representing the adversarial example, and a reconstructed image reconstructed using the retrained machine learning model. 2 . The method of claim 1 , wherein the adversarial example is randomly sampled. 3 . The method of claim 1 , wherein the adversarial example is sampled using an output of a convolutional layer of the trained machine learning model. 4 . The method of claim 1 , wherein the generating of the adversarial examples includes solving a minmax algorithm which finds the adversarial example that has minimum training loss and least similarity to the original sample. 5 . The method of claim 1 , wherein the trained machine learning model includes a neural network model. 6 . The method of claim 1 , wherein the trained machine learning model includes an autoencoder. 7 . The method of claim 1 , wherein the trained machine learning model includes a representation learning model. 8 . The method of claim 1 , wherein the trained machine learning model includes a contrastive learning model. 9 . The method of claim 1 , wherein the trained machine learning model includes an unsupervised machine learning model. 10 . A computer program product comprising a non-transitory computer readable storage medium having program instructions embodied therewith, the program instructions readable by a device to cause the device to: receive a trained machine learning model and a training dataset used to train the trained machine learning model; based on the training dataset and a loss function of the trained machine learning model, generate mages representing adversarial examples, the adversarial examples being perturbed samples of the training dataset, the generating creating an adversarial example which is least similar to an original sample in the training dataset, and also satisfying an adversarial criterion that a loss associated with the adversarial example is less than that of the original sample in the training dataset; augment the training dataset with the generated images representing adversarial examples; retrain the trained machine learning model using the augmented training dataset; perform by the retrained machine learning model, image classification; and display the original sample, a generated image representing the adversarial example, and a reconstructed image reconstructed using the retrained machine learning model. 11 . The computer program product of claim 10 , wherein the device is caused to determine robustness of the trained machine learning model using the generated adversarial examples. 12 . The computer program product of claim 10 , wherein the adversarial sample is randomly sampled. 13 . The computer program product of claim 10 , wherein the adversarial sample is sampled using an output of a convolutional layer of the trained machine learning model. 14 . The computer program product of claim 10 , wherein the generating adversarial examples includes solving a minmax algorithm which finds the adversarial example that has minimum training loss and least similarity to the original sample. 15 . A system comprising: a hardware processor; and a memory device coupled with the hardware processor; the hardware processor configured to: receive a trained machine learning model and a training dataset used to train the trained machine learning model; based on the training dataset and a loss function of the trained machine learning model, generate images representing adversarial examples, the adversarial examples being perturbed samples of the training dataset, the generating creating an adversarial example which is least similar to an original sample in the training dataset, and also satisfying an adversarial criterion that a loss associated with the adversarial example is less than that of the original sample in the training dataset; determine robustness of the trained machine learning model using the generated adversarial examples; augment the training dataset with the generated images representing adversarial examples; retrain the trained machine learning model using the augmented training dataset; perform by the retrained machine learning model, image classification; and display the original sample, a generated image representing the adversarial example, and a reconstructed image reconstructed using the retrained machine learning model.

Assignees

Inventors

Classifications

  • Combinations of networks · CPC title

  • Adversarial learning · CPC title

  • Weakly supervised learning, e.g. semi-supervised or self-supervised learning · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • Auto-encoder networks; Encoder-decoder networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12524677B2 cover?
A trained machine learning model and a training dataset used to train the trained machine learning model can be received. Based on the training dataset, unsupervised adversarial examples can be generated. Robustness of the trained machine learning model can be determined using the generated unsupervised adversarial examples. The training dataset can be augmented with the generated unsupervised …
Who is the assignee on this patent?
IBM, Univ Nat Chung Hsing
What technology area does this patent fall under?
Primary CPC classification G06N3/088. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 13 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 10 related publications on this page (citations in our corpus or others sharing the same primary CPC).