Systems and Methods for Black Box Optimization
US-2020097853-A1 · Mar 26, 2020 · US
US11836256B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11836256-B2 |
| Application number | US-201916256107-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jan 24, 2019 |
| Priority date | Jan 24, 2019 |
| Publication date | Dec 5, 2023 |
| Grant date | Dec 5, 2023 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
An adversarial robustness testing method, system, and computer program product include testing a robustness of a black-box system under different access settings via an accelerator.
Opening claim text (preview).
What is claimed is: 1. A computer-implemented adversarial robustness testing method for checking a learning performance of a black-box system, the method comprising: testing a robustness, against an adversarial attack, of a black-box system under different access settings via an accelerator by generating adversarial inputs of a model in a limited access setting of the different access settings, wherein the accelerator includes a function to reduce an attack space in the adversarial attack for query efficiency, wherein a robustness objective for the testing of the robustness of the black-box system uses system defined threat models for adversarial examples, and wherein a perturbed noise at each pixel of the perturbed example is imperceptible up to a predefined ε-tolerant threshold and a non-negative regularization parameter places emphasis on a distortion between the adversarial examples and a legitimate image. 2. The method of claim 1 , wherein the different access settings comprise: a soft-label setting; and a hard-label setting. 3. The method of claim 1 , further comprising: for a soft-label setting as one of the different access settings, using the accelerator and a gradient descent technique to find the adversarial examples and summarize a robustness statistic; and for a hard-label setting as one of the different access settings, using a smoothing function to summarize a robustness statistic. 4. The method of claim 1 , further comprising, given a legitimate input of a plurality of legitimate inputs having a correct class label, determining an optimal adversarial perturbation using the accelerator such that the perturbed example is misclassified to a target class including an incorrect class label by a deep neural network (DNN) model trained on the legitimate inputs. 5. The method of claim 1 , wherein the accelerator comprises a function including an efficient gradient estimation via a random directional estimate and averaging. 6. The method of claim 1 , wherein the accelerator comprises a function including a dimension reduction of an input. 7. The method of claim 1 , wherein the accelerator comprises a function including a problem splitting between a black-box loss function and a white-box adversarial distortion function. 8. The method of claim 1 , embodied in a cloud-computing environment. 9. A computer program product for adversarial robustness testing for checking a learning performance of a black-box system, the computer program product comprising a computer-readable storage medium having program instructions embodied therewith, the program instructions executable by a computer to cause the computer to perform: testing a robustness, against an adversarial attack, of a black-box system under different access settings via an accelerator by generating adversarial inputs of a model in a limited access setting of the different access settings, wherein the accelerator includes a function to reduce an attack space in the adversarial attack for query efficiency, wherein a robustness objective for the testing of the robustness of the black-box system uses system defined threat models for adversarial examples, and wherein a perturbed noise at each pixel of the perturbed example is imperceptible up to a predefined ε-tolerant threshold and a non-negative regularization parameter places emphasis on a distortion between the adversarial examples and the legitimate image. 10. The computer program product of claim 9 , wherein the different access settings comprise: a soft-label setting; and a hard-label setting. 11. The computer program product of claim 9 , further comprising: for a soft-label setting as one of the different access settings, using the accelerator and a gradient descent technique to find adversarial example and summarize a robustness statistic; and for a hard-label setting as one of the different access settings, using a smoothing function to summarize a robustness statistic. 12. The computer program product of claim 9 , further comprising, given a legitimate input of a plurality of legitimate inputs having a correct class label, determining an optimal adversarial perturbation using the accelerator such that the perturbed example is misclassified to a target class including an incorrect class label by a deep neural network (DNN) model trained on the legitimate inputs. 13. The computer program product of claim 9 , wherein the accelerator comprises a function including an efficient gradient estimation via a random directional estimate and averaging. 14. The computer program product of claim 9 , wherein the accelerator comprises a function including a dimension reduction of an input. 15. The computer program product of claim 9 , wherein the accelerator comprises a function including a problem splitting between a black-box loss function and a white-box adversarial distortion function. 16. An adversarial robustness testing system for checking a learning performance of a black-box system, the system comprising: a processor; and a memory, the memory storing instructions to cause the processor to perform: testing a robustness, against an adversarial attack, of a black-box system under different access settings via an accelerator by generating adversarial inputs of a model in a limited access setting of the different access settings, wherein the accelerator includes a function to reduce an attack space in the adversarial attack for query efficiency, wherein a robustness objective for the testing of the robustness of the black-box system uses system defined threat models for adversarial examples, and wherein a perturbed noise at each pixel of the perturbed example is imperceptible up to a predefined ε-tolerant threshold and a non-negative regularization parameter places emphasis on a distortion between the adversarial examples and the legitimate image. 17. The system of claim 16 , further comprising: for a soft-label setting as one of the different access settings, using the accelerator and a gradient descent technique to find the adversarial examples and summarize a robustness statistic; and for a hard-label setting as one of the different access settings, using a smoothing function to summarize a robustness statistic. 18. A computer-implemented adversarial robustness testing method for checking a learning performance of a black-box system, the method comprising: testing a robustness, against an adversarial attack, of the black-box system under a limited access setting to the black-box system: receiving a first classification of an input as an output from the black-box system; and determining a minimal change to the input such that a second classification is received as the output from the black-box system, wherein the testing includes a function to reduce an attack space in the adversarial attack for query efficiency, wherein a robustness objective for the testing of the robustness of the black-box system uses system defined threat models for adversarial examples, and wherein a perturbed noise at each pixel of the perturbed example is imperceptible up to a predefined ε-tolerant threshold and a non-negative regularization parameter places emphasis on a distortion between the adversarial examples and the legitimate image. 19. A computer-implemented adversarial robustness testing method for checking a learning performance of a black-box system, the method comprising: testing a robustness, against an adversarial attack, of the black-box system under a limited access setting to the black-box system: f
Auto-encoder networks; Encoder-decoder networks · CPC title
Convolutional networks [CNN, ConvNet] · CPC title
Supervised learning · CPC title
Assessing vulnerabilities and evaluating computer system security · CPC title
Architecture, e.g. interconnection topology · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.