Neural network negative rule extraction

US11907334B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11907334-B2
Application numberUS-202017115610-A
CountryUS
Kind codeB2
Filing dateDec 8, 2020
Priority dateDec 8, 2020
Publication dateFeb 20, 2024
Grant dateFeb 20, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A first classification is received from a neural network regarding a training dataset sent to the neural network. A modified training dataset with a perturbation of the training dataset is identified, where this modified training dataset causes the neural network to return a second classification. The perturbation is analyzed to identify a negative rule of the neural network.

First claim

Opening claim text (preview).

What is claimed is: 1. A computer-implemented method comprising: receiving a first classification from a neural network regarding a training dataset sent to the neural network, wherein the neural network is configured to identify positive and negative evidence; identifying a modified training dataset with a perturbation of the training dataset that causes the neural network to return a second classification that is different from the first classification; and analyzing the perturbation to identify a negative rule of the neural network, wherein the negative rule relates to an absence of an element in an input of the training dataset increasing a likelihood that the neural network returns the first classification, wherein a presence of the element increases a likelihood that the neural network returns the second classification. 2. The computer-implemented method of claim 1 , wherein analyzing the perturbation to identify the negative rule of the neural network includes identifying features that are present in the modified training dataset that are not in the training dataset. 3. The computer-implemented method of claim 2 , wherein analyzing the perturbation to identify the negative rule of the neural network includes identifying that some of the features that are not in the training dataset are more relevant to the training dataset. 4. The computer-implemented method of claim 3 , wherein identifying that some of the features that are not in the training dataset are more relevant includes using term frequency inverse document frequency. 5. The computer-implemented method of claim 1 , wherein analyzing the perturbation to identify the negative rule of the neural network includes identifying features of the training dataset that were most impacted by the perturbation. 6. The computer-implemented method of claim 1 , wherein the perturbation is determined using an adversarial attack method. 7. The computer-implemented method of claim 1 , wherein the perturbation is determined using a gradient-free optimization technique. 8. The computer-implemented method of claim 1 , further comprising interpreting output from the neural network using the negative rule. 9. The computer-implemented method of claim 1 , further comprising identifying the perturbation as a minimal perturbation to cause the neural network to return the second classification. 10. A system comprising: a processor; and a memory in communication with the processor, the memory containing instructions that, when executed by the processor, cause the processor to: receive a first classification from a neural network regarding a training dataset sent to the neural network, wherein the neural network is configured to identify positive and negative evidence; identify a modified training dataset with a perturbation of the training dataset that causes the neural network to return a second classification that is different from the first classification; and analyze the perturbation to identify a negative rule of the neural network, wherein the negative rule relates to an absence of an element in an input of the training dataset increasing a likelihood that the neural network returns the first classification, wherein a presence of the element increases a likelihood that the neural network returns the second classification. 11. The system of claim 10 , wherein analyzing the perturbation to identify the negative rule of the neural network includes identifying features that are present in the modified training dataset that are not in the training dataset. 12. The system of claim 11 , wherein analyzing the perturbation to identify the negative rule of the neural network includes identifying that some of the features that are not in the training dataset are more relevant to the training dataset using term frequency inverse document frequency. 13. The system of claim 10 , wherein analyzing the perturbation to identify the negative rule of the neural network includes identifying features of the training dataset that were most impacted by the perturbation to create the modified training dataset. 14. The system of claim 10 , wherein the perturbation is determined using an adversarial attack method. 15. The system of claim 10 , the memory containing additional instructions that, when executed by the processor, cause the processor to interpret output from the neural network using the negative rule. 16. A computer program product, the computer program product comprising a computer readable storage medium having program instructions embodied therewith, the program instructions executable by a computer to cause the computer to: receive a first classification from a neural network regarding a training dataset sent to the neural network, wherein the neural network is configured to identify positive and negative evidence; identify a modified training dataset with a perturbation of the training dataset that causes the neural network to return a second classification that is different from the first classification; and analyze the perturbation to identify a negative rule of the neural network, wherein the negative rule relates to an absence of an element in an input of the training dataset increasing a likelihood that the neural network returns the first classification, wherein a presence of the element increases a likelihood that the neural network returns the second classification. 17. The computer program product of claim 16 , wherein analyzing the perturbation to identify the negative rule of the neural network includes at least one of: identifying features that are present in the modified training dataset that are not in the training dataset; and identifying that some of the features that are not in the training dataset are more relevant to the training dataset using term frequency inverse document frequency. 18. The computer program product of claim 16 , wherein analyzing the perturbation to identify the negative rule of the neural network includes identifying features of the training dataset that were most impacted by the perturbation to create the modified training dataset. 19. The computer program product of claim 16 , wherein the perturbation is determined using an adversarial attack method. 20. The computer program product of claim 16 , the computer readable storage medium having additional program instructions embodied therewith that are executable by the computer to cause the computer to interpret output from the neural network using the negative rule.

Assignees

Inventors

Classifications

  • Supervised learning · CPC title

  • Adversarial learning · CPC title

  • characterised by the process organisation or structure, e.g. boosting cascade · CPC title

  • Learning methods · CPC title

  • G06N5/025Primary

    Extracting rules from data · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11907334B2 cover?
A first classification is received from a neural network regarding a training dataset sent to the neural network. A modified training dataset with a perturbation of the training dataset is identified, where this modified training dataset causes the neural network to return a second classification. The perturbation is analyzed to identify a negative rule of the neural network.
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F18/2148. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 20 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).