Game-theoretic frameworks for deep neural network rationalization

US11657271B2 · US · B2

Patent metadata
Field: Value
Publication number: US-11657271-B2
Application number: US-201916658122-A
Country: US
Kind code: B2
Filing date: Oct 20, 2019
Priority date: Oct 20, 2019
Publication date: May 23, 2023
Grant date: May 23, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method and system of determining an output label rationale are provided. A first generator receives a first class of data and selects one or more input features from the first class of data. A first predictor receives the one or more selected input features from the first generator and predicts a first output label. A second generator receives a second class of data and selects one or more input features from the second class of data. A second predictor receives the one or more selected input features from the second generator and predicts a second output label. A discriminator receives the first and second output labels and determines whether the selected one or more input features from the first class of data or the selected features of the one or more input features from the second class of data, more accurately represents the first output label.
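
The data flow in the abstract can be illustrated with a small sketch. This is a toy under stated assumptions, not the patented implementation: the keyword-weight "generators", the lexicon-based "predictors", the `(label, confidence)` prediction format, and every name below (`make_generator`, `make_predictor`, `discriminate`, `relevance`, `polarity`) are hypothetical stand-ins for the learned networks the abstract describes.

```python
from typing import Callable, Dict, List, Tuple

Tokens = List[str]
Prediction = Tuple[int, float]  # (predicted label, confidence in that label)


def make_generator(relevance: Dict[str, float], k: int) -> Callable[[Tokens], Tokens]:
    """Hypothetical generator: keeps the k tokens with the highest relevance."""
    def select(tokens: Tokens) -> Tokens:
        ranked = sorted(range(len(tokens)),
                        key=lambda i: relevance.get(tokens[i], 0.0),
                        reverse=True)
        # Restore the original word order of the selected features.
        return [tokens[i] for i in sorted(ranked[:k])]
    return select


def make_predictor(polarity: Dict[str, float]) -> Callable[[Tokens], Prediction]:
    """Hypothetical predictor: sees only the selected features."""
    def predict(selected: Tokens) -> Prediction:
        mean = sum(polarity.get(t, 0.0) for t in selected) / max(len(selected), 1)
        p_one = min(1.0, max(0.0, 0.5 + 0.5 * mean))  # estimated P(label == 1)
        return (1, p_one) if p_one >= 0.5 else (0, 1.0 - p_one)
    return predict


def discriminate(first: Prediction, second: Prediction, first_label: int) -> str:
    """Hypothetical discriminator: which selection better supports first_label?"""
    def support(pred: Prediction) -> float:
        label, confidence = pred
        return confidence if label == first_label else 1.0 - confidence
    return "first" if support(first) >= support(second) else "second"


# Assumed toy data: one example per class of data.
polarity = {"rich": 0.8, "great": 1.0, "flat": -0.9, "awful": -1.0}
first_generator = make_generator({"rich": 1.0, "great": 1.0}, k=2)
second_generator = make_generator({"flat": 1.0, "awful": 1.0}, k=2)
first_predictor = make_predictor(polarity)
second_predictor = make_predictor(polarity)

first_selected = first_generator("the soup was rich and great".split())
second_selected = second_generator("the soup was flat and awful".split())
first_prediction = first_predictor(first_selected)
second_prediction = second_predictor(second_selected)
verdict = discriminate(first_prediction, second_prediction, first_label=1)
```

In this toy setup each predictor sees only the features its generator selected, and the discriminator compares the two resulting predictions against the first output label, mirroring the order of operations in the abstract.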

First claim

Opening claim text (preview).

What is claimed is:

1. A computer implemented class-wise adversarial rationalization (CAR) system for determining an output label rationale, comprising: a first generator network operative to receive a first class of data and select one or more input features from the first class of data; a first predictor network operative to receive the one or more selected input features from the first generator network and predict a first output label based on the received one or more selected input features from the first generator network; a second generator network operative to receive a second class of data and select one or more input features from the second class of data; a second predictor network operative to receive the one or more selected input features from the second generator network and predict a second output label based on the received one or more selected input features from the second generator network; and a discriminator network operative to receive the first and second output labels and to determine whether the selected one or more input features from the first class of data or the selected features of the one or more input features from the second class of data, more accurately represents the first output label.

2. The system of claim 1, wherein the first class of data represents a first rationale (R1) consistent with the first output label.

3. The system of claim 1, wherein the second class of data represents a second rationale (R2) consistent with the second output label.

4. The system of claim 1, wherein the second output label is a complement of the first output label.

5. The system of claim 1, wherein the first and second class of data are each in natural language.

6. The system of claim 1, wherein: the first generator network is trained on a subset of the first class of data; the second generator network is trained on a subset of the second class of data; and the first output label is inconsistent with the output label of the second output label.

7. The system of claim 1, wherein the first generator network is configured to play a cooperative game with the discriminator network.

8. The system of claim 7, wherein the second generator network is configured to play an adversarial game with the discriminator network.

9. The system of claim 8, wherein the cooperative game between the first generator network and the discriminator network maximizes a predictive accuracy of the one or more input features selected by the first generator network to predict the first output label.

10. The system of claim 8, wherein the adversarial game between the second generator network and the discriminator network is operative for the second generator network to attempt to convince the discriminator network that the features selected by the second generator network from the second class of data are consistent with the first output label.

11. The system of claim 1, wherein the discriminator network is operative to provide a scaled result based on the selected one or more input features from the first class of data and the selected one or more input features from the second class of data.

12. The system of claim 1, wherein the first generator network and the second generator network are each configured to convince the discriminator network that that they are factual generators for the first output label.

13. A computing device comprising: a processor; a storage device coupled to the processor; a program stored in the storage device, wherein an execution of the program by the processor configures the computing device to perform acts, comprising: receiving a first class of data by a first generator network; selecting one or more input features from the first class of data by the generator network; receiving, by a first predictor network, the one or more selected input features from the first generator network; predicting, by the first predictor network, a first output label based on the received one or more selected input features from the first generator network; receiving, by a second generator network, a second class of data; selecting, by the second generator network, one or more input features from the second class of data; receiving, by a second predictor network, the one or more selected input features from the second generator network; predicting, by the second predictor network, a second output label based on the received one or more selected input features from the second generator network; receiving, by a discriminator network, the first and second output labels; and determining, by the discriminator network, whether the selected one or more input features from the first class of data or the selected features of the one or more input features from the second class of data, more accurately represents the first output label.

14. The computing device of claim 13, wherein: the first class of data represents a first rationale (R1) consistent with the first output label; and the second class of data represents a second rationale (R2) consistent with the second output label.

15. The computing device of claim 13, wherein execution of the program by the processor further configures the computing device to perform acts, comprising: training the first generator network on a subset of the first class of data; and training the second generator network on a subset of the second class of data, wherein the first output label is inconsistent with the output label of the second output label.

16. The computing device of claim 13, wherein execution of the program by the processor further configures the computing device to perform acts, comprising: playing a cooperative game between the first generator network and the discriminator network.

17. The computing device of claim 16, wherein execution of the program by the processor further configures the computing device to perform acts, comprising: playing an adversarial game between the second generator network and the discriminator network.

18. The computing device of claim 17, wherein the cooperative game between the first generator network and the discriminator network maximizes a predictive accuracy of the one or more input features selected by the first generator network to predict the first output label.

19. The computing device of claim 13, wherein the first generator network and the second generator network are each configured to convince the discriminator network that that they are factual generators for the first output label.

20. A non-transitory computer readable storage medium tangibly embodying a computer readable program code having computer readable instructions that, when executed, causes a computer device to carry out a method, comprising: receiving a first class of data by a first generator network; selecting one or more input features from the first class of data by the generator network; receiving, by a first predictor network, the one or more selected input features from the first generator network; predicting, by the first predictor network, a first output label based on the received one or more selected input features from the first generator network; receiving, by a second generator network, a second class of data; selecting, by the second generator network, one or more input features from the second class of data; receiving, by a second predictor network, the one or more selected input features from the second generator network; predicting, by the second predictor network, a second output label based
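
The cooperative and adversarial games named in the dependent claims can be pictured with a minimal payoff sketch. The log-likelihood form below is an assumption borrowed from generic adversarial training, not an objective stated in the claim text, and all names (`discriminator_loss`, `first_generator_payoff`, `second_generator_payoff`, `EPS`) are hypothetical. Here `d_first` and `d_second` stand for the discriminator's score, between 0 and 1, that each generator's selected features are consistent with the first output label.

```python
import math

EPS = 1e-9  # keeps log() finite at the boundaries of [0, 1]


def _clamp(p: float) -> float:
    return min(1.0 - EPS, max(EPS, p))


def discriminator_loss(d_first: float, d_second: float) -> float:
    """Low when the first selection scores high and the second scores low."""
    return -(math.log(_clamp(d_first)) + math.log(1.0 - _clamp(d_second)))


def first_generator_payoff(d_first: float) -> float:
    """Cooperative: rises exactly when the discriminator's loss term falls."""
    return math.log(_clamp(d_first))


def second_generator_payoff(d_second: float) -> float:
    """Adversarial: rises when the discriminator is fooled by the second selection."""
    return math.log(_clamp(d_second))


# Assumed scores for illustration only.
baseline = discriminator_loss(0.9, 0.1)
fooled = discriminator_loss(0.9, 0.4)  # second generator was more convincing
```

Under these assumptions the first generator and the discriminator improve together, while any gain for the second generator shows up as a higher discriminator loss.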

Assignees

IBM, Massachusetts Inst Technology

Inventors

Classifications

  • Reinforcement learning · CPC title

  • Generative networks · CPC title

  • Quantised networks; Sparse networks; Compressed networks · CPC title

  • Supervised learning · CPC title

  • Adversarial learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11657271B2 cover?
A method and system of determining an output label rationale are provided. A first generator receives a first class of data and selects one or more input features from the first class of data. A first predictor receives the one or more selected input features from the first generator and predicts a first output label. A second generator receives a second class of data and selects one or more in…
Who is the assignee on this patent?
IBM, Massachusetts Inst Technology
What technology area does this patent fall under?
Primary CPC classification G06N3/045. Mapped technology areas include Physics.
When was this patent published?
Publication date May 23, 2023 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).