Method and system for providing explanation of prediction generated by an artificial neural network model

US11315008B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11315008-B2
Application numberUS-201916281904-A
CountryUS
Kind codeB2
Filing dateFeb 21, 2019
Priority dateDec 31, 2018
Publication dateApr 26, 2022
Grant dateApr 26, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

This disclosure relates to method and system for providing an explanation for a prediction generated by an artificial neural network (ANN) model for a given input data. The method may include receiving the given input data and the prediction generated by the ANN model. The ANN model may be built and trained for a target application. The method may further include determining a plurality of relevant portions of the given input data. For each of the plurality of relevant portions, the method may further include fetching a portional prediction and a portional prediction score generated by the ANN model, and determining a degree of influence score based on the portional prediction score and a comparison between the portional prediction and the prediction. The method may further include providing the explanation for the prediction based on the degree of influence score of each of the plurality of relevant portions.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of providing an explanation for a prediction generated by an artificial neural network (ANN) model for a given input data, the method comprising: receiving, by a prediction explanation device, the given input data and the prediction generated by the ANN model, wherein the ANN model is built and trained for a target application; determining, by the prediction explanation device, a plurality of relevant portions of the given input data; for each of the plurality of relevant portions, fetching, by the prediction explanation device, a portional prediction and a portional prediction score generated by the ANN model; and determining, by the prediction explanation device, a degree of influence score based on the portional prediction score and a comparison between the portional prediction and the prediction; and providing, by the prediction explanation device, the explanation for the prediction based on the degree of influence score of each of the plurality of relevant portions. 2. The method of claim 1 , wherein the given input data comprises at least one of text data, audio data, video data, and image data. 3. The method of claim 1 , wherein determining the plurality of relevant portions comprises: segmenting the given input data into a plurality of portions; and processing each of the plurality of portions to filter the plurality of relevant portions. 4. The method of claim 1 , wherein the target application comprises a text based application, wherein the ANN model comprises a recurrent neural network (RNN) model, and wherein each of the plurality of relevant portions comprises a relevant token from a tokenized text. 5. The method of claim 1 , wherein the providing the explanations further comprises determining a set of influential portions from among the plurality of relevant portions based on the degree of influence score of each of the plurality of relevant portions. 6. The method of claim 5 , further comprising retuning the ANN model based on the set of influential portions. 7. The method of claim 1 , wherein the providing the explanations comprises rendering each of the plurality of relevant portions along with the corresponding degree of influence score. 8. A system for providing an explanation for a prediction generated by an artificial neural network (ANN) model for a given input data, the system comprising: a prediction explanation device comprising at least one processor and a computer-readable medium storing instructions that, when executed by the at least one processor, cause the at least one processor to perform operations comprising: receiving the given input data and the prediction generated by the ANN model, wherein the ANN model is built and trained for a target application; determining a plurality of relevant portions of the given input data; for each of the plurality of relevant portions, fetching a portional prediction and a portional prediction score generated by the ANN model; and determining a degree of influence score based on the portional prediction score and a comparison between the portional prediction and the prediction; and providing the explanation for the prediction based on the degree of influence score of each of the plurality of relevant portions. 9. The system of claim 8 , wherein the given input data comprises at least one of text data, audio data, video data, and image data. 10. The system of claim 8 , wherein determining the plurality of relevant portions comprises: segmenting the given input data into a plurality of portions; and processing each of the plurality of portions to filter the plurality of relevant portions. 11. The system of claim 8 , wherein the target application comprises a text based application, wherein the ANN model comprises a recurrent neural network (RNN) model, and wherein each of the plurality of relevant portions comprises a relevant token from a tokenized text. 12. The system of claim 8 , wherein the providing the explanations further comprises determining a set of influential portions from among the plurality of relevant portions based on the degree of influence score of each of the plurality of relevant portions. 13. The system of claim 12 , wherein the operations further comprise retuning the ANN model based on the set of influential portions. 14. The system of claim 8 , wherein the providing the explanations comprises rendering each of the plurality of relevant portions along with the corresponding degree of influence score. 15. A non-transitory computer-readable medium storing computer-executable instructions that, when executed by at least one processor, cause the at least one processor to perform operations comprising: receiving a given input data and a prediction generated by an artificial neural network (ANN) model for the given input data, wherein the ANN model is built and trained for a target application; determining a plurality of relevant portions of the given input data; for each of the plurality of relevant portions, fetching a portional prediction and a portional prediction score generated by the ANN model; and determining a degree of influence score based on the portional prediction score and a comparison between the portional prediction and the prediction; and providing an explanation for the prediction based on the degree of influence score of each of the plurality of relevant portions. 16. The non-transitory computer-readable medium of the claim 15 , wherein the target application comprises a text based application, wherein the ANN model comprises a recurrent neural network (RNN) model, and wherein each of the plurality of relevant portions comprises a relevant token from a tokenized text. 17. The non-transitory computer-readable medium of the claim 15 , wherein determining the plurality of relevant portions comprises: segmenting the given input data into a plurality of portions; and processing each of the plurality of portions to filter the plurality of relevant portions. 18. The non-transitory computer-readable medium of the claim 15 , wherein the providing the explanations further comprises determining a set of influential portions from among the plurality of relevant portions based on the degree of influence score of each of the plurality of relevant portions. 19. The non-transitory computer-readable medium of the claim 18 , wherein the operations further comprise retuning the ANN model based on the set of influential portions. 20. The non-transitory computer-readable medium of the claim 15 , wherein the providing the explanations comprises rendering each of the plurality of relevant portions along with the corresponding degree of influence score.

Assignees

Inventors

Classifications

  • G06N3/048Primary

    Activation functions · CPC title

  • Recurrent networks, e.g. Hopfield networks · CPC title

  • characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title

  • Supervised learning · CPC title

  • Explanation of inference; Explainable artificial intelligence [XAI]; Interpretable artificial intelligence · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11315008B2 cover?
This disclosure relates to method and system for providing an explanation for a prediction generated by an artificial neural network (ANN) model for a given input data. The method may include receiving the given input data and the prediction generated by the ANN model. The ANN model may be built and trained for a target application. The method may further include determining a plurality of rele…
Who is the assignee on this patent?
Wipro Ltd
What technology area does this patent fall under?
Primary CPC classification G06N3/048. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Apr 26 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).