Interpretation method for neural network model, electronic device and storage medium

US2023093528A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2023093528-A1
Application numberUS-202218070775-A
CountryUS
Kind codeA1
Filing dateNov 29, 2022
Priority dateJan 19, 2022
Publication dateMar 23, 2023
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

An interpretation method for a neural network model is provided. Input data and output data corresponding to the input data of a neural network model are acquired, in which the neural network model includes layers of networks connected sequentially, and each layer of network corresponds to a plurality of candidate concepts. A key inference path through which the output data is obtained by the neural network model based on the input data are acquired, in which the key inference path includes target concepts respectively used by the layers of networks when the input data is processed in the neural network model, in which the target concepts are selected from the plurality of candidate concepts. Interpretation information corresponding to the layers of networks is determined according to the target concepts corresponding to the layers of networks, respectively. The key inference path and the interpretation information are output.

First claim

Opening claim text (preview).

What is claimed is: 1 . An interpretation method for a neural network model, comprising: acquiring input data and output data corresponding to the input data of a neural network model, wherein the neural network model comprises layers of networks connected sequentially, and each layer of network corresponds to a plurality of candidate concepts; acquiring a key inference path through which the output data is obtained by the neural network model based on the input data, wherein the key inference path comprises target concepts respectively used by the layers of networks when the input data is processed in the neural network model, wherein the target concepts are selected from the plurality of candidate concepts; determining interpretation information corresponding to the layers of networks according to the target concepts corresponding to the layers of networks, respectively; and outputting the key inference path and the interpretation information. 2 . The method of claim 1 , wherein acquiring the key inference path through which the output data is obtained by the neural network model based on the input data comprises: acquiring a j th layer of network corresponding to the output data, wherein j is equal to N, and N is a total number of layers of networks in the neural network model; acquiring a target concept in the i th layer of network; acquiring quantitative relationships between candidate concepts in an i th layer of network and the target concept, respectively, wherein i is equal to j minus 1; determining a target concept in the layer of network according to the candidate concepts in the i th layer of network and the quantitative relationships; subtracting 1 from j, and executing acquiring the target concept in the j th layer of network when j is greater than 2; and generating the key inference path according to the target concepts in the layers of networks when j is equal to 2. 3 . The method of claim 2 , wherein determining the target concepts of the i th layer of network according to the candidate concepts in the i th layer of network and the quantitative relationships comprises: acquiring importance values of the quantitative relationships; ranking the quantitative relationships in a descending order of the importance values of the quantitative relationships to obtain a ranking result; taking out the quantitative relationships sequentially according to the ranking result, and acquiring candidate concepts corresponding to the quantitative relationships from the candidate concepts in the i th layer of network; accumulating estimated values of the candidate concepts corresponding to the quantitative relationships taken out until an accumulated value is greater than a preset threshold; and determining the target concept in the i th layer of network from the candidate concepts corresponding to the quantitative relationships taken out from the ranking result. 4 . The method of claim 1 , wherein the interpretation information comprises semantic information of the target concept. 5 . The method of claim 4 , wherein the interpretation information further comprises sample characteristics of a target sample corresponding to the target concept. 6 . The method of claim 1 , further comprising: acquiring a quantitative relationship between target concepts in two adjacent layers of networks according to the target concepts in the two adjacent layers of networks for any two adjacent layers of networks in the key inference path; and marking the quantitative relationship between the target concepts in the two adjacent layers of networks in the key inference path. 7 . An electronic device, comprising: at least one processor; and a memory communicatively connected with the at least one processor for storing instructions executable by the at least one processor; wherein the at least one processor is configured to: acquire input data and output data corresponding to the input data of a neural network model, wherein the neural network model comprises layers of networks connected sequentially, and each layer of network corresponds to a plurality of candidate concepts; acquire a key inference path through which the output data is obtained by the neural network model based on the input data, wherein the key inference path comprises target concepts respectively used by the layers of networks when the input data is processed in the neural network model, wherein the target concepts are selected from the plurality of candidate concepts; determine interpretation information corresponding to the layers of networks according to the target concepts corresponding to the layers of networks, respectively; and output the key inference path and the interpretation information. 8 . The electronic device of claim 7 , wherein the at least one processor is configured to: acquire a j th layer of network corresponding to the output data, wherein j is equal to N, and N is a total number of layers of networks in the neural network model; acquire a target concept in the j th layer of network; acquire quantitative relationships between candidate concepts in an i th layer of network and the target concept, respectively, wherein i is equal to j minus 1; determine a target concept in the i th layer of network according to the candidate concepts in the layer of network and the quantitative relationships; subtract 1 from j, and execute acquiring the target concept in the j th layer of network when j is greater than 2; and generate the key inference path according to the target concepts in the layers of networks when j is equal to 2. 9 . The electronic device of claim 8 , wherein the at least one processor is configured to: acquire importance values of the quantitative relationships; rank the quantitative relationships in a descending order of the importance values of the quantitative relationships to obtain a ranking result; take out the quantitative relationships sequentially according to the ranking result, and acquiring candidate concepts corresponding to the quantitative relationships from the candidate concepts in the layer of network; accumulate estimated values of the candidate concepts corresponding to the quantitative relationships taken out until an accumulated value is greater than a preset threshold; and determine the target concept in the layer of network from the candidate concepts corresponding to the quantitative relationships taken out from the ranking result. 10 . The electronic device of claim 7 , wherein the interpretation information comprises semantic information of the target concept. 11 . The electronic device of claim 10 , wherein the interpretation information further comprises sample characteristics of a target sample corresponding to the target concept. 12 . The electronic device of claim 7 , wherein the at least one processor is configured to: acquire a quantitative relationship between target concepts in two adjacent layers of networks according to the target concepts in the two adjacent layers of networks for any two adjacent layers of networks in the key inference path; and mark the quantitative relationship between the target concepts in the two adjacent layers of networks in the key inference path. 13 . A non-transitory computer-readable storage medium having stored therein computer instructions that, when executed by a computer, cause the computer to perform: acquiring input data and output data corresponding to the input data of a neural network model, wherein the neural network model comprises layers of networks connected sequentially, and each layer of network corresponds to a pl

Assignees

Inventors

Classifications

  • G06N3/044Primary

    Recurrent networks, e.g. Hopfield networks · CPC title

  • G06N3/04Primary

    Architecture, e.g. interconnection topology · CPC title

  • Classification techniques · CPC title

  • Learning methods · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2023093528A1 cover?
An interpretation method for a neural network model is provided. Input data and output data corresponding to the input data of a neural network model are acquired, in which the neural network model includes layers of networks connected sequentially, and each layer of network corresponds to a plurality of candidate concepts. A key inference path through which the output data is obtained by the n…
Who is the assignee on this patent?
Beijing Baidu Netcom Sci & Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06N3/044. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Mar 23 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).