Method and apparatus for machine reading comprehension, and non-transitory computer-readable recording medium

US12175206B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12175206-B2
Application numberUS-202217821227-A
CountryUS
Kind codeB2
Filing dateAug 22, 2022
Priority dateSep 7, 2021
Publication dateDec 24, 2024
Grant dateDec 24, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method and an apparatus for machine reading comprehension, and a non-transitory computer-readable recording medium are provided. In the method, a paragraph-question pair is obtained, and subword vectors corresponding to subwords in the paragraph-question pair are generated. Then, for each subword, relative positions of the subword with respect to the other subwords are determined based on distances, and self-attention information of the subword in a first part and mutual attention information of the subword in a second part are calculated by using the relative positions and the subword vector. Then, a fusion vector of the subword is generated based on the self-attention information and the mutual attention information. Then, the fusion vectors of the subwords are input to a decoder of a machine reading comprehension model so as to obtain an answer predicted by the decoder.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for machine reading comprehension, the method comprising: obtaining a paragraph-question pair including a paragraph and a question, and generating subword vectors corresponding to subwords in the paragraph-question pair; for each subword in the paragraph-question pair, determining, based on distances between the subword and other subwords in the paragraph-question pair, relative positions of the subword with respect to the other subwords in the paragraph-question pair, and calculating self-attention information of the subword in a first part and mutual attention information of the subword in a second part by using the relative positions and the subword vector of the subword, the first part being the paragraph or the question where the subword is located, in a case where the first part is the paragraph, the second part being the question, and in a case where the first part is the question, the second part being the paragraph; for each subword in the paragraph-question pair, generating, based on the self-attention information and the mutual attention information of the subword, a fusion vector of the subword; and inputting the fusion vectors of the subwords in the paragraph-question pair to a decoder of a machine reading comprehension model so as to obtain an answer predicted by the decoder. 2. The method for machine reading comprehension as claimed in claim 1 , wherein calculating the self-attention information of the subword in a first part includes for each subword in the paragraph-question pair, calculating self-attention weights of subwords in the first part by using the relative positions of the subword with respect to the subwords in the first part; and performing weighted summation on the subword vectors corresponding to the subwords in the first part by using the self-attention weights so as to obtain the self-attention information of the subword in the first part. 3. The method for machine reading comprehension as claimed in claim 1 , wherein calculating the mutual attention information of the subword in a second part includes for each subword in the paragraph-question pair, calculating mutual attention weights of subwords in the second part by using the relative positions of the subword with respect to the subwords in the second part; and performing weighted summation on the subword vectors corresponding to the subwords in the second part by using the mutual attention weights so as to obtain the mutual attention information of the subword in the second part. 4. The method for machine reading comprehension as claimed in claim 1 , wherein generating the fusion vector of the subword includes for each subword in the paragraph-question pair, fusing the self-attention information and the mutual attention information of the subword so as to obtain the fusion vector of the subword. 5. The method for machine reading comprehension as claimed in claim 1 , wherein generating the fusion vector of the subword includes for each subword in the paragraph-question pair, fusing the self-attention information and the mutual attention information of the subword so as to obtain attention information of the subword; and fusing the attention information of the subword and word boundary information of the subword in a word to which the subword belongs so as to obtain the fusion vector of the subword. 6. The method for machine reading comprehension as claimed in claim 5 , wherein fusing the attention information and the word boundary information includes determining, based on whether the subword is the first subword in the word to which the subword belongs, word starting information corresponding to the subword; determining, based on whether the subword is the last subword in the word to which the subword belongs, word ending information corresponding to the subword; and performing vector fusion on the attention information, the word starting information and the word ending information of the subword so as to obtain the fusion vector of the subword. 7. The method for machine reading comprehension as claimed in claim 4 , wherein fusing the self-attention information and the mutual attention information of the subword includes performing vector addition or vector merging on the self-attention information and the mutual attention information of the subword. 8. An apparatus for machine reading comprehension, the apparatus comprising: a memory storing computer-executable instructions; and one or more processors configured to execute the computer-executable instructions such that the one or more processors are configured to obtain a paragraph-question pair including a paragraph and a question, and generate subword vectors corresponding to subwords in the paragraph-question pair; for each subword in the paragraph-question pair, determine, based on distances between the subword and other subwords in the paragraph-question pair, relative positions of the subword with respect to the other subwords in the paragraph-question pair, and calculate self-attention information of the subword in a first part and mutual attention information of the subword in a second part by using the relative positions and the subword vector of the subword, the first part being the paragraph or the question where the subword is located, in a case where the first part is the paragraph, the second part being the question, and in a case where the first part is the question, the second part being the paragraph; for each subword in the paragraph-question pair, generate, based on the self-attention information and the mutual attention information of the subword, a fusion vector of the subword; and input the fusion vectors of the subwords in the paragraph-question pair to a decoder of a machine reading comprehension model so as to obtain an answer predicted by the decoder. 9. The apparatus for machine reading comprehension as claimed in claim 8 , wherein the one or more processors are configured to for each subword in the paragraph-question pair, calculate self-attention weights of subwords in the first part by using the relative positions of the subword with respect to the subwords in the first part; and perform weighted summation on the subword vectors corresponding to the subwords in the first part by using the self-attention weights so as to obtain the self-attention information of the subword in the first part. 10. The apparatus for machine reading comprehension as claimed in claim 8 , wherein the one or more processors are configured to for each subword in the paragraph-question pair, calculate mutual attention weights of subwords in the second part by using the relative positions of the subword with respect to the subwords in the second part; and perform weighted summation on the subword vectors corresponding to the subwords in the second part by using the mutual attention weights so as to obtain the mutual attention information of the subword in the second part. 11. The apparatus for machine reading comprehension as claimed in claim 8 , wherein the one or more processors are configured to for each subword in the paragraph-question pair, fuse the self-attention information and the mutual attention information of the subword so as to obtain the fusion vector of the subword. 12. The apparatus for machine reading comprehension as claimed in claim 8 , wherein the one or more processors are configured to for each subword in the paragraph-question pair, fuse the self-attention information and the mutual attention information of the subword so as to obtain attention information of the subword; and fuse the attention information of the subword

Assignees

Inventors

Classifications

  • Auto-encoder networks; Encoder-decoder networks · CPC title

  • Selection or weighting of terms from queries, including natural language queries · CPC title

  • Natural language generation · CPC title

  • Knowledge representation; Symbolic representation · CPC title

  • Natural language query formulation · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12175206B2 cover?
A method and an apparatus for machine reading comprehension, and a non-transitory computer-readable recording medium are provided. In the method, a paragraph-question pair is obtained, and subword vectors corresponding to subwords in the paragraph-question pair are generated. Then, for each subword, relative positions of the subword with respect to the other subwords are determined based on dis…
Who is the assignee on this patent?
Xiao Tianxiong, Cheng Rui, Dong Bin, and 3 more
What technology area does this patent fall under?
Primary CPC classification G06F40/30. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 24 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).