User interfaces for navigation of knowledge graph source data
US-2024378461-A1 · Nov 14, 2024 · US
US2024256906A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2024256906-A1 |
| Application number | US-202318401074-A |
| Country | US |
| Kind code | A1 |
| Filing date | Dec 29, 2023 |
| Priority date | Jan 27, 2023 |
| Publication date | Aug 1, 2024 |
| Grant date | — |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A method includes predicting, using the at least one processing device, a question type for each section of a document using a trained question type prediction model, each section including a different portion of the document. The method also includes generating, using the at least one processing device, multiple question-answer pairs using a trained question-answer generation model that receives the predicted question types and the document as input. Each question-answer pair includes (i) a question having a type corresponding to one of the predicted question types and being associated with content in the section corresponding to the type and (ii) an answer to the question. The method further includes outputting, using the at least one processing device, the question-answer pairs for use in training a question answering model.
Opening claim text (preview).
What is claimed is: 1 . A method comprising: predicting, using at least one processing device of an electronic device, a question type for each section of a document using a trained question type prediction model, each section including a different portion of the document; generating, using the at least one processing device, multiple question-answer pairs using a trained question-answer generation model that receives the predicted question types and the document as input, each question-answer pair comprising (i) a question having a type corresponding to one of the predicted question types and being associated with content in the section corresponding to the type and (ii) an answer to the question; and outputting, using the at least one processing device, the question-answer pairs for use in training a question answering model. 2 . The method of claim 1 , wherein the type of the question in each question-answer pair indicates that the associated question starts with one of: what, when, where, who, whom, which, whose, why, or how. 3 . The method of claim 1 , wherein the question type prediction model is trained to predict one or more question types for the document based on one or more entities described in the document. 4 . The method of claim 3 , further comprising: predicting a question type for each of the one or more entities using the question type prediction model; inputting the document and the question type for each of the one or more entities to the question-answer generation model; and generating multiple additional question-answer pairs using the question-answer generation model. 5 . The method of claim 1 , wherein: the question type prediction model is trained to output a list of possible question types for possible combinations of section and entity for the document; and generating the multiple question-answer pairs using the trained question-answer generation model comprises generating at least one question-answer pair for each of the possible combinations. 6 . The method of claim 5 , wherein: each question-answer pair is associated with a quality score; and a particular question-answer pair is not output if the quality score of the particular question-answer pair is less than a specified threshold. 7 . The method of claim 1 , further comprising: comparing the question-answer pairs to each other to determine any duplicate question-answer pairs; and removing one or more duplicate question-answer pairs before outputting the question-answer pairs. 8 . The method of claim 1 , wherein the question type prediction model and the question-answer generation model comprise large language models. 9 . An electronic device comprising: at least one processing device configured to: predict a question type for each section of a document using a trained question type prediction model, each section including a different portion of the document; generate multiple question-answer pairs using a trained question-answer generation model that receives the predicted question types and the document as input, each question-answer pair comprising (i) a question having a type corresponding to one of the predicted question types and being associated with content in the section corresponding to the type and (ii) an answer to the question; and output the question-answer pairs for use in training a question answering model. 10 . The electronic device of claim 9 , wherein the type of the question in each question-answer pair indicates that the associated question starts with one of: what, when, where, who, whom, which, whose, why, or how. 11 . The electronic device of claim 9 , wherein the question type prediction model is trained to predict one or more question types for the document based on one or more entities described in the document. 12 . The electronic device of claim 11 , wherein the at least one processing device is further configured to: predict a question type for each of the one or more entities using the question type prediction model; input the document and the question type for each of the one or more entities to the question-answer generation model; and generate multiple additional question-answer pairs using the question-answer generation model. 13 . The electronic device of claim 9 , wherein: the question type prediction model is trained to output a list of possible question types for possible combinations of section and entity for the document; and to generate the multiple question-answer pairs using the trained question-answer generation model, the at least one processing device is configured to generate at least one question-answer pair for each of the possible combinations. 14 . The electronic device of claim 13 , wherein: each question-answer pair is associated with a quality score; and the at least one processing device is configured to not output a particular question-answer pair if the quality score of the particular question-answer pair is less than a specified threshold. 15 . The electronic device of claim 9 , wherein the at least one processing device is further configured to: compare the question-answer pairs to each other to determine any duplicate question-answer pairs; and remove one or more duplicate question-answer pairs before outputting the question-answer pairs. 16 . The electronic device of claim 9 , wherein the question type prediction model and the question-answer generation model comprise large language models. 17 . A non-transitory machine-readable medium containing instructions that when executed cause at least one processor of an electronic device to: predict a question type for each section of a document using a trained question type prediction model, each section including a different portion of the document; generate multiple question-answer pairs using a trained question-answer generation model that receives the predicted question types and the document as input, each question-answer pair comprising (i) a question having a type corresponding to one of the predicted question types and being associated with content in the section corresponding to the type and (ii) an answer to the question; and output the question-answer pairs for use in training a question answering model. 18 . The non-transitory machine-readable medium of claim 17 , wherein: the question type prediction model is trained to predict one or more question types for the document based on one or more entities described in the document; and the non-transitory machine-readable medium further contains instructions that when executed cause the at least one processor to: predict a question type for each of the one or more entities using the question type prediction model; input the document and the question type for each of the one or more entities to the question-answer generation model; and generate multiple additional question-answer pairs using the question-answer generation model. 19 . The non-transitory machine-readable medium of claim 17 , wherein: the question type prediction model is trained to output a list of possible question types for possible combinations of section and entity for the document; and the instructions that when executed cause the at least one processor to generate the multiple question-answer pairs using the trained question-answer generation model comprise: instructions that when executed cause the at least one processor to generate at least one question-answer pair for each of the possible combinations. 20
using statistical methods · CPC title
Discourse or dialogue representation · CPC title
Semantic analysis · CPC title
Statistical methods, e.g. probability models · CPC title
Natural language generation · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.