Answer information generation method

US12411876B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12411876-B2
Application numberUS-202418747415-A
CountryUS
Kind codeB2
Filing dateJun 18, 2024
Priority dateNov 27, 2023
Publication dateSep 9, 2025
Grant dateSep 9, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method is provided. The method includes: obtaining, in response to receiving a question text from a user, a semantic vector of the question text and event information related to a specific field; obtaining a plurality of candidate documents from a document library of the specific field based on at least two of the semantic vector of the question text, the at least one piece of argument information and the event category; determining quality evaluation information for each candidate document in the plurality of candidate documents based on the event category; and determining at least one target document from the plurality of candidate documents based on the quality evaluation information of each candidate document and a correlation between each candidate document and the question text, to obtain, based on the at least one target document, answer information used to answer the question text.

First claim

Opening claim text (preview).

What is claimed is: 1. An answer information generation method based on a large language model, comprising: obtaining, in response to receiving a question text from a user, a semantic vector of the question text and event information related to a specific field, wherein the event information includes an event category concerning the question text and at least one piece of argument information in the question text; obtaining a plurality of candidate documents from a document library of the specific field based on at least two of the semantic vector of the question text, the at least one piece of argument information and the event category; determining, based on the event category, at least one document evaluation dimension corresponding to the event category, wherein: the at least one document evaluation dimension corresponds to at least one aspect of document quality, each document evaluation dimension of the at least one document evaluation dimension corresponds to a plurality of document categories, and the plurality of document categories are determined based on an aspect of document quality that a corresponding document evaluation dimension focuses on; determining, based on the event category, a quality score corresponding to each document category of the plurality of document categories of each document evaluation dimension; determining quality evaluation information for each candidate document in the plurality of candidate documents based on a document category corresponding to each of the at least one document evaluation dimension of the candidate document and a quality score corresponding to the event category for the document category; and determining at least one target document from the plurality of candidate documents based on the quality evaluation information of each candidate document and a correlation between each candidate document and the question text, to obtain, based on the at least one target document, answer information used to answer the question text. 2. The method according to claim 1 , wherein the determining the at least one target document from the plurality of candidate documents based on the quality evaluation information of each candidate document and a correlation between each candidate document and the question text comprises: determining a comprehensive score for each candidate document in the plurality of candidate documents based on the quality evaluation information of each candidate document and the correlation between each candidate document and the question text; and determining, as the at least one target document, at least one candidate document whose comprehensive score meets a preset condition in the plurality of candidate documents. 3. The method according to claim 2 , wherein the quality evaluation information includes a quality score corresponding to each of at least one document evaluation dimension of a corresponding candidate document, and wherein the determining the comprehensive score for each candidate document in the plurality of candidate documents based on the quality evaluation information of each candidate document and the correlation between each candidate document and the question text comprises: performing, for each candidate document in the plurality of candidate documents, following operations: inputting at least the question text, the candidate document, and the quality evaluation information for the candidate document into a trained ranking model; determining the correlation between the candidate document and the question text by using the ranking model based on at least the question text and the candidate document; and determining the comprehensive score for the candidate document by using the ranking model based on the quality evaluation information for the candidate document and the correlation between the candidate document and the question text. 4. The method according to claim 3 , wherein the document library includes a plurality of preset documents, wherein each preset document of the plurality of preset documents includes a corresponding document semantic vector, at least one document event category and at least one piece of document argument information, and wherein the obtaining the plurality of candidate documents from the document library of the specific field based on at least two of the semantic vector of the question text, the at least one piece of argument information, and the event category comprises: retrieving, based on the semantic vector of the question text and the corresponding document semantic vector of each preset document in the document library a plurality of first candidate documents with highest semantic similarities from the document library; obtaining a plurality of second candidate documents from the document library, wherein each second candidate document of the plurality of second candidate documents meets at least one of following conditions: a document event category of the second candidate document matches the event category; and the second candidate document includes at least one piece of document argument information that matches one or more pieces of argument information in the at least one piece of argument information; and obtaining the plurality of candidate documents based on the plurality of first candidate documents and the plurality of second candidate documents. 5. The method according to claim 4 , wherein obtaining the document semantic vector of each preset document in the document library comprises: performing, for each preset document in the document library, following operations: paragraphing the preset document to obtain at least one document paragraph; obtaining at least one paragraph semantic vector corresponding to the at least one document paragraph; and obtaining the document semantic vector of the preset document based on the at least one paragraph semantic vector. 6. The method according to claim 5 , wherein obtaining, based on the at least one target document, the answer information used to answer the question text comprises: organizing the question text and the at least one target document into an instruction text based on a preset instruction template; and inputting the instruction text into an answer information generation model to obtain the answer information by the answer information generation model. 7. An electronic device, comprising: one or more processors; and a memory storing one or more programs configured to be executed by the one or more processors, the one or more programs comprising instructions for: obtaining, in response to receiving a question text from a user, a semantic vector of the question text and event information related to a specific field, wherein the event information includes an event category concerning the question text and at least one piece of argument information in the question text; obtaining a plurality of candidate documents from a document library of the specific field based on at least two of the semantic vector of the question text, the at least one piece of argument information and the event category; determining, based on the event category, at least one document evaluation dimension corresponding to the event category, wherein: the at least one document evaluation dimension corresponds to at least one aspect of document quality, each document evaluation dimension of the at least one document evaluation dimension corresponds to a plurality of document categories, and the plurality of document categories are determined based on an aspect of document quality that a corresponding document evaluation dimension focuses on; determining, based on the event category, a quality score corresponding to each document category of the plurality of document categories of each doc

Assignees

Inventors

Classifications

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12411876B2 cover?
A method is provided. The method includes: obtaining, in response to receiving a question text from a user, a semantic vector of the question text and event information related to a specific field; obtaining a plurality of candidate documents from a document library of the specific field based on at least two of the semantic vector of the question text, the at least one piece of argument inform…
Who is the assignee on this patent?
Beijing Baidu Netcom Sci & Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06F40/186. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 09 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).