Data query method and apparatus based on large model, electronic device, and storage medium

US12505092B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12505092-B2
Application numberUS-202418974155-A
CountryUS
Kind codeB2
Filing dateDec 9, 2024
Priority dateSep 18, 2024
Publication dateDec 23, 2025
Grant dateDec 23, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Data query method and apparatus based on large model, an electronic device, and a storage medium are disclosed, which relates to the field of artificial intelligence, specifically in natural language processing, deep learning, and large model technologies, applicable to scenarios such as dialogue systems and information retrieval. The method includes: performing entity recognition on a query to obtain the target entity in the query; obtaining a first related content associated with the target entity from internal information, and performing data analysis on the first related content using a large language model (LLM) to obtain a data analysis result; obtaining a second related content associated with the target entity from external information, and performing data generation on the second related content using the LLM to obtain a data generation result; obtaining a query result corresponding to the query based on the data analysis result and the data generation result.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computer-implemented method for data query based on large model, comprising: performing entity recognition on a query input by a user in natural language to obtain a target entity in the query, where the target entity comprises one or more of geographic information, time information, and domain information; converting the target entity into an entity vector; obtaining a target content that matches the entity vector in a database from internal information so that the target entity in the query is correctly mapped to the corresponding field in the database; obtaining impact factors related to the domain information of the target entity based on a knowledge graph and analyzing the trends of the impact factors related to the geographic information and the time information of the target entity to obtain a relevant content of the target content; using the target content and the relevant content as first related content, and performing data analysis on the first related content using a large language model (LLM) to obtain a data analysis result; obtaining a second related content associated with the target entity from external information, and performing data generation on the second related content using the LLM to obtain a data generation result; and obtaining a query result corresponding to the query based on the data analysis result and the data generation result. 2 . The method of claim 1 , wherein performing data analysis on the first related content using the LLM to obtain the data analysis result comprises: analyzing the first related content using the LLM to obtain an analysis result; calibrating the query using the LLM based on the analysis result to obtain a calibration result; and summarizing the analysis result and the calibration result using the LLM to obtain the data analysis result. 3 . The method of claim 1 , wherein obtaining the second related content associated with the target entity from the external information comprises: obtaining external information that matches with the time of the query based on a publication time of the external information; and obtaining the second related content associated with the target entity from the external information that matches with the time of the query. 4 . The method of claim 1 , wherein performing data generation on the second related content using the LLM to obtain the data generation result comprises: filtering the second related content using the LLM to obtain a filtering result; and performing generation on the data filtering result using the LLM based on an information source corresponding to the filtering result to obtain the data generation result. 5 . The method of claim 1 , wherein performing entity recognition on the query to obtain the target entity in the query comprises: performing entity recognition on the query based on the LLM to obtain the target entity. 6 . An electronic device, comprising: at least one processor; and a memory communicatively connected with the at least one processor; wherein the memory stores instructions executable by the at least one processor, and the instructions are executed by the at least one processor to enable the at least one processor to perform a method for data query based on large model, wherein the method for data query based on large model comprises: performing entity recognition on a query input by a user in natural language to obtain a target entity in the query, where the target entity comprises one or more of geographic information, time information, and domain information; converting the target entity into an entity vector; obtaining a target content that matches the entity vector in a database from internal information so that the target entity in the query is correctly mapped to the corresponding field in the database; obtaining impact factors related to the domain information of the target entity based on a knowledge graph and analyzing the trends of the impact factors related to the geographic information and the time information of the target entity to obtain a relevant content of the target content; using the target content and the relevant content as first related content, and performing data analysis on the first related content using a large language model (LLM) to obtain a data analysis result; obtaining a second related content associated with the target entity from external information, and performing data generation on the second related content using the LLM to obtain a data generation result; and obtaining a query result corresponding to the query based on the data analysis result and the data generation result. 7 . The electronic device of claim 6 , wherein performing data analysis on the first related content using the LLM to obtain the data analysis result comprises: analyzing the first related content using the LLM to obtain an analysis result; calibrating the query using the LLM based on the analysis result to obtain a calibration result; and summarizing the analysis result and the calibration result using the LLM to obtain the data analysis result. 8 . The electronic device of claim 6 , wherein obtaining the second related content associated with the target entity from the external information comprises: obtaining external information that matches with the time of the query based on a publication time of the external information; and obtaining the second related content associated with the target entity from the external information that matches with the time of the query. 9 . The electronic device of claim 6 , wherein performing data generation on the second related content using the LLM to obtain the data generation result comprises: filtering the second related content using the LLM to obtain a filtering result; and performing generation on the data filtering result using the LLM based on an information source corresponding to the filtering result to obtain the data generation result. 10 . The electronic device of claim 6 , wherein performing entity recognition on the query to obtain the target entity in the query comprises: performing entity recognition on the query based on the LLM to obtain the target entity. 11 . A non-transitory computer readable storage medium with computer instructions stored thereon, wherein the computer instructions are used for causing a method for data query based on large model, wherein the method for data query based on large model comprises: performing entity recognition on a query input by a user in natural language to obtain a target entity in the query, where the target entity comprises one or more of geographic information, time information, and domain information; converting the target entity into an entity vector; obtaining a target content that matches the entity vector in a database from internal information so that the target entity in the query is correctly mapped to the corresponding field in the database; obtaining impact factors related to the domain information of the target entity based on a knowledge graph and analyzing the trends of the impact factors related to the geographic information and the time information of the target entity to obtain a relevant content of the target content; using the target content and the relevant content as first related content, and performing data analysis on the first related content using a large language model (LLM) to obtain a data analysis result; obtaining a second related content associated with the target entity from external information, and performing data generation on the second related content using the LLM to obtain a data generation result; and obtaining a query result corresponding to

Assignees

Inventors

Classifications

  • Named entity recognition · CPC title

  • Translation of natural language queries to structured queries · CPC title

  • Tablespace storage structures; Management thereof · CPC title

  • Natural language query formulation · CPC title

  • Ontology · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12505092B2 cover?
Data query method and apparatus based on large model, an electronic device, and a storage medium are disclosed, which relates to the field of artificial intelligence, specifically in natural language processing, deep learning, and large model technologies, applicable to scenarios such as dialogue systems and information retrieval. The method includes: performing entity recognition on a query to…
Who is the assignee on this patent?
Beijing Baidu Netcom Sci & Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06F16/2425. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 23 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).