Method of retrieving data, method of training deep learning model, electronic device and storage medium

US2026093779A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2026093779-A1
Application numberUS-202418889695-A
CountryUS
Kind codeA1
Filing dateSep 19, 2024
Priority dateApr 8, 2024
Publication dateApr 2, 2026
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method of retrieving data, a method of training a deep learning model, an electronic device, and a storage medium are provided, which relate to a field of artificial intelligence technology, and in particular to fields of natural language processing and deep learning technologies. The method of retrieving data includes: determining M candidate texts from a text library based on a semantic information in a query to be processed, where M is an integer greater than or equal to 1; determining N candidate texts from the text library based on a keyword information in the query to be processed, where N is an integer greater than or equal to 1; and determining at least one target text based on the M candidate texts and the N candidate texts.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method of retrieving data, comprising: determining M candidate texts from a text library based on a semantic information in a query to be processed, wherein M is an integer greater than or equal to 1; determining N candidate texts from the text library based on a keyword information in the query to be processed, wherein N is an integer greater than or equal to 1; and determining at least one target text based on the M candidate texts and the N candidate texts. 2 . The method according to claim 1 , wherein the determining M candidate texts from a text library based on a semantic information in a query to be processed comprises: determining a semantic feature of the query to be processed; matching the semantic feature with a text feature in a first text feature library to obtain M first target text features, wherein the text feature in the first text feature library is obtained by processing a text in the text library using a first deep learning model, and the first deep learning model is trained based on a similarity between a semantic feature of a sample query and a text feature of a sample text; and determining the M candidate texts from the text library based on the M first target text features. 3 . The method according to claim 2 , wherein the determining a semantic feature of the query to be processed comprises: determining a query feature of the query to be processed; performing an attention processing on the query feature to obtain an attention feature; and determining the semantic feature of the query to be processed according to the attention feature. 4 . The method according to claim 2 , further comprising: processing the text in the text library by using the first deep learning model to obtain a feature of the text in the text library; and generating the first text feature library according to the feature of the text in the text library. 5 . The method according to claim 1 , wherein the determining N candidate texts from the text library based on a keyword information in the query to be processed comprises: determining a keyword feature in the query to be processed; matching the keyword feature with a text feature in a second text feature library to obtain N second target text features, wherein the text feature in the second text feature library is obtained by processing a text in the text library using a second deep learning model, and the second deep learning model is trained based on a similarity between a keyword feature of a sample query and a text feature of a sample text; and determining the N candidate texts from the text library according to the N second target text features. 6 . The method according to claim 5 , wherein the text in the text library comprises a title and a content, and the method further comprises: processing the title and the content by using the second deep learning model to obtain a title feature and a content feature, respectively; and generating the second text feature library according to the title feature and the content feature. 7 . The method according to claim 1 , wherein the determining at least one target text based on the M candidate texts and the N candidate texts comprises: concatenating, for each candidate text among the M candidate texts and the N candidate texts, the query to be processed and the candidate text to obtain a concatenated text; determining a similarity between the query to be processed and each candidate text according to the concatenated text; and determining the at least one target text from the M candidate texts and the N candidate texts according to the similarity between the query to be processed and each candidate text. 8 . A method of training a deep learning model, wherein the deep learning model comprises a query processing sub-model and a text processing sub-model, and the method comprises: determining a positive example text of a sample query and a negative example text of the sample query; processing the sample query by using the query processing sub-model to obtain a semantic feature of the sample query; processing the positive example text and the negative example text by using the text processing sub-model to obtain a feature of the positive example text and a feature of the negative example text, respectively; determining a similarity between the sample query and the positive example text and a similarity between the sample query and the negative example text according to the semantic feature, the feature of the positive example text and the feature of the negative example text; determining a loss of the deep learning model according to the similarity between the sample query and the positive example text and the similarity between the sample query and the negative example text; and adjusting a parameter of the deep learning model according to the loss. 9 . The method according to claim 8 , wherein the query processing sub-model comprises an encoding module, an attention module, and a fully connected module; and the processing the sample query by using the query processing sub-model to obtain a semantic feature of the sample query comprises: inputting the sample query into the encoding module to obtain a query feature of the sample query; inputting the query feature into the attention module to obtain an attention feature of the sample query; and inputting the attention feature into the fully connected module to obtain the semantic feature of the sample query. 10 . The method according to claim 8 , wherein a plurality of sample queries are provided, and the method further comprises: determining, for each sample query of the plurality of sample queries, the negative example text of the sample query and positive example texts and negative example texts of other sample queries among the plurality of sample queries other than the sample query as a set of negative example texts of the sample query. 11 . The method according to claim 10 , wherein the processing the positive example text and the negative example text by using the text processing sub-model so as to obtain a feature of the positive example text and a feature of the negative example text comprises: processing the positive example text and each negative example text in the set of negative example texts by using the text processing sub-model to obtain the feature of the positive example text and a feature of each negative example text in the set of negative example texts, respectively; and wherein the determining a loss of the deep learning model according to the similarity between the sample query and the positive example text and the similarity between the sample query and the negative example text comprises: determining the loss of the deep learning model according to the similarity between the sample query and the positive example text and a similarity between the sample query and each negative example text in the set of negative example texts. 12 . An electronic device, comprising: at least one processor; and a memory communicatively connected to the at least one processor, wherein the memory stores instructions executable by the at least one processor, and the instructions, when executed by the at least one processor, are configured to cause the at least one processor to at least: determine M candidate texts from a text library based on a semantic information in a query to be processed, wherein M is an integer greater than or equal to 1; determine N candidate texts from the text library based on a keyword information in the query to be processed, wherein N is an integer greater than or equal to 1; and determine at least one target text

Assignees

Inventors

Classifications

  • Semantic analysis · CPC title

  • Selection or weighting of terms from queries, including natural language queries · CPC title

  • Learning methods · CPC title

  • G06F18/22Primary

    Matching criteria, e.g. proximity measures · CPC title

  • G06F16/335Primary

    Filtering based on additional data, e.g. user or group profiles (filtering in web context G06F16/9535, G06F16/9536) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2026093779A1 cover?
A method of retrieving data, a method of training a deep learning model, an electronic device, and a storage medium are provided, which relate to a field of artificial intelligence technology, and in particular to fields of natural language processing and deep learning technologies. The method of retrieving data includes: determining M candidate texts from a text library based on a semantic inf…
Who is the assignee on this patent?
Beijing Baidu Netcom Sci & Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06F18/22. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Apr 02 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).