Method of training dialog generation model for recommendations, method of generating recommendations, and device

US12430516B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12430516-B2
Application numberUS-202218056137-A
CountryUS
Kind codeB2
Filing dateNov 16, 2022
Priority dateFeb 21, 2022
Publication dateSep 30, 2025
Grant dateSep 30, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure provides a method of training an information generation model, a method of generating an information, an electronic device, and a storage medium. A specific implementation solution of the method of training the information generation model includes: splitting a description information for a target object in an information pair into at least one description word, so as to obtain a description word sequence, wherein the information pair further includes a first recommendation information; inputting the description word sequence into a dialog generation model to obtain a probability vector sequence for the target object, wherein each probability vector in the probability vector sequence includes probability values for a plurality of predetermined words; and training the dialog generation model according to the probability vector sequence and the first recommendation information, so as to obtain the information generation model.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of training an information generation model that is implemented by a processor, the method comprising: splitting a description information for a target object associated with a product to be promoted in an information pair into at least one description word, so as to obtain a description word sequence, wherein the information pair further comprises a first recommendation information; inputting the description word sequence into a dialog generation model configured to generate diverse and contextually relevant recommendation information to obtain a probability vector sequence for the target object, wherein each probability vector in the probability vector sequence comprises probability values for a plurality of predetermined words; and training the dialog generation model according to the probability vector sequence and the first recommendation information, to generate recommendation information exhibiting diversity and contextual alignment with the description information, as enabled by the discrete latent variable and multi-task learning configuration of the dialog generation model, so as to obtain the information generation model, wherein the probability vector sequence indicates a second recommendation information for the target object; wherein the training the dialog generation model according to the probability vector sequence and the first recommendation information comprises: determining, according to the probability vector sequence, a prediction probability that the second recommendation information comprises a prompt information; determining a first loss value of the dialog generation model according to the prediction probability; determining a second loss value of the dialog generation model according to an association relationship between the second recommendation information and the description information; and training the dialog generation model according to the first loss value, the second loss value, and the first recommendation information, wherein the dialog generation model comprises a pre-trained dialog generation model with a discrete latent variable; wherein the inputting the description word sequence into the dialog generation model to obtain the probability vector sequence for the target object comprises: inputting a random identification information and the description word sequence into the dialog generation model to enable multi-task learning by jointly training for association prediction and text generation tasks, so as to obtain the probability vector sequence and an association prediction value corresponding to the random identification information, and wherein the association prediction value indicates the association relationship between the second recommendation information and the description information, wherein the generated recommendation information is configured for presentation to a user via a human-computer interaction interface or used for display in connection with the promoted item. 2. The method according to claim 1 , further comprising: splitting the prompt information for the target object into at least one prompt word, so as to obtain a prompt word sequence; wherein the inputting the description word sequence into a dialog generation model to obtain a probability vector sequence for the target object further comprises: inputting the description word sequence and the prompt word sequence into the dialog generation model to obtain the probability vector sequence. 3. The method according to claim 1 , wherein the determining, according to the probability vector sequence, the prediction probability that the second recommendation information comprises the prompt information comprises: determining, according to a probability value in the probability vector sequence for each prompt word in the prompt word sequence, a probability that the second recommendation information contains each prompt word; and determining, according to at least one probability that the second recommendation information contains at least one prompt word, the prediction probability that the second recommendation information comprises the prompt information. 4. The method according to claim 1 , further comprising: determining, in response to repeated words being contained in the second recommendation information, a probability vector for the repeated words in the probability vector sequence as a target probability vector according to a position information of the repeated words in the second recommendation information; and determining a third loss value of the dialog generation model according to the target probability vector and the repeated words; wherein the training the dialog generation model according to the probability vector sequence and the first recommendation information further comprises: training the dialog generation model according to the first loss value, the second loss value, and the third loss value. 5. The method according to claim 1 , further comprising: splitting the first recommendation information into at least one recommendation word, so as to obtain a recommendation word sequence; determining, in response to repeated words being contained in the recommendation word sequence, a probability vector for the repeated words in the probability vector sequence as a target probability vector according to a position information of the repeated words in the recommendation word sequence; and determining a third loss value of the dialog generation model according to the target probability vector and the repeated words; wherein the training the dialog generation model according to the probability vector sequence and the first recommendation information further comprises: training the dialog generation model according to the first loss value, the second loss value, and the third loss value. 6. A method of generating an information that is implemented by a processor, the method comprising: splitting a description information for an object to be recommended into at least one description word, so as to obtain a description word sequence; inputting the description word sequence into an information generation model to obtain a probability vector sequence for the object to be recommended, wherein each probability vector in the probability vector sequence comprises probability values for a plurality of predetermined words; and determining a recommendation information for the object to be recommended, according to the probability vector sequence, wherein the information generation model is trained using a method of training an information generation model, comprising: splitting a description information for a target object associated with a product to be promoted in an information pair into at least one description word, so as to obtain a description word sequence, wherein the information pair further comprises a first recommendation information; inputting the description word sequence into a dialog generation model configured to generate diverse and contextually relevant recommendation information to obtain a probability vector sequence for the target object, wherein each probability vector in the probability vector sequence comprises probability values for a plurality of predetermined words; and training the dialog generation model according to the probability vector sequence and the first recommendation information, to generate recommendation information exhibiting diversity and contextual alignment with the description information, as enabled by the discrete latent variable and multi-task learning configuration of the dialog generation model, so as to obtain the information generation model, wherein the probability vector sequence indicates a second recommendation information for the target object; wherein the trai

Assignees

Inventors

Classifications

  • Lexical analysis, e.g. tokenisation or collocates · CPC title

  • Natural language generation · CPC title

  • Combinations of networks · CPC title

  • Recurrent networks, e.g. Hopfield networks · CPC title

  • Backpropagation, e.g. using gradient descent · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12430516B2 cover?
The present disclosure provides a method of training an information generation model, a method of generating an information, an electronic device, and a storage medium. A specific implementation solution of the method of training the information generation model includes: splitting a description information for a target object in an information pair into at least one description word, so as to …
Who is the assignee on this patent?
Beijing Baidu Netcom Sci & Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06F40/40. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 30 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).