Method and apparatus for generating information assessment model

US11521016B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11521016-B2
Application numberUS-201916700593-A
CountryUS
Kind codeB2
Filing dateDec 2, 2019
Priority dateJun 20, 2019
Publication dateDec 6, 2022
Grant dateDec 6, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments of the present disclosure provide a method for generating an information assessment model, a method for determining the usefulness of comment information, apparatus, electronic device, and computer-readable medium. The method may include: acquiring training samples, the training samples including first sample comment information with a usefulness label and second sample comment information without a usefulness label; acquiring a predictor model and a discriminator model respectively constructed based on a generative network and a discrimination network in a generative adversarial network, and pre-training the predictor model using the first sample comment information, the predictor model being used to predict a usefulness label of a piece of comment information, the discriminator model being used to discriminate authenticity of a usefulness label; and training the predictor model and the discriminator model by iteratively performing a plurality of times of training operations, using the trained predictor model as an information assessment model.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for generating an information assessment model, comprising: acquiring training samples, the training samples comprising first sample comment information with a usefulness label and second sample comment information without a usefulness label; acquiring a predictor model and a discriminator model respectively constructed based on a generative network and a discrimination network in a generative adversarial network, and pre-training the predictor model using the first sample comment information, the predictor model being used to predict a usefulness label of a piece of comment information, the discriminator model being used to discriminate authenticity of a usefulness label; and training the predictor model and the discriminator model by iteratively performing a plurality times of training operations, using the trained predictor model as an information assessment model; wherein a training operation comprises: predicting a usefulness label of the second sample comment information using the predictor model; using the usefulness label of the first sample comment information as a real label, using the usefulness label of the second sample comment information as a false label, discriminating authenticity of the usefulness label of the first sample comment information and authenticity of the usefulness label of the second sample comment information by using the discriminator model; iteratively updating, based on an error of the discrimination result of the discriminator model, a parameter of the discriminator model and a reward function, the reward function being associated with the parameter of the discriminator model and a parameter of the predictor model; determining a desired reward of the predictor model based on the reward function and the predictor model, and iteratively updating the parameter of the predictor model based on an error of the desired reward. 2. The method according to claim 1 , wherein the discriminating authenticity of the usefulness label of the first sample comment information and authenticity of the usefulness label of the second sample comment information by using the discriminator model comprises: selecting usefulness labels of an equal number of first sample comment information and second sample comment information, and discriminating authenticity of the selected usefulness labels by using the discriminator model. 3. The method according to claim 1 , wherein the error of the discrimination result of the discriminator model comprises a sum of an opposite number of a first cross entropy and an opposite number of a second cross entropy; the first cross entropy comprises: a cross entropy between a first probability distribution of that the usefulness label of the first sample comment information is a real label and a second probability distribution of that the discriminator model discriminates the usefulness label of the first sample comment information as a real label; the second cross entropy comprises: a cross entropy between a third probability distribution of that the usefulness label of the second sample comment information predicted by the predictor model is a real label and a fourth probability distribution of that the discriminator model discriminates the usefulness label of the second sample comment information as a real label. 4. The method according to claim 1 , wherein when an input of the reward function is the second sample comment information, an output is a result of authenticity discrimination by the discriminator model for the usefulness label of the second sample comment information; and the determining a desired reward of the predictor model based on the reward function and the predictor model, and iteratively updating the parameter of the predictor model based on an error of the desired reward comprises: multiplying the reward function by a prediction result of the predictor model to calculate the desired reward, and iteratively updating, in response to determining that the desired reward does not reach a preset reward value condition, the parameter of the predictor model. 5. The method according to claim 4 , wherein when the input of the reward function is the first sample comment information and the discriminator model discriminates the usefulness label of the first sample comment information as a real label, the value of the reward function is 1; when the input of the reward function is the first sample comment information and the discriminator model discriminates the usefulness label of the first sample comment information as a false label, the value of the reward function is 0. 6. The method according to claim 1 , wherein the first sample comment information further has a usefulness index label, the usefulness index label is used to characterize a degree of usefulness, and the predictor model is further used to predict a usefulness index label of a piece of comment information; the training operation further comprises: predicting a usefulness index label of the second sample comment information using the predictor model; and determining a prediction error of the predictor model for the usefulness index label; and the iteratively updating the parameter of the predictor model based on an error of the desired reward comprises: iteratively updating the parameter of the predictor model based on the error of the desired reward and the prediction error of the predictor model for the usefulness index label alternately. 7. The method according to claim 1 , wherein the method further comprises: acquiring comment object information corresponding to the training samples; and the predictor model comprises an information preprocessing network and a classification network; wherein the information preprocessing network is used to convert a training sample and comment object information corresponding thereto into a mathematical representation, and the classification network predicts usefulness label of the training sample based on the mathematical representation. 8. The method according to claim 7 , wherein the information preprocessing network comprises a first bi-directional long short-term memory network and a second bi-directional long short-term memory network; the first bi-directional long short-term memory network converts the training sample into a first vector representation, and the second bi-directional long short-term memory network converts the comment object information corresponding to the training sample into a second vector representation; the information preprocessing network integrates, based on an attention mechanism, the first vector representation of the training sample and the second vector representation of the comment object information corresponding thereto into the mathematical representation corresponding to the training sample. 9. A method for determining the usefulness of comment information, comprising: acquiring a set of comment information; and inputting the comment information in the set of comment information into the information assessment model generated by the method for generating an information assessment model according to claim 1 , to obtain usefulness labels of the comment information. 10. The method according to claim 9 , wherein the method further comprises: presenting, based on the usefulness labels of the comment information, the comment information in the set of comment information. 11. An apparatus for determining the usefulness of comment information, comprising: at least one processor; and a memory storing instructions, the instructions when executed by the at least one processor, cause the at least one processor to perform operations, the operations comprisin

Assignees

Inventors

Classifications

  • G06N3/08Primary

    Learning methods · CPC title

  • characterised by the incorporation of unlabelled data, e.g. multiple instance learning [MIL], semi-supervised techniques using expectation-maximisation [EM] or naïve labelling · CPC title

  • Probabilistic or stochastic networks · CPC title

  • Selection of the most significant subset of features · CPC title

  • Combinations of networks · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11521016B2 cover?
Embodiments of the present disclosure provide a method for generating an information assessment model, a method for determining the usefulness of comment information, apparatus, electronic device, and computer-readable medium. The method may include: acquiring training samples, the training samples including first sample comment information with a usefulness label and second sample comment info…
Who is the assignee on this patent?
Baidu online network technology beijing co ltd
What technology area does this patent fall under?
Primary CPC classification G06N3/08. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 06 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).