Method and device for evaluating comment quality, and computer readable storage medium
US-2020278976-A1 · Sep 3, 2020 · US
US11521016B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11521016-B2 |
| Application number | US-201916700593-A |
| Country | US |
| Kind code | B2 |
| Filing date | Dec 2, 2019 |
| Priority date | Jun 20, 2019 |
| Publication date | Dec 6, 2022 |
| Grant date | Dec 6, 2022 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Embodiments of the present disclosure provide a method for generating an information assessment model, a method for determining the usefulness of comment information, apparatus, electronic device, and computer-readable medium. The method may include: acquiring training samples, the training samples including first sample comment information with a usefulness label and second sample comment information without a usefulness label; acquiring a predictor model and a discriminator model respectively constructed based on a generative network and a discrimination network in a generative adversarial network, and pre-training the predictor model using the first sample comment information, the predictor model being used to predict a usefulness label of a piece of comment information, the discriminator model being used to discriminate authenticity of a usefulness label; and training the predictor model and the discriminator model by iteratively performing a plurality of times of training operations, using the trained predictor model as an information assessment model.
Opening claim text (preview).
What is claimed is: 1. A method for generating an information assessment model, comprising: acquiring training samples, the training samples comprising first sample comment information with a usefulness label and second sample comment information without a usefulness label; acquiring a predictor model and a discriminator model respectively constructed based on a generative network and a discrimination network in a generative adversarial network, and pre-training the predictor model using the first sample comment information, the predictor model being used to predict a usefulness label of a piece of comment information, the discriminator model being used to discriminate authenticity of a usefulness label; and training the predictor model and the discriminator model by iteratively performing a plurality times of training operations, using the trained predictor model as an information assessment model; wherein a training operation comprises: predicting a usefulness label of the second sample comment information using the predictor model; using the usefulness label of the first sample comment information as a real label, using the usefulness label of the second sample comment information as a false label, discriminating authenticity of the usefulness label of the first sample comment information and authenticity of the usefulness label of the second sample comment information by using the discriminator model; iteratively updating, based on an error of the discrimination result of the discriminator model, a parameter of the discriminator model and a reward function, the reward function being associated with the parameter of the discriminator model and a parameter of the predictor model; determining a desired reward of the predictor model based on the reward function and the predictor model, and iteratively updating the parameter of the predictor model based on an error of the desired reward. 2. The method according to claim 1 , wherein the discriminating authenticity of the usefulness label of the first sample comment information and authenticity of the usefulness label of the second sample comment information by using the discriminator model comprises: selecting usefulness labels of an equal number of first sample comment information and second sample comment information, and discriminating authenticity of the selected usefulness labels by using the discriminator model. 3. The method according to claim 1 , wherein the error of the discrimination result of the discriminator model comprises a sum of an opposite number of a first cross entropy and an opposite number of a second cross entropy; the first cross entropy comprises: a cross entropy between a first probability distribution of that the usefulness label of the first sample comment information is a real label and a second probability distribution of that the discriminator model discriminates the usefulness label of the first sample comment information as a real label; the second cross entropy comprises: a cross entropy between a third probability distribution of that the usefulness label of the second sample comment information predicted by the predictor model is a real label and a fourth probability distribution of that the discriminator model discriminates the usefulness label of the second sample comment information as a real label. 4. The method according to claim 1 , wherein when an input of the reward function is the second sample comment information, an output is a result of authenticity discrimination by the discriminator model for the usefulness label of the second sample comment information; and the determining a desired reward of the predictor model based on the reward function and the predictor model, and iteratively updating the parameter of the predictor model based on an error of the desired reward comprises: multiplying the reward function by a prediction result of the predictor model to calculate the desired reward, and iteratively updating, in response to determining that the desired reward does not reach a preset reward value condition, the parameter of the predictor model. 5. The method according to claim 4 , wherein when the input of the reward function is the first sample comment information and the discriminator model discriminates the usefulness label of the first sample comment information as a real label, the value of the reward function is 1; when the input of the reward function is the first sample comment information and the discriminator model discriminates the usefulness label of the first sample comment information as a false label, the value of the reward function is 0. 6. The method according to claim 1 , wherein the first sample comment information further has a usefulness index label, the usefulness index label is used to characterize a degree of usefulness, and the predictor model is further used to predict a usefulness index label of a piece of comment information; the training operation further comprises: predicting a usefulness index label of the second sample comment information using the predictor model; and determining a prediction error of the predictor model for the usefulness index label; and the iteratively updating the parameter of the predictor model based on an error of the desired reward comprises: iteratively updating the parameter of the predictor model based on the error of the desired reward and the prediction error of the predictor model for the usefulness index label alternately. 7. The method according to claim 1 , wherein the method further comprises: acquiring comment object information corresponding to the training samples; and the predictor model comprises an information preprocessing network and a classification network; wherein the information preprocessing network is used to convert a training sample and comment object information corresponding thereto into a mathematical representation, and the classification network predicts usefulness label of the training sample based on the mathematical representation. 8. The method according to claim 7 , wherein the information preprocessing network comprises a first bi-directional long short-term memory network and a second bi-directional long short-term memory network; the first bi-directional long short-term memory network converts the training sample into a first vector representation, and the second bi-directional long short-term memory network converts the comment object information corresponding to the training sample into a second vector representation; the information preprocessing network integrates, based on an attention mechanism, the first vector representation of the training sample and the second vector representation of the comment object information corresponding thereto into the mathematical representation corresponding to the training sample. 9. A method for determining the usefulness of comment information, comprising: acquiring a set of comment information; and inputting the comment information in the set of comment information into the information assessment model generated by the method for generating an information assessment model according to claim 1 , to obtain usefulness labels of the comment information. 10. The method according to claim 9 , wherein the method further comprises: presenting, based on the usefulness labels of the comment information, the comment information in the set of comment information. 11. An apparatus for determining the usefulness of comment information, comprising: at least one processor; and a memory storing instructions, the instructions when executed by the at least one processor, cause the at least one processor to perform operations, the operations comprisin
Learning methods · CPC title
characterised by the incorporation of unlabelled data, e.g. multiple instance learning [MIL], semi-supervised techniques using expectation-maximisation [EM] or naïve labelling · CPC title
Probabilistic or stochastic networks · CPC title
Selection of the most significant subset of features · CPC title
Combinations of networks · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.