Implementing text generation

US11675983B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11675983-B2
Application numberUS-202117331526-A
CountryUS
Kind codeB2
Filing dateMay 26, 2021
Priority dateDec 22, 2020
Publication dateJun 13, 2023
Grant dateJun 13, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for implementing text generation, a device and a medium are provided. The method includes: determining a target task type of a target text generation task from multiple task types supported by a pre-trained general text generation model; determining, based on a requirement of the target text generation task for a target output text, a first target output text attribute for the target text generation task from multiple output text attributes supported by the general text generation model; and fine tuning the general text generation model based on a target training data set associated with the target text generation task to obtain a task-specific text generation model, by taking task indication information for the target task type and first attribute indication information for the first target output text attribute as at least part of an input of the general text generation model.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method, comprising: determining a target task type of a target text generation task from multiple task types supported by a general text generation model, wherein the general text generation model is pre-trained; determining, based on a requirement of the target text generation task for a target output text, at least one first target output text attribute for the target text generation task from multiple output text attributes supported by the general text generation model; determining second attribute indication information indicating a second target output text attribute for the target text generation task; and fine tuning the general text generation model based on a target training data set associated with the target text generation task to obtain a task-specific text generation model for the target text generation task, by: inputting task indication information for the target task type and first attribute indication information for the at least one first target output text attribute to a first encoder in the general text generation model; inputting the second attribute indication information to a second encoder in the general text generation model; and generating an output text based on an intermediate output of the first encoder and an intermediate output of the second encoder, wherein the target training data set is an additional data set other than pre-training data sets for pre-training the general text generation model. 2. The method according to claim 1 , wherein the general text generation model is pre-trained based on multiple pre-training data sets respectively comprising training input texts and training output texts associated with corresponding task types in the multiple task types. 3. The method according to claim 2 , wherein a training output text in at least one pre-training data set of the multiple pre-training data sets is labeled with different attribute values of at least one output text attribute of the multiple output text attributes. 4. The method according to claim 1 , wherein fine tuning the general text generation model further comprises: obtaining an attribute control model configured to control the decoder to output the target output text having a third target output text attribute based on the intermediate output of the first encoder and the intermediate output of the second encoder, wherein the third target output text attribute is not included in the multiple output text attributes; and fine tuning the attribute control model and the general text generation model jointly. 5. The method according to claim 4 , wherein the attribute control model comprises at least one of: an attribute classification model configured to determine, based on the intermediate output of the first encoder and the intermediate output of the second encoder, guidance information for guiding the decoder to generate the target output text having the third target output text attribute, wherein the guidance information indicates different attribute values of the third target output text attribute; and a language model configured to, in conjunction with the decoder, decode the intermediate output of the first encoder and the intermediate output of the second encoder to generate the target output text having the third target output text attribute. 6. The method according to claim 1 , wherein the target text generation task is a first target text generation task, and the method further comprises: determining a second target task type of a second target text generation task from the multiple task types; determining, based on a requirement of the second target text generation task for a target output text, a further target output text attribute for the second target text generation task from the multiple output text attributes; and fine tuning the general text generation model based on a second target training data set associated with the second target text generation task to obtain a further task-specific text generation model for the second target text generation task, by taking second task indication information for the second target task type and further attribute indication information for the further target output text attribute as at least part of an input of the general text generation model. 7. The method according to claim 1 , further comprising: obtaining a target input text for the target text generation task; applying the target input text, the task indication information and the first attribute indication information as an input of the task-specific text generation model; and executing the task-specific text generation model to obtain an output of the task-specific text generation model as the target output text of the target text generation task. 8. An electronic device, comprising: at least one processor; and a memory communicatively connected with the at least one processor, wherein the memory stores instructions executable by the at least one processor to cause the at least one processor to: determine a target task type of a target text generation task from multiple task types supported by a general text generation model, wherein the general text generation model is pre-trained; determine at least one first target output text attribute for the target text generation task from multiple output text attributes supported by the general text generation model based on a requirement of the target text generation task for a target output text; determine second attribute indication information indicating a second target output text attribute for the target text generation task; and fine tune the general text generation model based on a target training data set associated with the target text generation task to obtain a task-specific text generation model for the target text generation task, by: inputting task indication information for the target task type and first attribute indication information for the at least one first target output text attribute to a first encoder in the general text generation model; inputting the second attribute indication information to a second encoder in the general text generation model; and generating an output text based on an intermediate output of the first encoder and an intermediate output of the second encoder, wherein the target training data set is an additional data set other than pre-training data sets for pre-training the general text generation model. 9. The electronic device according to claim 8 , wherein the general text generation model is pre-trained based on multiple pre-training data sets respectively comprising training input texts and training output texts associated with corresponding task types in the multiple task types. 10. The electronic device according to claim 9 , wherein a training output text in at least one pre-training data set of the multiple pre-training data sets is labeled with different attribute values of at least one output text attribute of the multiple output text attributes. 11. The electronic device according to claim 8 , wherein fine tuning the general text generation model comprises: obtaining an attribute control model configured to control the decoder to output the target output text having a third target output text attribute based on the intermediate output of the first encoder and the intermediate output of the second encoder, wherein the third target output text attribute is not included in the multiple output text attributes; and fine tuning the attribute control model and the general text generation model jointly. 12. The electronic device according to claim 11 , wherein the attribute control model comprises at least on

Assignees

Inventors

Classifications

  • Tagging; Marking up (details of markup languages G06F40/143); Designating a block; Setting of attributes (style sheets, e.g. eXtensible Stylesheet Language Transformation [XSLT], G06F40/154) · CPC title

  • Semantic analysis · CPC title

  • G06F40/56Primary

    Natural language generation · CPC title

  • Character encoding · CPC title

  • Machine learning · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11675983B2 cover?
A method for implementing text generation, a device and a medium are provided. The method includes: determining a target task type of a target text generation task from multiple task types supported by a pre-trained general text generation model; determining, based on a requirement of the target text generation task for a target output text, a first target output text attribute for the target t…
Who is the assignee on this patent?
Beijing Baidu Netcom Sci & Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification G06F40/56. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jun 13 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 5 related publications on this page (citations in our corpus or others sharing the same primary CPC).