Systems and methods for controllable text summarization

US11934781B2 · US · B2

Patent metadata
Field: Value
Publication number: US-11934781-B2
Application number: US-202017125468-A
Country: US
Kind code: B2
Filing date: Dec 17, 2020
Priority date: Aug 28, 2020
Publication date: Mar 19, 2024
Grant date: Mar 19, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments described herein provide a flexible controllable summarization system that allows users to control the generation of summaries without manually editing or writing the summary, e.g., without the user actually adding or deleting certain information under various granularity. Specifically, the summarization system performs controllable summarization through keywords manipulation. A neural network model is learned to generate summaries conditioned on both the keywords and source document so that at test time a user can interact with the neural network model through a keyword interface, potentially enabling multi-factor control.
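The keyword interface described in the abstract can be pictured with a minimal sketch. Everything named here is an assumption for illustration only: the `<sep>` token, the function names, and the plain string concatenation are hypothetical and are not taken from the patent text shown on this page. The sketch only shows how user edits to an extracted keyword list could be turned into a single input that a model conditions on together with the source document.

```python
from typing import Iterable, List

# Hypothetical separator; the page does not name the special token.
SEP_TOKEN = "<sep>"


def apply_user_control(
    extracted: List[str],
    add: Iterable[str] = (),
    remove: Iterable[str] = (),
) -> List[str]:
    """Return the keyword list after user edits.

    User-added keywords come first, removed keywords are filtered out,
    and case-insensitive duplicates are dropped while order is preserved.
    """
    removed = {kw.lower() for kw in remove}
    seen = set()
    result: List[str] = []
    for kw in list(add) + list(extracted):
        key = kw.lower()
        if key in removed or key in seen:
            continue
        seen.add(key)
        result.append(kw)
    return result


def build_model_input(keywords: List[str], document: str) -> str:
    """Concatenate the keywords and the source document into one input string."""
    return f"{' '.join(keywords)} {SEP_TOKEN} {document}"


if __name__ == "__main__":
    keywords = apply_user_control(
        ["model", "summary", "keywords"],
        add=["Salesforce"],
        remove=["summary"],
    )
    print(build_model_input(keywords, "Full text of the source document."))
```

In this sketch the user never edits the summary itself; changing the keyword list is the only control surface, which is the idea the abstract describes.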

First claim

Opening claim text (preview).

What is claimed is:

1. A method of controllable text summarization of a document, the method comprising: receiving, at a communication interface, an input text document; extracting, via a language model that is trained by a training dataset of a plurality of documents and a plurality of corresponding keywords, one or more keywords by sequence labeling the one or more keywords from the input text document; receiving, via a user interface, a control token sequence and one or more control parameters relating to a characteristic of a summary to be generated, wherein the characteristic of the summary includes any of: an entity mentioned in the input text document, a target length of the summary, and a type of the input text document, and wherein the one or more control parameters comprises a prompt corresponding to the type of the input text document; modifying the one or more keywords based on the received control token sequence; and generating, by the language model, the summary for the input text document based on the modified one or more keywords according to the one or more control parameters.

2. The method of claim 1, wherein the prompt is selected from the group of: a first summary prefix that summarizes contributions of a research paper; a second summary prefix that summarizes invention purpose of a patent document; and a third summary prefix that summarizes the input text document in a guided question and answer format.

3. The method of claim 1, further comprising: generating a first set of modified keywords and a first control parameter from the received control token sequence; generating, by the language model, a first version of the summary for the input text document based on the first set of modified keywords according to the first control parameter; generating a second set of modified keywords and a second control parameter from the received control token sequence; and generating, by the language model, a second version of the summary for the input text document based on the second set of modified keywords according to the second control parameter.

4. The method of claim 1, wherein the language model is trained by: prepending a keyword sequence to a training source document separated with a special token; inputting the training source document with the keyword sequence to the language model; generating, by the language model, an output summary; and updating the language model by maximizing a conditional probability of an output summary conditioned on the training source document and the keyword sequence.

5. The method of claim 4, further comprising: randomly dropping a subset of keywords from the keyword sequence during training.

6. A system of controllable text summarization of a document, the system comprising: a communication interface that receives an input text document; a memory that stores a language model that is trained by a training dataset of a plurality of documents and a plurality of corresponding keywords; and one or more hardware processors that: extracts, via the language model, one or more keywords by sequence labeling the one or more keywords from the input text document; receives, via the communication interface, a control token sequence and one or more control parameters relating to a characteristic of a summary to be generated, wherein the characteristic of the summary includes any of: an entity mentioned in the input text document, a target length of the summary, and a type of the input text document, and wherein the one or more control parameters comprises a prompt corresponding to the type of the input text document; modifies the one or more keywords based on the received control token sequence; and generates, by the language model, the summary for the input text document based on the modified one or more keywords according to the one or more control parameters.

7. The system of claim 6, wherein the prompt is selected from the group of: a first summary prefix that summarizes contributions of a research paper; a second summary prefix that summarizes invention purpose of a patent document; and a third summary prefix that summarizes the input text document in a guided question and answer format.

8. The system of claim 6, wherein the one or more hardware processors further: generates a first set of modified keywords and a first control parameter from the received control token sequence; generates, by the language model, a first version of the summary for the input text document based on the first set of modified keywords according to the first control parameter; generates a second set of modified keywords and a second control parameter from the received control token sequence; and generates, by the language model, a second version of the summary for the input text document based on the second set of modified keywords according to the second control parameter.

9. The system of claim 6, wherein the language model is trained by: prepending a keyword sequence to a training source document separated with a special token; inputting the training source document with the keyword sequence to the language model; generating, by the language model, an output summary; and updating the language model by maximizing a conditional probability of an output summary conditioned on the training source document and the keyword sequence.

10. The system of claim 9, wherein the one or more hardware processors further: randomly drops a subset of keywords from the keyword sequence during training.

11. A non-transitory processor-readable medium storing a plurality of processor-executable instructions for controllable text summarization of a document, the instructions being executed by one or more processors to perform operations comprising: receiving, at a communication interface, an input text document; extracting, via a language model that is trained by a training dataset of a plurality of documents and a plurality of corresponding keywords, one or more keywords by sequence labeling the one or more keywords from the input text document; receiving, via a user interface, a control token sequence and one or more control parameters relating to a characteristic of a summary to be generated, wherein the characteristic of the summary includes any of: an entity mentioned in the input text document, a target length of the summary, and a type of the input text document, and wherein the one or more control parameters comprises a prompt corresponding to the type of the input text document; modifying the one or more keywords based on the received control token sequence; and generating, by the language model, the summary for the input text document based on the modified one or more keywords according to the one or more control parameters.

12. The non-transitory processor-readable of claim 11, wherein the prompt is selected from the group of: a first summary prefix that summarizes contributions of a research paper; a second summary prefix that summarizes invention purpose of a patent document; and a third summary prefix that summarizes the input text document in a guided question and answer format.

13. The non-transitory processor-readable of claim 11, wherein the operations further comprise: generating a first set of modified keywords and a first control parameter from the received control token sequence; generating, by the language model, a first version of the summary for the input text document based on the first set of modified keywords according to the first control parameter; generating a second set of modified keywords and a second control parameter from the received control token sequence; and generatin
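The training steps recited in claims 4 and 5 (prepending a keyword sequence to the source document with a special token, and randomly dropping a subset of keywords) can be sketched as below. This is an illustrative sketch under stated assumptions, not the patented implementation: the `<sep>` token, the per-keyword drop probability, and the function names are hypothetical, and the model update step (maximizing the conditional probability of the summary given the document and keywords) is omitted.

```python
import random
from typing import List, Optional

# Hypothetical special token separating keywords from the document.
SEP_TOKEN = "<sep>"


def drop_keywords(keywords: List[str], drop_prob: float, rng: random.Random) -> List[str]:
    """Independently drop each keyword with probability drop_prob, keeping order."""
    return [kw for kw in keywords if rng.random() >= drop_prob]


def make_training_source(
    keywords: List[str],
    document: str,
    drop_prob: float = 0.2,  # assumed value; the claims do not give a rate
    rng: Optional[random.Random] = None,
) -> str:
    """Build one training input: surviving keywords, the separator, then the document."""
    if rng is None:
        rng = random.Random()
    kept = drop_keywords(keywords, drop_prob, rng)
    return f"{' '.join(kept)} {SEP_TOKEN} {document}".strip()


if __name__ == "__main__":
    rng = random.Random(0)  # seeded so repeated runs give the same result
    print(make_training_source(["keyword", "control", "summary"], "Source text.", rng=rng))
```

One plausible reading of the dropout step is that it keeps the model from relying on a complete keyword list, since a user may later supply only a few keywords; the page itself does not state the motivation.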

Assignees

Salesforce Inc

Inventors

Classifications

  • G06F40/284 · Primary

    Lexical analysis, e.g. tokenisation or collocates · CPC title

  • Probabilistic graphical models, e.g. probabilistic networks · CPC title

  • Interaction with page-structured environments, e.g. book metaphor · CPC title

  • G06F16/345 · Primary

    Summarisation for human users · CPC title

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11934781B2 cover?
Embodiments described herein provide a flexible controllable summarization system that allows users to control the generation of summaries without manually editing or writing the summary, e.g., without the user actually adding or deleting certain information under various granularity. Specifically, the summarization system performs controllable summarization through keywords manipulation. A neu…
Who is the assignee on this patent?
Salesforce Inc
What technology area does this patent fall under?
Primary CPC classification G06F40/284. Mapped technology areas include Physics.
When was this patent published?
Publication date Mar 19, 2024 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).