Generative text summarization system and method

US11562144B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11562144-B2
Application numberUS-202016819655-A
CountryUS
Kind codeB2
Filing dateMar 16, 2020
Priority dateMar 16, 2020
Publication dateJan 24, 2023
Grant dateJan 24, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A generative automatic text summarization system and method is disclosed that may adopt a search and reranking strategy to improve the performance of a summarization task. The system and method may employ a transformer neural model to assist with the summarization task. The transformer neural model may be trained to learn human abstracts and may then be operable to generate abstractive summaries. With multiple summary hypothesis generated, a best-first search algorithm and reranking algorithm may be employed to select the best candidate summary as part of the output summary.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for a generative text summarization model, comprising: receiving an input text dataset; enlarging a search space for one or more candidate words to be selected for inclusion in a text summary, wherein the one or more candidate words included within the search space are ranked using a best-first search algorithm; and re-ranking the one or more candidate words to be included in the text summary using a soft-bound word-reward (SBWR) algorithm, wherein the SBWR algorithm applies a diminishing reward value to the one or more candidate words when the text summary exceeds a predicated length threshold, and wherein the SBWR algorithm applies an increased reward value to the one or more candidate words when the text summary is below the predicated length threshold. 2. The method of claim 1 , wherein the SBWR algorithm selects the one or more candidate words when the text summary is equivalent to the predicated length threshold. 3. The method of claim 1 , wherein the SBWR algorithm operates using the following formula: S ˆ sbwr ( x , y ) = S ⁡ ( x , y ) + r ⁢ ∑ i = 1 ❘ "\[LeftBracketingBar]" y ❘ "\[RightBracketingBar]" σ ⁡ ( ℒ pred - i ) . 4. The method of claim 1 further comprising: smoothing the diminishing reward value and the increased reward value using a sigmoid function. 5. The method of claim 1 further comprising: scaling the diminishing reward value and the increased reward value using a value that is trained to select the one or more candidate words to be included in the text summary. 6. The method of claim 1 , re-ranking the one or more candidate words when the input text dataset exceeds a predefined length threshold. 7. The method of claim 1 further comprising: calculating a BP normalization that applies a penalty to the one or more candidate words that do not meet the predicated length threshold. 8. The method of claim 7 , wherein the BP normalization is calculated by adding a logarithmic value of a brevity penalty with a length normalization scoring function. 9. The method of claim 8 , wherein the brevity penalty is designed so that the generative text summarization model does not produce a short translation from the input text dataset. 10. The method of claim 8 , wherein the brevity penalty includes a copy rate value that reduces the brevity penalty toward zero. 11. The method of claim 1 further comprising: training the generative text summarization model using a transformer neural model. 12. The method of claim 11 , wherein the transformer neural model includes an encoder machine learning algorithm and a decoder machine learning algorithm. 13. The method of claim 12 further comprising: inputting the input text dataset to the encoder machine learning algorithm; and inputting a target summary text dataset to the decoder machine learning algorithm. 14. The method of claim 13 , wherein the transformer neural model determines a probability value for one or more target summary tokens using one or more source tokens. 15. The method of claim 14 , wherein the transformer neural model determines the probability value for the one or more target summary tokens using the one or more source tokens based on the following equation: P ⁡ ( y | x ) = ∏ j = 1 ❘ "\[LeftBracketingBar]" y ❘ "\[RightBracketingBar]" P ⁡ ( y j | y < j , x ) . 16. A system operable to employ a generative text summarization model, comprising: a memory operable to store an input text dataset; a processor operable to: enlarge a search space for one or more candidate words to be selected for inclusion in a text summary, wherein the one or more candidate words included within the search space are ranked using a best-first search algorithm; and re-rank the one or more candidate words to be included in the text summary using a soft-bound word-reward (SBWR) algorithm, wherein the SBWR algorithm applies a diminishing reward value to the one or more candidate words when the text summary exceeds a predicated length threshold, and wherein the SBWR algorithm applies an increased reward value to the one or more candidate words when the text summary is below the predicated length threshold. 17. The system of claim 16 , wherein the SBWR algorithm selects the one or more candidate words when the text summary is equivalent to the predicated length threshold. 18. The system of claim 16 , wherein the processor is further operable to: smooth the diminishing reward value and the increased reward value using a s

Assignees

Inventors

Classifications

  • Physics · mapped topic

  • using ranking · CPC title

  • Non-supervised learning, e.g. competitive learning · CPC title

  • G06F40/30Primary

    Semantic analysis · CPC title

  • G06F40/56Primary

    Natural language generation · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11562144B2 cover?
A generative automatic text summarization system and method is disclosed that may adopt a search and reranking strategy to improve the performance of a summarization task. The system and method may employ a transformer neural model to assist with the summarization task. The transformer neural model may be trained to learn human abstracts and may then be operable to generate abstractive summarie…
Who is the assignee on this patent?
Bosch Gmbh Robert
What technology area does this patent fall under?
Primary CPC classification G06F40/30. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 24 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).