Text style transfer using reinforcement learning

US11314950B2 · US · B2

Patent metadata
Field               Value
Publication number  US-11314950-B2
Application number  US-202016830106-A
Country             US
Kind code           B2
Filing date         Mar 25, 2020
Priority date       Mar 25, 2020
Publication date    Apr 26, 2022
Grant date          Apr 26, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A computer-implemented method is provided for transferring a target text style using Reinforcement Learning (RL). The method includes pre-determining, by a Long Short-Term Memory (LSTM) Neural Network (NN), the target text style of a target-style natural language sentence. The method further includes transforming, by a hardware processor using the LSTM NN, a source-style natural language sentence into the target-style natural language sentence that maintains the target text style of the target-style natural language sentence. The method also includes calculating an accuracy rating of a transformation of the source-style natural language sentence into the target-style natural language sentence based upon rewards relating to at least the target text style of the source-style natural language sentence.
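The three steps in the abstract (transform, reward, rate) can be illustrated with a small sketch. Everything in it is a hypothetical stand-in and not the patented implementation: a lookup table plays the role of the LSTM generator, a lexicon check plays the role of the style classifier, and the "accuracy rating" is assumed to be a weighted mean of whatever rewards are supplied. The names `transform`, `style_reward`, `accuracy_rating`, and `INFORMAL_TO_FORMAL` are invented for this example.

```python
from typing import Dict

# Hypothetical stand-in for the LSTM generator: a word-level lookup table.
INFORMAL_TO_FORMAL: Dict[str, str] = {
    "gonna": "going to",
    "wanna": "want to",
    "thx": "thank you",
}


def transform(sentence: str) -> str:
    """Toy 'generator': rewrite informal tokens into formal ones."""
    tokens = sentence.lower().split()
    return " ".join(INFORMAL_TO_FORMAL.get(tok, tok) for tok in tokens)


def style_reward(sentence: str) -> float:
    """Toy 'style classifier': fraction of tokens that are not informal."""
    tokens = sentence.lower().split()
    if not tokens:
        return 0.0
    informal = sum(tok in INFORMAL_TO_FORMAL for tok in tokens)
    return 1.0 - informal / len(tokens)


def accuracy_rating(rewards: Dict[str, float], weights: Dict[str, float]) -> float:
    """Assumed aggregation: weighted mean of the named rewards."""
    total_weight = sum(weights[name] for name in rewards)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    return sum(weights[name] * value for name, value in rewards.items()) / total_weight


source = "thx i am gonna call you"
target = transform(source)
rating = accuracy_rating({"style": style_reward(target)}, {"style": 1.0})
print(target, rating)
```

Because the rating is a weighted mean over a dictionary, additional rewards can be added by extending the two dictionaries without changing the aggregation code. The weighting scheme itself is an assumption of this sketch.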

First claim

Opening claim text (preview).

The invention claimed is:

1. A computer-implemented method for transferring a target text style using Reinforcement Learning (RL), comprising: pre-determining, by a Long Short-Term Memory (LSTM) Neural Network (NN), the target text style of a target-style natural language sentence; transforming, by a hardware processor using the LSTM NN, a source-style natural language sentence into the target-style natural language sentence that maintains the target text style of the target-style natural language sentence; and calculating an accuracy rating of a transformation of the source-style natural language sentence into the target-style natural language sentence based upon rewards relating to at least the target text style of the source-style natural language sentence, wherein the rewards comprise style rewards which are determined using a style classifier built upon a bidirectional recurrent neural network with an attention mechanism.

2. The computer-implemented method of claim 1, wherein the rewards further comprise, semantic rewards and fluency rewards, and wherein said calculating step comprises evaluating the target-style natural language sentence with respect to content preservation, target text style, and fluency using the style rewards, the semantic rewards, and the fluency rewards, respectively.

3. The computer-implemented method of claim 2, wherein the semantic rewards are determining using a semantic module configured to determine a Word Mover's Distance (WMD) as an embedding-based similarity metric calculated as a sum of distances between co-occurring words in the source-style natural language sentence relative to the target-style natural language sentence.

4. The computer-implemented method of claim 2, wherein the fluency rewards are determined using a Recurrent Neural Network (RNN)-based language model.

5. The computer-implemented method of claim 1, wherein the style classifier is pre-trained on a source and target corpus in style classification.

6. The computer-implemented method of claim 1, wherein the style classifier is adversarially trained on target-style natural language sentences generated by said transforming step.

7. The computer-implemented method of claim 1, further comprising guaranteeing, by a LSTM-based style discriminator, a style transfer strength of the transformation above a threshold amount.

8. The computer-implemented method of claim 1, further comprising guaranteeing, by a sentence discriminator, a content preservation between the source-style natural language sentence and the target-style natural language sentence.

9. The computer-implemented method of claim 1, further comprising guaranteeing, by a Recurrent Neural Network (RNN)-based language model, a fluency of the target-style natural language sentence.

10. The computer-implemented method of claim 1, guiding, based on the rewards, the generator in performing a subsequent transformation that achieves a higher accuracy rating, responsive to the accuracy rating being below a threshold amount.

11. The computer-implemented method of claim 1, repeating said guiding step until the accuracy rating is equal to or greater than the threshold amount.

12. A computer program product for transferring a target text style using Reinforcement Learning (RL), the computer program product comprising a non-transitory computer readable storage medium having program instructions embodied therewith, the program instructions executable by a computer to cause the computer to perform a method comprising: pre-determining, by a Long Short-Term Memory (LSTM) Neural Network (NN), the target text style of a target-style natural language sentence; transforming, by the LSTM NN, a source-style natural language sentence into the target-style natural language sentence that maintains the target text style of the target-style natural language sentence; and calculating an accuracy rating of a transformation of the source-style natural language sentence into the target-style natural language sentence based upon rewards relating to at least the target text style of the source-style natural language sentence, wherein the rewards comprise style rewards which are determined using a style classifier built upon a bidirectional recurrent neural network with an attention mechanism.

13. The computer program product of claim 12, wherein the rewards further comprise semantic rewards and fluency rewards, and wherein said calculating step comprises evaluating the target-style natural language sentence with respect to content preservation, target style, and fluency using the style rewards, the semantic rewards, and the fluency rewards, respectively.

14. The computer program product of claim 12, further comprising guaranteeing, by a LSTM-based style discriminator, a style transfer strength of the transformation above a threshold amount.

15. The computer program product of claim 12, further comprising guaranteeing, by a sentence discriminator, a content preservation between the source-style natural language sentence and the target-style natural language sentence.

16. The computer program product of claim 12, further comprising guaranteeing, by a Recurrent Neural Network (RNN)-based language model, a fluency of the target-style natural language sentence.

17. The computer program product of claim 12, further comprising guiding, based on the rewards, the generator in performing a subsequent transformation that achieves a higher accuracy rating, responsive to the accuracy rating being below a threshold amount.

18. The computer program product of claim 17, further comprising repeating said guiding step until the accuracy rating is equal to or greater than the threshold amount.

19. A computer processing system for transferring a target text style using Reinforcement Learning (RL), comprising: a memory device including program code stored thereon; a hardware processor, operatively coupled to the memory device, and configured to run the program code stored on the memory device to pre-determine, using a Long Short-Term Memory (LSTM) Neural Network (NN), the target text style of a target-style natural language sentence; transform, using the LSTM NN, a source-style natural language sentence into the target-style natural language sentence that maintains the target text style of the target-style natural language sentence; and calculate an accuracy rating of a transformation of the source-style natural language sentence into the target-style natural language sentence based upon rewards relating to at least the target text style of the source-style natural language sentence, wherein the rewards comprise style rewards which are determined using a style classifier built upon a bidirectional recurrent neural network with an attention mechanism.
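Claim 3 bases the semantic reward on a Word Mover's Distance between the source and output sentences. The sketch below does not compute the full WMD, which requires solving an optimal transport problem. It swaps in the simpler relaxed variant, a known lower bound in which each word's weight moves entirely to its nearest word in the other sentence. The 2-D embeddings, the function names, and the mapping from distance to a reward in (0, 1] are all assumptions made for illustration.

```python
import math
from collections import Counter
from typing import Dict, List, Tuple

# Hypothetical 2-D word embeddings, invented for illustration only.
EMBEDDINGS: Dict[str, Tuple[float, float]] = {
    "hi": (0.0, 0.0),
    "hello": (0.0, 1.0),
    "friend": (3.0, 0.0),
    "buddy": (3.0, 1.0),
}


def _one_sided(src: List[str], dst: List[str]) -> float:
    """Move each src word's normalized weight to its nearest dst word."""
    counts = Counter(src)
    total = sum(counts.values())
    dst_vocab = set(dst)
    return sum(
        (count / total)
        * min(math.dist(EMBEDDINGS[word], EMBEDDINGS[other]) for other in dst_vocab)
        for word, count in counts.items()
    )


def relaxed_wmd(a: List[str], b: List[str]) -> float:
    """Relaxed WMD: the larger of the two one-sided nearest-word costs."""
    if not a or not b:
        raise ValueError("both sentences must contain at least one word")
    return max(_one_sided(a, b), _one_sided(b, a))


def semantic_reward(source: List[str], output: List[str]) -> float:
    """Assumed mapping from distance to reward: identical sentences score 1.0."""
    return 1.0 / (1.0 + relaxed_wmd(source, output))


informal = ["hi", "buddy"]
formal = ["hello", "friend"]
print(relaxed_wmd(informal, formal), semantic_reward(informal, formal))
```

With these toy vectors each word sits one unit from its closest counterpart, so the relaxed distance is 1.0 and the reward is 0.5. A real system would use trained word embeddings in place of the hand-written table.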

Assignees

IBM, Univ Illinois

Inventors

Classifications

  • Probabilistic or stochastic networks

  • Combinations of networks

  • Recurrent networks, e.g. Hopfield networks

  • Generative networks

  • Auto-encoder networks; Encoder-decoder networks

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11314950B2 cover?
A computer-implemented method is provided for transferring a target text style using Reinforcement Learning (RL). The method includes pre-determining, by a Long Short-Term Memory (LSTM) Neural Network (NN), the target text style of a target-style natural language sentence. The method further includes transforming, by a hardware processor using the LSTM NN, a source-style natural language senten…
Who is the assignee on this patent?
IBM, Univ Illinois
What technology area does this patent fall under?
Primary CPC classification G06F40/253. Mapped technology areas include Physics.
When was this patent published?
Publication date Apr 26, 2022 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 10 related publications on this page (citations in our corpus or others sharing the same primary CPC).