Methods and systems for text sequence style transfer by two encoder decoders

US11501159B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11501159-B2
Application numberUS-201916365637-A
CountryUS
Kind codeB2
Filing dateMar 26, 2019
Priority dateMar 26, 2019
Publication dateNov 15, 2022
Grant dateNov 15, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for text sequence style transfer by two encoder-decoders, including generating, by a first encoder-decoder network model, an output sequence based on a first input sequence and an input sequence style, wherein the output sequence is associated with a second sequence, generating, by a second-encoder decoder network model, a prediction of the first input sequence based on the first input sequence, the output sequence, and a first input sequence style associated with the first input sequence, generating, by a classifier, a prediction of the first input sequence style based on the prediction of the first input sequence, and updating the neural network model based on comparisons between the output sequence and the second sequence, between the prediction of the first input sequence and the first input sequence, and between the prediction of the first input sequence style and the first input sequence style.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for training a neural network model comprising: generating, by a first encoder-decoder network model, an output sequence based on a first input sequence and an input sequence style, wherein the output sequence is associated with a second sequence; generating, by a second-encoder decoder network model, a prediction of the first input sequence based on the first input sequence, the output sequence, and a first input sequence style associated with the first input sequence; generating, by a classifier, a prediction of the first input sequence style based on the prediction of the first input sequence; and updating the neural network model based on comparisons between the output sequence and the second sequence, between the prediction of the first input sequence and the first input sequence, and between the prediction of the first input sequence style and the first input sequence style. 2. The method of claim 1 , wherein the neural network model is a sequence to sequence neural network model. 3. The method of claim 1 , wherein the first encoder-decoder network model and the second encoder-decoder network model each includes at least two recursive neural network models. 4. The method of claim 3 , wherein each of the at least two recursive neural network models include one or more long short-term memory models. 5. The method of claim 1 , wherein updating the neural network model based on comparisons between the output sequence and the second sequence, between the prediction of the first input sequence and the first input sequence, and between the prediction of the first input sequence style and the first input sequence style comprises: determining a first loss based on a comparison between the output sequence and the second sequence; determining a second loss based on a comparison between the prediction of the first input sequence and the first input sequence; determining a third loss based on a comparison between the prediction of the first input sequence style and the first input sequence style; and determining a total loss based on the first, second, and third losses. 6. The method of claim 1 , further comprising updating parameters of the neural network model based on the total loss. 7. The method of claim 1 , wherein the first input sequence, the second sequence, and the first input sequence style are known prior to the training. 8. A method for text sequence style transfer by two encoder decoders of a trained neural network model comprising: generating, by a trained first encoder-decoder network model, an output sequence based on a first input sequence and a first input sequence style, wherein the output sequence is associated with a second sequence; and generating, by a trained second encoder-decoder network model, a target sequence based on the first input sequence, the output sequence, and a target sequence style associated with the target sequence. 9. The method of claim 8 , wherein the trained neural network model is a sequence to sequence neural network model. 10. The method of claim 8 , wherein the trained first encoder-decoder network model and the trained second encoder-encoder network model each includes at least two recursive neural network models. 11. The method of claim 10 , wherein each of the at least two recursive neural network models include one or more long short-term memory models. 12. A system for training a neural network model comprising: a first encoder-decoder network model configured to generate an output sequence based on a first input sequence and an input sequence style, wherein the output sequence is associated with a second sequence; a second encoder-decoder network model configured to generate a prediction of the first input sequence based on the first input sequence, the output sequence, and a first input sequence style associated with the first input sequence; and a classifier configured to generate a prediction of the first input sequence style based on the prediction of the first input sequence; wherein the neural network model is updated based on comparisons between the output sequence and the second sequence, between the prediction of the first input sequence and the first input sequence, and between the prediction of the first input sequence style and the first input sequence style. 13. The system of claim 12 , wherein the neural network model is a sequence to sequence neural network model. 14. The system of claim 12 , wherein the first encoder-decoder network model and the second encoder-decoder network model each includes at least two recursive neural network models. 15. The system of claim 14 , wherein the at least two recursive neural network models include one or more long short-term memory models. 16. The system of claim 12 , wherein updating the neural network model based on comparisons between the output sequence and the second sequence, between the prediction of the first input sequence and the first input sequence, and between the prediction of the first input sequence style and the first input sequence style comprises: determining a first loss based on a comparison between the output sequence and the second sequence; determining a second loss based on a comparison between the prediction of the first input sequence and the first input sequence; determining a third loss based on a comparison between the prediction of the first input sequence style and the first input sequence style; and determining a total loss based on the first, second, and third losses. 17. The system of claim 12 , wherein updating the neural network model based on comparisons between the output sequence and the second sequence, between the prediction of the first input sequence and the first input sequence, and between the prediction of the first input sequence style and the first input sequence style further comprises: updating the parameters of the neural network model based on the total loss. 18. The system of claim 12 , wherein the first input sequence, the second sequence, and the first input sequence style are known prior to the training.

Assignees

Inventors

Classifications

  • G06N3/08Primary

    Learning methods · CPC title

  • Combinations of networks · CPC title

  • Recurrent networks, e.g. Hopfield networks · CPC title

  • using electronic means · CPC title

  • Physics · mapped topic

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11501159B2 cover?
A method for text sequence style transfer by two encoder-decoders, including generating, by a first encoder-decoder network model, an output sequence based on a first input sequence and an input sequence style, wherein the output sequence is associated with a second sequence, generating, by a second-encoder decoder network model, a prediction of the first input sequence based on the first input…
Who is the assignee on this patent?
Alibaba Group Holding Ltd
What technology area does this patent fall under?
Primary CPC classification G06N3/08. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Nov 15 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 11 related publications on this page (citations in our corpus or others sharing the same primary CPC).