Systems and methods for neural clinical paraphrase generation
US-2019034416-A1 · Jan 31, 2019 · US
US10810482B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10810482-B2 |
| Application number | US-201615343987-A |
| Country | US |
| Kind code | B2 |
| Filing date | Nov 4, 2016 |
| Priority date | Aug 30, 2016 |
| Publication date | Oct 20, 2020 |
| Grant date | Oct 20, 2020 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
An apparatus and a method. The apparatus includes a plurality of long short term memory (LSTM) networks, wherein each of the plurality of LSTM networks is at a different network layer, wherein each of the plurality of LSTM networks is configured to determine a residual function, wherein each of the plurality of LSTM networks includes an output gate to control what is provided to a subsequent LSTM network, and wherein each of the plurality of LSTM networks includes at least one highway connection to compensate for the residual function of a previous LSTM network.
Opening claim text (preview).
What is claimed is: 1. An apparatus, comprising: a plurality of long short term memory (LSTM) networks, wherein each of the plurality of LSTM networks is at a different network layer; wherein each of the plurality of LSTM networks is configured to determine a residual function; wherein each of the plurality of LSTM networks includes an output gate to control what is provided to a subsequent LSTM network; and wherein each of the plurality of LSTM networks includes at least one highway connection to compensate for the residual function of a previous LSTM network and a projection matrix configured to control the at least one highway connection, wherein the projection matrix is further configured to control a ratio between a main path and a highway connection path. 2. The apparatus of claim 1 , wherein each LSTM network is further configured to: determine by a first function block f t L+1 =sigm(W xf L+1 x t L+1 +W hf L+1 h t−1 L+1 +W cf L+1 ⊙c t−1 L+1 +b f L+1 ); determine by a second function block i t L+1 =sigm(W xi L+1 x t L+1 +W hi L+1 h t−1 L+1 +W ci L+1 ⊙c t−1 L+1 +b i L+1 ); determine by a third function block j t L+1 =tanh(W xc L+1 x t L+1 +W hc L+1 h t−1 L+1 +b j L+1 ); determine by a fourth function block o t L+1 =sigm(W xo L+1 x t L+1 +W ho L+1 h t−1 L+1 +W co L+1 c t L+1 +b o L+1 ); determine by a fifth function block c t L+1 =f t L+1 ⊙c t−1 L+1 +i t L+1 ⊙j t L+1 ; and determine by a sixth function block h t L+1 as a function of c t L+1 , o t L+1 , and one of x t L+1 or ∑ i = 1 N W h , i L + 1 h t L + 1 - i , wherein x t L+1 is an input to the LSTM network, h t−1 L+1 is an output of a previous time in the LSTM network, c t−1 L+1 is a cell activation of the previous time in the LSTM network, W xf L+1 , W hf L+1 , W cf L+1 , W xi L+1 , W hi L+1 , W ci L+1 , W xc L+1 , W hc L+1 , W xo L+1 , W ho L+1 , and W co L+1 are weight matrices of the LSTM network, and b f L+1 , b i L+1 , b j L+1 , b o L+1 are pre-determined bias values of the LSTM network. 3. The apparatus of claim 2 , wherein f 1 and f 2 are each selected from a sigmoid function (sigm) and a hyperbolic tangent function (tanh). 4. The apparatus of claim 2 , wherein the sixth function block is further configured to determine h t L+1 =o t L+1 ⊙W proj L+1 tanh(c t L+1 )+(1−o t L+1 )⊙W h L+1 x t L+1 . 5. The apparatus of claim 2 , wherein the sixth function block is further configured to determine h t L + 1 = o t L + 1 ⊙ W proj L + 1 tanh ( c t L + 1 ) + ( 1 - o t L + 1 ) ⊙ ( ∑ i = 1 N W h , i L + 1 h t L + 1 - i ) . 6. The apparatus of claim 2 , wherein the sixth function block is further configured to determine h t L+1 =o t L+1 ⊙(W proj L+1 tanh(c t L+1 )+W h L+1 x t L+1 ). 7. The apparatus of claim 2 , wherein the sixth function block is further configured to determine h t L + 1 = o t L + 1 ⊙ ( W proj L + 1
Recurrent networks, e.g. Hopfield networks · CPC title
Learning methods · CPC title
characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title
using simulation · CPC title
Combinations of networks · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.