System and method for controlling multidirectional operation of an elevator
US-2024425322-A1 · Dec 26, 2024 · US
US11256990B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-11256990-B2 |
| Application number | US-201716303101-A |
| Country | US |
| Kind code | B2 |
| Filing date | May 19, 2017 |
| Priority date | May 20, 2016 |
| Publication date | Feb 22, 2022 |
| Grant date | Feb 22, 2022 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a recurrent neural network on training sequences using backpropagation through time. In one aspect, a method includes receiving a training sequence including a respective input at each of a number of time steps; obtaining data defining an amount of memory allocated to storing forward propagation information for use during backpropagation; determining, from the number of time steps in the training sequence and from the amount of memory allocated to storing the forward propagation information, a training policy for processing the training sequence, wherein the training policy defines when to store forward propagation information during forward propagation of the training sequence; and training the recurrent neural network on the training sequence in accordance with the training policy.
Opening claim text (preview).
What is claimed is: 1. A method performed by one or more computers for training a recurrent neural network on a plurality of training sequences using backpropagation through time, the method comprising: receiving a training sequence comprising a respective input at each of a number of time steps; determining a training policy for processing the training sequence, wherein the training policy defines when to store forward propagation information during forward propagation of the training sequence, the training policy being such as to balance a trade-off between caching of forward propagation information and re-computation of forward propagation information during backpropagation; wherein determining the training policy comprises: (i) obtaining data defining an amount of memory allocated to storing forward propagation information for use during backpropagation; (ii) determining forward propagation information in the form of candidates for storage, wherein the candidate forward propagation information can include hidden states for time steps of the recurrent neural network and/or internal states for time steps of the recurrent neural network; (iii) identifying one or more strategies for storing forward propagation information from memory, wherein each strategy includes a sequence of candidates for storage that will be stored if following the strategy; (iv) determining a computational cost for each strategy; wherein the computational cost Q 1 (t, m, y) of a strategy that includes saving a hidden state of a time step after choosing not to save hidden states for y time steps is given by: Q 1 ( t,m,y )= y+C ( y,m )+ C ( t−y,m− 1) where t is a number of time steps over which backpropagation is performed, m is a number of available memory units, C(y, m) is the lowest possible computational cost of backpropagating over a sequence of y time steps given the amount of allocated memory equal to m; and C(t−y, m−1) is the lowest possible computational cost of backpropagating over a sequence of t−y time steps given the amount of allocated memory equal to m−1, and the computational cost Q 2 (t, m, y) of a strategy that includes saving internal states of a time step after choosing not to save internal states for y time steps is given by: Q 2 ( t,m,y )= y+C ( y− 1, m )+ C ( t−y,m −α) where t is a number of time steps over which backpropagation is performed, m is a number of available memory units, α is a ratio of size of internal states for time steps to the size of hidden states for time steps, C(y−1, m) is the lowest possible computational cost of backpropagating over a sequence of y−1 time steps given the amount of allocated memory equal to m, and C(t−y, m−α) is the lowest possible computational cost of backpropagating over a sequence of t−y time steps given the amount of allocated memory equal to m−α; (v) selecting the strategy having the lowest computational cost; (vi) determining the position of next storage and type of forward propagation from the selected strategy; (vii) dividing the sequence of time steps in the training sequence to obtain a first subsequence and second subsequence, the first subsequence including time steps before the determined position of next storage, the second subsequence including time steps after the determined position of next storage; and recursively repeating steps (i)-(vii) for each obtained subsequence to select a position of next forward propagation information to save; the method further comprising: training the recurrent neural network on the training sequence in accordance with the training policy by: forward-propagating the inputs in the training sequence from a first time step in the sequence to a last time step in the sequence; during the forward propagating, storing forward propagation information in accordance with the training policy; and backpropagating gradients from the last time step in the sequence to the first time step in the sequence, comprising: determining, for each time step, whether additional forward propagation information is necessary to backpropagate the gradient for the time step and, if so, regenerating the additional forward propagation information using the stored forward propagation information. 2. The method of claim 1 , wherein the forward propagation information includes hidden states. 3. The method of claim 1 , wherein the forward propagation information includes internal states. 4. The method of claim 1 , wherein the forward propagation information includes both hidden states and internal states. 5. The method of claim 4 , wherein the training policy defines, for each time step, whether to store a hidden state, an internal state, or neither for the time step. 6. The method of claim 1 , wherein: determining the policy comprises: for each time slot, determining a computational cost of storing an associated piece of forward propagation information based on the number of time steps and the amount of memory allocated to storing the forward propagation information; adding to the training policy a set of pieces of forward propagation information that are determined to have the lowest computational cost; and training the recurrent neural network on the training sequence in accordance with the training policy comprises storing the set of pieces of forward propagation information that have been added to the training policy. 7. The method of claim 1 , further comprising: determining a computational cost of training the recurrent neural network on the training sequence in accordance with the policy. 8. The method of claim 7 , further comprising: providing data identifying the computational cost for presentation to a user. 9. One or more non-transitory computer storage media encoded with instructions that, when executed by one or more computers, cause the one or more computers to perform operations for training a recurrent neural network on a plurality of training sequences using backpropagation through time, the operations comprising: receiving a training sequence comprising a respective input at each of a number of time steps; determining a training policy for processing the training sequence, wherein the training policy defines when to store forward propagation information during forward propagation of the training sequence, the training policy being such as to balance a trade-off between caching of forward propagation information and re-computation of forward propagation information during backpropagation; wherein determining the training policy comprises: (i) obtaining data defining an amount of memory allocated to storing forward propagation information for use during backpropagation; (ii) determining forward propagation information in the form of candidates for storage, wherein the candidate forward propagation information can include hidden states for time steps of the recurrent neural network and/or internal states for time steps of the recurrent neural network; (iii) identifying one or more strategies for storing forward propagation information from memory, wherein each strategy includes a sequence of candidates for storage that will be stored if following the strategy; (iv) determining a computational cost for each strategy; wherein the computational cost Q 1 (t, m, y) of a strategy that includes saving a hidden state of a time step after choosing not to save hidden states for y time steps is given by: Q 1 ( t,m,y )= y+C ( y,m )+ C ( t−y,m− 1) where t is a number of time steps over which backpropagation is performed, m is a number of available memory units, C(y, m) is the lowest possible computational cost of backpropagating over a sequence of y time steps given the amount of allocated memory equal to m; and C(t−y, m−1) is the lowe
Recurrent networks, e.g. Hopfield networks · CPC title
Supervised learning · CPC title
characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title
Backpropagation, e.g. using gradient descent · CPC title
Physics · mapped topic
Related publications grouped by family.
Answers are generated from the same data shown on this page.