Modular reinforcement-learning-based application manager
US-10802864-B2 · Oct 13, 2020 · US
US10949263B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-10949263-B2 |
| Application number | US-201916518717-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jul 22, 2019 |
| Priority date | Aug 27, 2018 |
| Publication date | Mar 16, 2021 |
| Grant date | Mar 16, 2021 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
The current document is directed to automated reinforcement-learning-based application managers that obtain increased computational efficiency by reusing learned models and by using human-management experience to truncate state and observation vectors. Learned models of managed environments that receive component-associated inputs can be partially or completely reused for similar environments. Human managers and administrators generally use only a subset of the available metrics in managing an application, and that subset can be used as an initial subset of metrics for learning an optimal or near-optimal control policy by an automated reinforcement-learning-based application manager.
Opening claim text (preview).
The invention claimed is: 1. An automated reinforcement-learning-based application manager that manages a computing environment that includes one or more applications and one or more of a distributed computing system having multiple computer systems interconnected by one or more networks, a standalone computer system, and a processor-controlled user device, the reinforcement-learning based application manager comprising: one or more processors, one or more memories, and one or more communications subsystems; a set of actions that can be issued to the computing environment; a set of metrics; an iterative control process that repeatedly selects and issues a next action, according to a control policy that uses a dimensionally reduced computational-environment vector that includes metric values and that represents a current state of the computational environment obtained by projecting a computational-environment vector into a vector subspace defined by a subset of metrics used by human application managers to control distributed applications, and receives a reward and one of an observation and a metric vector from the computing environment, in response to execution of the issued next action, which the control process projects into the vector subspace and uses to attempt to learn an optimal or near-optimal control policy, over time. 2. The automated reinforcement-learning-based application manager of claim 1 wherein each action is represented by a set of values; and wherein representations of actions are each translated into one or more commands that are directed to computational-entity components of the computing environment. 3. The automated reinforcement-learning-based application manager of claim 2 wherein computational-environment vectors represent states and/or observations and include metric values. 4. The automated reinforcement-learning-based application manager of claim 3 wherein the subset of metrics used by human application managers is specified to the automated reinforcement-learning-based application manager via one or more of: a manager interface; a configuration files; and a hard coding. 5. The automated reinforcement-learning-based application manager of claim 1 projecting a computational-environment vector into a vector subspace defined by a subset of metrics used by human application managers to control distributed applications further comprises removing metric values from the computational-environment vector that do not correspond to metrics in the metric subset used by human application managers to control distributed applications. 6. The automated reinforcement-learning-based application manager of claim 1 wherein the model includes: a set of computing-environment-component inputs; a set of computing-environment-component outputs; and internal components that each includes learned information and that transform the information input to the model through the set of computing-environment-component inputs to outputs from the model through the set of computing-environment-component outputs, each internal component related one of a set of computing-environment-component-input ancestors and a subset of the set of computing-environment-component-input descendants. 7. The automated reinforcement-learning-based application manager of claim 6 wherein the learned information, learned by the different automated reinforcement-learning-based application manager, included in those internal components with identical or equivalent computing-environment-component-input ancestors or computing-environment-component-input descendants in the computing environment and the different computing environment is reused by the automated reinforcement-learning-based application manager. 8. The automated reinforcement-learning-based application manager of claim 7 wherein the computing-environment-component inputs and the computing-environment-component outputs also included stored information that may be reused for those computing-environment-component inputs and the computing-environment-component outputs associated with identical or equivalent components in the computing-environment-component-input descendants in the computing environment and the different computing environment. 9. The method of claim 7 wherein the computing-environment-component inputs and the computing-environment-component outputs also included stored information that may be reused for those computing-environment-component inputs and the computing-environment-component outputs associated with identical or equivalent components in the computing-environment-component-input descendants in the computing environment and the different computing environment. 10. The automated reinforcement-learning-based application manager of claim 6 wherein the model is a neural network; wherein the computing-environment-component inputs are input nodes; wherein the computing-environment-component outputs are output nodes; wherein the internal components are hidden nodes; and wherein the learned information is weights associated with nodes of the neural network. 11. The method of claim 6 wherein the model is a neural network; wherein the computing-environment-component inputs are input nodes; wherein the computing-environment-component outputs are output nodes; wherein the internal components are hidden nodes; and wherein the learned information is weights associated with nodes of the neural network. 12. An automated reinforcement-learning-based application manager that manages a computing environment that includes one or more applications and one or more of a distributed computing system having multiple computer systems interconnected by one or more networks, a standalone computer system, and a processor-controlled user device, the reinforcement-learning based application manager comprising: one or more processors, one or more memories, and one or more communications subsystems; a set of actions that can be issued to the computing environment; a set of metrics; a model partly or completely learned during operation of a different automated reinforcement-learning-based application manager that controls a different computing environment; and an iterative control process that repeatedly selects and issues a next action, according to a control policy that uses a computational-environment vector that includes metric values and that represents a current state of the computational environment, and receives a reward and one of an observation and a metric vector from the computing environment, in response to execution of the issued next action, which the control process uses to attempt to learn an optimal or near-optimal control policy, over time, by improving the model. 13. The automated reinforcement-learning-based application manager of claim 12 wherein each action is represented by a set of values; and wherein representations of actions are each translated into one or more commands that are directed to computational-entity components of the computing environment. 14. The automated reinforcement-learning-based application manager of claim 13 wherein computational-environment vectors represent states and/or observations and include metric values. 15. A method that improves the computational efficiency of an automated reinforcement-learning-based application manager having one or more processors, one or more memories, one or more communications subsystems, a set of actions that can be issued by an iterative control process to a computing environment, controlled by the automated reinforcement-learning-based application manager,
Probabilistic graphical models, e.g. probabilistic networks · CPC title
Activation functions · CPC title
based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO] · CPC title
Machine learning · CPC title
Logical partitioning of resources; Management or configuration of virtualized resources (specific details on emulation or internal functioning of virtual machines G06F9/455) · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.