Selecting one of plural layouts of virtual machines on physical machines
US-9092250-B1 · Jul 28, 2015 · US
US9916135B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9916135-B2 |
| Application number | US-201615008571-A |
| Country | US |
| Kind code | B2 |
| Filing date | Jan 28, 2016 |
| Priority date | Dec 16, 2013 |
| Publication date | Mar 13, 2018 |
| Grant date | Mar 13, 2018 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A method for scaling a cloud infrastructure, comprises receiving at least one of resource-level metrics and application-level metrics, estimating parameters of at least one application based on the received metrics, automatically and dynamically determining directives for scaling application deployment based on the estimated parameters, and providing the directives to a cloud service provider to execute the scaling.
Opening claim text (preview).
What is claimed is: 1. A system, comprising: a memory and at least one processor coupled to the memory; and a modeling and optimization component, executed via the at least one processor, wherein the modeling and optimization component: receives at least one of resource-level metrics from a monitoring component and application-level metrics from at least one application; estimates parameters of the at least one application based on the at least one of the received resource-level and application-level metrics; and automatically and dynamically determines directives for scaling application deployment based on the estimated parameters; wherein at least one of the estimated parameters corresponds to an unobservable parameter of the at least one application and is estimated employing an estimation technique; and wherein the unobservable parameter comprises a background utilization parameter modeling resource utilization at a server due to jobs running on the server. 2. The system of claim 1 , wherein the modeling and optimization component performs the determining in response to changing workload demand. 3. The system of claim 1 , wherein the modeling and optimization component provides the directives to a cloud computing software component to execute the scaling. 4. The system of claim 1 , further comprising an execution component which determines placement of a virtual machine on a physical machine based on the directives and at least one of a colocation constraint, an availability constraint and a security constraint. 5. The system of claim 1 , wherein the modeling and optimization component performs the estimating using a Kalman filtering technique. 6. The system of claim 5 , wherein using the Kalman filtering technique comprises specifying a generic queueing-theoretic model. 7. The system of claim 1 , wherein the scaling is performed to meet performance goals specified in a service level agreement. 8. The system of claim 1 , wherein the modeling and optimization component performs the determining without user input about dynamically resizing deployment. 9. The system of claim 1 , wherein the unobservable parameter is used to predict future values of server utilization and response time. 10. The system of claim 1 wherein the estimation technique is employed without accessing or modifying the at least one application. 11. The system of claim 1 , wherein the directives for scaling comprise one or more directives indicating migration of one or more virtual machines for running the at least one application across one or more physical machines. 12. The system of claim 1 , further comprising an execution component which determines placement of a virtual machine on a physical machine based on the directives and a colocation constraint. 13. The system of claim 1 , further comprising an execution component which determines placement of a virtual machine on a physical machine based on the directives and a security constraint. 14. The system of claim 1 , further comprising an execution component which determines placement of a virtual machine on a physical machine based on the directives and an availability constraint. 15. A system, comprising: a memory and at least one processor coupled to the memory; and a modeling and optimization component, executed via the at least one processor, wherein the modeling and optimization component; receives at least one of resource-level metrics from a monitoring component and application-level metrics from at least one application; estimates parameters of the at least one application based on the at least one of the received resource-level and application-level metrics; and automatically and dynamically determines directives for scaling application deployment based on the estimated parameters; wherein at least one of the estimated parameters corresponds to an unobservable parameter of the at least one application and is estimated employing an estimation technique; and wherein the unobservable parameter is an existing parameter of the at least one application and hidden from the system comprising the modeling and optimization component. 16. A computer program product comprising a non-transitory computer readable storage medium having program instructions embodied therewith, the program instructions executable by a processor to cause the processor to perform a method comprising: receiving at least one of resource-level metrics and application-level metrics; estimating parameters of at least one application based on the at least one of the received resource-level and application-level metrics; and automatically and dynamically determining directives for scaling application deployment based on the estimated parameters; wherein at least one of the estimated parameters corresponds to an unobservable parameter of the at least one application and is estimated employing an estimation technique; wherein the unobservable parameter comprises a background utilization parameter modeling resource utilization at a server due to jobs running on the server.
in which an application is distributed across nodes in the network (software deployment G06F8/60; multiprogramming arrangements G06F9/46) · CPC title
Creating or negotiating SLA contracts, guarantees or penalties · CPC title
Network integration; Enabling network access in virtual machine instances · CPC title
Software maintenance or management · CPC title
Software metrics · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.