Scaling a cloud infrastructure

US9300552B2 · US · B2

Patent metadata
Field               Value
Publication number  US-9300552-B2
Application number  US-201414554331-A
Country             US
Kind code           B2
Filing date         Nov 26, 2014
Priority date       Dec 16, 2013
Publication date    Mar 29, 2016
Grant date          Mar 29, 2016

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection. Read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for scaling a cloud infrastructure, comprises receiving at least one of resource-level metrics and application-level metrics, estimating parameters of at least one application based on the received metrics, automatically and dynamically determining directives for scaling application deployment based on the estimated parameters, and providing the directives to a cloud service provider to execute the scaling.
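
The abstract's three steps (receive metrics, estimate parameters, determine scaling directives) can be illustrated with a minimal sketch. The abstract gives no formulas, so everything here is an assumption made for illustration: the `Metrics` fields, the use of the utilization law to estimate per-request service demand, the `forecast_rate` input, the 0.6 target utilization, and the shape of the directive dictionary. It is not the patented method, and it assumes a nonzero arrival rate.

```python
import math
from dataclasses import dataclass


@dataclass
class Metrics:
    arrival_rate: float     # application-level: requests per second (assumed > 0)
    cpu_utilization: float  # resource-level: mean per-instance utilization, 0..1
    instances: int          # instances currently deployed


def estimate_service_demand(m: Metrics) -> float:
    """Estimate CPU-seconds per request from the utilization law U = lambda * S / n."""
    return m.cpu_utilization * m.instances / m.arrival_rate


def scaling_directive(m: Metrics, forecast_rate: float,
                      target_utilization: float = 0.6) -> dict:
    """Size the deployment so the forecast load runs at or below the target utilization."""
    demand = estimate_service_demand(m)
    needed = max(1, math.ceil(forecast_rate * demand / target_utilization))
    if needed > m.instances:
        action = "scale_out"
    elif needed < m.instances:
        action = "scale_in"
    else:
        action = "hold"
    # Hypothetical directive format; a real cloud provider API would differ.
    return {"action": action, "target_instances": needed}


observed = Metrics(arrival_rate=100.0, cpu_utilization=0.8, instances=4)
print(scaling_directive(observed, forecast_rate=160.0))
```

With these illustrative numbers the estimated demand is 0.032 CPU-seconds per request, so a forecast of 160 requests per second yields a directive to scale out to 9 instances.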

First claim

Opening claim text (preview).

What is claimed is:

1. A method, comprising: receiving at least one of resource-level metrics and application-level metrics; estimating parameters of at least one application based on the at least one of the received resource-level and application-level metrics; and automatically and dynamically determining directives for scaling application deployment based on the estimated parameters; wherein at least one of the estimated parameters corresponds to an unobservable parameter of the at least one application and is estimated employing an estimation technique without accessing or modifying the at least one application; wherein the estimation technique continually adapts to dynamically changing system parameters during runtime; and wherein the receiving, estimating, and determining steps are performed by a computer system comprising a memory and at least one processor coupled to the memory.
2. The method of claim 1, wherein the determining is performed in response to changing workload demand.
3. The method of claim 1, further comprising providing the directives to a cloud service provider to execute the scaling.
4. The method of claim 1, further comprising determining placement of a virtual machine on a physical machine based on the directives and an availability constraint.
5. The method of claim 1, wherein the estimation technique comprises a Kalman filtering technique.
6. The method of claim 5, wherein the Kalman filtering technique comprises specifying a generic queueing-theoretic model.
7. The method of claim 1, wherein the scaling is performed to meet performance goals.
8. The method of claim 7, wherein the performance goals are specified in a service level agreement.
9. The method of claim 1, wherein the determining is performed without user input about dynamically resizing deployment.
10. The method of claim 1, wherein the directives for scaling comprise one or more directives indicating migration of one or more virtual machines for running the at least one application across one or more physical machines.
11. The method of claim 1, further comprising determining placement of a virtual machine on a physical machine based on the directives and a colocation constraint.
12. The method of claim 1, further comprising determining placement of a virtual machine on a physical machine based on the directives and a security constraint.
13. The method of claim 1, wherein the unobservable parameter comprises a background utilization parameter modeling resource utilization at a server due to jobs running on the server.
14. The method of claim 1, wherein the unobservable parameter is used to predict future values of server utilization and response time.
15. The method of claim 1, wherein the unobservable parameter is an existing parameter of the at least one application and hidden from the computer system performing the receiving, estimating, and determining steps.
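
Claims 5, 6, 13, and 14 refer to Kalman filtering over a queueing-theoretic model, a hidden background utilization parameter, and prediction of future utilization. The claim text shown here contains no equations, so the following is only a sketch under stated assumptions: a linear model `U = lam * s + u0` (per-request service time `s`, background utilization `u0`), a random-walk state model, hand-picked noise values `q` and `r`, and synthetic noise-free data. All function and variable names are hypothetical. Response-time prediction, which claim 14 also mentions, is not covered.

```python
def kalman_step(x, P, lam, u_meas, q=1e-6, r=1e-4):
    """One predict/update cycle for state x = (s, u0), measurement U = lam*s + u0."""
    # Predict: random-walk model, so the state is unchanged and uncertainty grows by q.
    P = [[P[0][0] + q, P[0][1]], [P[1][0], P[1][1] + q]]
    h = (lam, 1.0)                                   # measurement row vector
    innovation = u_meas - (h[0] * x[0] + h[1] * x[1])
    ph = (P[0][0] * h[0] + P[0][1] * h[1],           # P h^T
          P[1][0] * h[0] + P[1][1] * h[1])
    s_var = h[0] * ph[0] + h[1] * ph[1] + r          # innovation variance
    k = (ph[0] / s_var, ph[1] / s_var)               # Kalman gain
    x = (x[0] + k[0] * innovation, x[1] + k[1] * innovation)
    hp = (h[0] * P[0][0] + h[1] * P[1][0],           # h P
          h[0] * P[0][1] + h[1] * P[1][1])
    P = [[P[0][0] - k[0] * hp[0], P[0][1] - k[0] * hp[1]],   # (I - K h) P
         [P[1][0] - k[1] * hp[0], P[1][1] - k[1] * hp[1]]]
    return x, P


def run_filter(observations, x0=(0.0, 0.0)):
    """Feed (arrival_rate, measured_utilization) pairs through the filter."""
    x, P = x0, [[1.0, 0.0], [0.0, 1.0]]
    for lam, u_meas in observations:
        x, P = kalman_step(x, P, lam, u_meas)
    return x


def predict_utilization(x, lam):
    """Predict utilization at a future arrival rate from the estimated state."""
    return lam * x[0] + x[1]


# Synthetic data (assumed values): the filter never sees TRUE_S or TRUE_U0 directly.
TRUE_S, TRUE_U0 = 0.02, 0.10
rates = [5.0 + 5.0 * (t % 6) for t in range(200)]    # arrival rate cycles 5..30 req/s
obs = [(lam, lam * TRUE_S + TRUE_U0) for lam in rates]
s_hat, u0_hat = run_filter(obs)
print(s_hat, u0_hat, predict_utilization((s_hat, u0_hat), 40.0))
```

The two parameters are separable only because the arrival rate varies across observations. At a constant rate, `s` and `u0` could not be told apart from utilization alone.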

Assignees

IBM

Inventors

Classifications

  • Software metrics · CPC title

  • Discovery or management of network topologies · CPC title

  • Hypervisor-specific management and integration aspects · CPC title

  • Creating or negotiating SLA contracts, guarantees or penalties · CPC title

  • Network integration; Enabling network access in virtual machine instances · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US9300552B2 cover?
A method for scaling a cloud infrastructure, comprises receiving at least one of resource-level metrics and application-level metrics, estimating parameters of at least one application based on the received metrics, automatically and dynamically determining directives for scaling application deployment based on the estimated parameters, and providing the directives to a cloud service provider t…
Who is the assignee on this patent?
IBM
What technology area does this patent fall under?
Primary CPC classification G06F8/61. Mapped technology areas include Physics.
When was this patent published?
Publication date Mar 29, 2016 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).