Right-sizing resource requests by applications in dynamically scalable computing environments

US12596574B2 · US · B2

Patent metadata
Field: Value
Publication number: US-12596574-B2
Application number: US-202217893864-A
Country: US
Kind code: B2
Filing date: Aug 23, 2022
Priority date: Aug 27, 2021
Publication date: Apr 7, 2026
Grant date: Apr 7, 2026

Abstract

Official abstract text for this publication.

Methods, systems, and techniques for right-sizing resource requests for applications in a dynamically scalable computing environment. In one aspect, a method comprises monitoring resource usage of at least one computer resource by an application executing on a computer system, and monitoring resource requests for the computer resource(s) associated with the application. The method further comprises determining, for the computer resource(s), a resource usage upper bound associated with the application, testing the resource usage upper bound against at least one threshold, determining, from the testing, a resource request adjustment, and dynamically applying the resource request adjustment to the resource requests for the computer resource(s) associated with the application.
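The "resource usage upper bound" step can be illustrated with a minimal Python sketch that follows the rule recited in the first claim: count how often usage exceeded the mean plus some number of standard deviations, then take a percentage of the maximum if there were enough such instances, or a percentage of the mean otherwise. The function name, parameter names, and default values are hypothetical; the patent text only calls these values "predetermined". Reading "a predetermined number of instances" as "at least that many" and using the population standard deviation are also assumptions.

```python
from statistics import mean, pstdev
from typing import Sequence


def usage_upper_bound(
    usage: Sequence[float],
    num_std_devs: float = 2.0,   # assumed "predetermined number of standard deviations"
    min_spike_count: int = 3,    # assumed "predetermined number of instances"
    pct_of_max: float = 0.9,     # assumed "first predetermined percentage"
    pct_of_mean: float = 1.5,    # assumed "second predetermined percentage"
) -> float:
    """Return an upper bound for the usage samples of one monitoring window."""
    if not usage:
        raise ValueError("usage must contain at least one sample")

    mu = mean(usage)
    sigma = pstdev(usage)  # population standard deviation (an assumption)
    spike_level = mu + num_std_devs * sigma

    # Count samples that exceeded mean + k standard deviations.
    spikes = sum(1 for u in usage if u > spike_level)

    if spikes >= min_spike_count:
        # Spiky workload: bound is a percentage of the observed maximum.
        return pct_of_max * max(usage)
    # Steady workload: bound is a percentage of the mean.
    return pct_of_mean * mu
```

With these illustrative defaults, a flat series is bounded relative to its mean, while a series with repeated spikes is bounded relative to its peak. The input is assumed to be the samples from a single "predetermined time period".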

First claim

Opening claim text (preview).

The invention claimed is:

1. A computer-implemented method for right-sizing resource requests for applications in a dynamically scalable computing environment, the method comprising:
  monitoring, by a computer system, resource usage of at least one computer resource by an application executing in the dynamically scalable computing environment;
  monitoring, by the computer system, resource requests for the at least one computer resource associated with the application;
  determining, by the computer system, for the at least one computer resource, a resource usage upper bound associated with the application;
  testing, by the computer system, the resource usage upper bound against at least one threshold;
  determining, by the computer system, from the testing, a resource request adjustment; and
  the computer system dynamically applying the resource request adjustment to the resource requests for the at least one computer resource associated with the application;
  wherein determining, for the at least one computer resource, a resource usage upper bound associated with the application comprises:
    testing whether, during a predetermined time period, there have been a predetermined number of instances where the resource usage has exceeded a mean of the resource usage of the computer resource by the application plus a predetermined number of standard deviations of the resource usage of the computer resource by the application;
    responsive to a determination that there have been a predetermined number of instances where the resource usage has exceeded a mean of the resource usage of the computer resource by the application plus a predetermined number of standard deviations of the resource usage of the computer resource by the application, setting the resource usage upper bound to a first predetermined percentage of a maximum resource usage of the computer resource by the application; and
    responsive to a determination that there have not been a predetermined number of instances where the resource usage of the computer resource by the application has exceeded a mean of the resource usage of the computer resource by the application plus a predetermined number of standard deviations of the resource usage of the computer resource by the application, setting the resource usage upper bound to a second predetermined percentage of the mean of the resource usage of the computer resource by the application.

2. The method of claim 1, wherein the resource request adjustment is applied automatically to the resource requests absent human intervention to reduce latency.

3. The method of claim 1, further comprising:
  evaluating performance of the resource request adjustment as applied to the resource requests; and
  dynamically modifying at least one of the threshold and the resource usage upper bound based on the performance of the resource request adjustment as applied to the resource requests.

4. The method of claim 3, wherein modifying at least one of the threshold and the resource usage upper bound based on the performance of the resource request adjustment as applied to the resource requests is performed by machine learning.

5. The method of claim 4, wherein evaluating performance of the resource request adjustment as applied to the resource requests and modifying at least one of the threshold and the resource usage upper bound based on the performance of the resource request adjustment as applied to the resource requests is carried out recursively.

6. The method of claim 1, wherein the threshold is determined by machine learning.

7. The method of claim 1, wherein the at least one computer resource includes at least one of CPU resource, memory resource, network utilization, disk I/O, graphical processing unit (GPU) uses and database usage.

8. The method of claim 1, wherein the application is a containerized application.

9. The method of claim 1, wherein the application is a virtual machine application.

10. The method of claim 1, wherein the resource usage upper bound is different from an actual maximum resource usage of the application.

11. The method of claim 1, wherein the resource requests include a limit value for the at least one computer resource, and wherein the resource usage upper bound differs from the limit value for the at least one computer resource.

12. The method of claim 1, wherein testing the resource usage upper bound against at least one threshold derived from the resource requests comprises:
  testing whether the resource usage upper bound divided by a mean of the resource requests exceeds a first predetermined value;
  responsive to a determination that the resource usage upper bound divided by the mean of the resource requests exceeds the first predetermined value, testing whether the resource usage upper bound divided by the mean of the resource requests exceeds a second predetermined value that is greater than the first predetermined value by more than a third predetermined value; and
  responsive to a determination that the resource usage upper bound divided by the mean of the resource requests fails to exceed the first predetermined value, testing whether subtracting the resource usage upper bound divided by the mean of the resource requests from a fourth predetermined value yields a result that exceeds a fifth predetermined value.

13. The method of claim 12, wherein determining, from the testing, the resource request adjustment comprises:
  responsive to a determination that the resource usage upper bound divided by the mean of the resource requests exceeds the second predetermined value by more than the third predetermined value, setting the resource request adjustment to increase the resource requests;
  responsive to a determination that subtracting the resource usage upper bound divided by the mean of the resource requests from the fourth predetermined value yields a result that exceeds the fifth predetermined value, setting the resource request adjustment to decrease the resource requests;
  responsive to a determination that the resource usage upper bound divided by the mean of the resource requests fails to exceed the second predetermined value by more than the third predetermined value, setting the resource request adjustment to leave the resource requests unchanged; and
  responsive to a determination that subtracting the resource usage upper bound divided by the mean of the resource requests from the fourth predetermined value yields a result that fails to exceed the fifth predetermined value, setting the resource request adjustment to leave the resource requests unchanged.

14. The method of claim 13, wherein setting the resource request adjustment to increase the resource requests comprises setting the resource request adjustment to increase the resource requests by a percentage equal to 100% multiplied by the lesser of (a) a sixth predetermined value and (b) a seventh predetermined value subtracted from the resource usage upper bound divided by the mean of the resource requests.

15. The method of claim 13, wherein setting the resource request adjustment to decrease the resource requests comprises setting the resource request adjustment to decrease the resource requests by a percentage equal to 100% multiplied by the lesser of (a) an eighth predetermined value and (b) the resource usage upper bound divided by the mean of the resource requests subtracted from a ninth predetermined value.

16. A data processing system comprising at least one processor and memory coupled to the at least one processor, wherein the memory contains instruct
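The threshold tests and adjustment sizes recited in claims 12 to 15 can be read as a small decision procedure on the ratio of the usage upper bound to the mean of the resource requests. The Python sketch below is one possible reading, not the patented implementation. The `Thresholds` class, the function name, and every numeric default are hypothetical, since the claims only refer to first through ninth "predetermined values". The sketch follows the wording of claim 13, in which the ratio must exceed the second value by more than the third value.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Sequence


@dataclass(frozen=True)
class Thresholds:
    """Hypothetical values for the nine 'predetermined values' in claims 12-15."""
    v1: float = 0.9  # first: splits the "maybe increase" and "maybe decrease" branches
    v2: float = 1.0  # second: must be greater than v1 (claim 12)
    v3: float = 0.1  # third: margin above v2 required before increasing
    v4: float = 1.0  # fourth: value the ratio is subtracted from
    v5: float = 0.2  # fifth: margin required before decreasing
    v6: float = 0.5  # sixth: cap on the increase fraction
    v7: float = 1.0  # seventh: subtracted from the ratio to size the increase
    v8: float = 0.5  # eighth: cap on the decrease fraction
    v9: float = 1.0  # ninth: the ratio is subtracted from this to size the decrease


def request_adjustment_pct(
    upper_bound: float,
    requests: Sequence[float],
    t: Thresholds = Thresholds(),
) -> float:
    """Return a signed percentage: positive = increase, negative = decrease, 0 = unchanged."""
    if not requests:
        raise ValueError("requests must contain at least one sample")
    avg_request = mean(requests)
    if avg_request <= 0:
        raise ValueError("mean of requests must be positive")

    ratio = upper_bound / avg_request

    if ratio > t.v1:
        # Claim 13: increase only if the ratio exceeds v2 by more than v3.
        if ratio - t.v2 > t.v3:
            # Claim 14: 100% x lesser of (v6, ratio - v7).
            return 100.0 * min(t.v6, ratio - t.v7)
        return 0.0

    # Ratio did not exceed v1: decrease only if (v4 - ratio) exceeds v5.
    if t.v4 - ratio > t.v5:
        # Claim 15: 100% x lesser of (v8, v9 - ratio).
        return -100.0 * min(t.v8, t.v9 - ratio)
    return 0.0
```

Under these illustrative defaults, an upper bound of 130 against a mean request of 100 yields an increase of about 30%, an upper bound of 70 yields a decrease of about 30%, and ratios close to 1 leave the requests unchanged. The caps `v6` and `v8` limit how far a single adjustment can move the requests.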

Assignees

Royal Bank Of Canada

Inventors

Classifications

CPC titles:

  • Hypervisor-specific management and integration aspects

  • Resource capping

  • Monitor

  • the resources being hardware resources other than CPUs, Servers and Terminals

  • Distribution of virtual machine instances; Migration and load balancing

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12596574B2 cover?
Methods, systems, and techniques for right-sizing resource requests for applications in a dynamically scalable computing environment. In one aspect, a method comprises monitoring resource usage of at least one computer resource by an application executing on a computer system, and monitoring resource requests for the computer resource(s) associated with the application. The method further compr…
Who is the assignee on this patent?
Royal Bank Of Canada
What technology area does this patent fall under?
Primary CPC classification G06F9/45558. Mapped technology areas include Physics.
When was this patent published?
Publication date: Apr 7, 2026 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 7 related publications on this page (citations in our corpus or others sharing the same primary CPC).