Pre-emptive container load-balancing, auto-scaling and placement

US11604682B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11604682-B2
Application numberUS-202017139210-A
CountryUS
Kind codeB2
Filing dateDec 31, 2020
Priority dateDec 31, 2020
Publication dateMar 14, 2023
Grant dateMar 14, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A resource usage platform is disclosed. The platform performs preemptive container load balancing, auto scaling, and placement in a computing system. Resource usage data is collected from containers and used to train a model that generates inferences regarding resource usage. The resource usage operations are performed based on the inferences and on environment data such as available resources, service needs, and hardware requirements.

First claim

Opening claim text (preview).

What is claimed is: 1. A method, comprising: receiving a request to invoke a container in a computing system from a container image; generating an inference from an inference service by providing an input to the inference service, wherein the inference includes a prediction of resources needed for the request; receiving environment data associated with the computing system, the environment data including information about resources available at nodes in the computing system; attaching services information to the container that describe services used by the container; scheduling the container to run on a node in the computing system based on the inference and the environment data and based on the services information attached to the container, wherein the node has access to the services; and running the container on the node. 2. The method of claim 1 , wherein the environment data comprises container metadata including hardware requirements, resource usage data, and service requirements. 3. The method of claim 1 , further comprising training a model with telemetry data collected from running containers in the computing system. 4. The method of claim 3 , further comprising collecting telemetry data from each container on each node in the computing system. 5. The method of claim 1 , wherein the environment data includes resources consumed by each container at each node, resources available at each node, and execution times of each container. 6. The method of claim 1 , further comprising scheduling the request in an existing and running container, scheduling the request in a new container on a selected node, or rejecting the request. 7. The method of claim 1 , further comprising training a model using telemetry data and/or aggregating the telemetry data. 8. The method of claim 1 , further comprising auto scaling containers in the computing system and/or performing federating learning. 9. A non-transitory storage medium having stored therein instructions that are executable by one or more hardware processors to perform operations comprising: receiving a request to invoke a container in a computing system from a container image; generating an inference from an inference service by providing an input to the inference service, wherein the inference includes a prediction of resources needed for the request; receiving environment data associated with the computing system, the environment data including information about resources available at nodes in the computing system; attaching services information to the container that describe services used by the container; scheduling the container to run on a node in the computing system based on the inference and the environment data and based on the services information attached to the container, wherein the node has access to the services; and running the container on the node. 10. The non-transitory storage medium of claim 9 , wherein the environment data comprises container metadata including hardware requirements, resource usage data, and service requirements. 11. The non-transitory storage medium of claim 9 , further comprising training a model with telemetry data collected from running containers in the computing system. 12. The non-transitory storage medium of claim 11 , further comprising collecting telemetry data from each container on each node in the computing system. 13. The non-transitory storage medium of claim 9 , wherein the environment data includes resources consumed by each container at each node, resources available at each node, and execution times of each container. 14. The non-transitory storage medium of claim 9 , further comprising scheduling the request in an existing and running container, scheduling the request in a new container on a selected node, or rejecting the request. 15. The non-transitory storage medium of claim 9 , further comprising training a model using telemetry data and/or aggregating the telemetry data. 16. The non-transitory storage medium of claim 9 , further comprising auto scaling containers in the computing system and/or performing federating learning.

Assignees

Inventors

Classifications

  • Distributed learning, e.g. federated learning · CPC title

  • Supervised learning · CPC title

  • Learning methods · CPC title

  • G06F9/5083Primary

    Techniques for rebalancing the load in a distributed system · CPC title

  • Workload prediction · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11604682B2 cover?
A resource usage platform is disclosed. The platform performs preemptive container load balancing, auto scaling, and placement in a computing system. Resource usage data is collected from containers and used to train a model that generates inferences regarding resource usage. The resource usage operations are performed based on the inferences and on environment data such as available resources,…
Who is the assignee on this patent?
Emc Ip Holding Co Llc
What technology area does this patent fall under?
Primary CPC classification G06F9/5083. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Mar 14 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).