Managing data center orchestration using service plans and manifests
US-2024385850-A1 · Nov 21, 2024 · US
US2025328391A1 · US · A1
| Field | Value |
|---|---|
| Publication number | US-2025328391-A1 |
| Application number | US-202418637593-A |
| Country | US |
| Kind code | A1 |
| Filing date | Apr 17, 2024 |
| Priority date | Apr 17, 2024 |
| Publication date | Oct 23, 2025 |
| Grant date | — |
Official abstract text for this publication.
The disclosure describes a node management service that proactively scales up compute nodes in a compute cluster. The node management service interfaces with an orchestration service, a compute provider and a compute cluster running instances of an object. The node management service receives meta data from an orchestration service indicating the desired number of instances of an object. Based on the desired number of instances, the node management service obtains, from the compute provider, new compute nodes for the compute cluster to accommodate the desired number of instances.
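The sizing decision described in the abstract can be illustrated with a minimal sketch. The publication does not say how the number of new nodes is computed, so the fixed per-node capacity (`instances_per_node`), the `free_slots` figure, and the function name `new_nodes_needed` are all hypothetical assumptions made only for illustration.

```python
import math


def new_nodes_needed(desired_instances: int,
                     running_instances: int,
                     free_slots: int,
                     instances_per_node: int) -> int:
    """Return how many new nodes to request ahead of scheduling.

    Assumption (not from the publication): every node hosts a fixed
    number of instances, and free_slots counts the unused capacity on
    nodes already in the cluster.
    """
    if instances_per_node <= 0:
        raise ValueError("instances_per_node must be positive")
    # Instances the orchestration service still has to place.
    pending = max(desired_instances - running_instances, 0)
    # Instances that will not fit on the current nodes.
    overflow = max(pending - free_slots, 0)
    # Round up: a partially filled node is still a whole node.
    return math.ceil(overflow / instances_per_node)
```

Under these assumptions, scaling from 4 to 10 instances with 2 free slots and 3 instances per node gives a request for 2 new nodes, and a scale-up that fits in existing capacity gives 0.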
Opening claim text (preview).
What is claimed is:
1. A method of operating a node management service, the method comprising: receiving, from an orchestration service, meta data indicating a number of instances of an object to scale up with respect to an application deployed on compute nodes in a compute cluster, the compute nodes being provided by a compute provider and managed by the node management service; and proactively obtaining, based on the number of instances indicated in the meta data, one or more new compute nodes from the compute provider.
2. The method of claim 1 further comprising: after obtaining the new compute nodes, receiving a request from the orchestration service to deploy at least an instance of the object in the compute cluster; and providing, to the orchestration service in response to the request, an identification of one of the one or more new compute nodes on which to deploy the instance of the object.
3. The method of claim 2 wherein, to scale up the application, the orchestration service determines the number of instances of the object to scale up, edits the meta data to reflect the number of the instances, and adds the instances into an instance registry, from where the instances are scheduled for deployment to the compute cluster.
4. The method of claim 3 wherein obtaining the new compute nodes occurs prior to any of the instances being scheduled for deployment and prior to the orchestration service having added all of the instances to the instance registry.
5. The method of claim 4, further comprising: determining that the orchestration service has successfully created at least one instance corresponding to the number of instances, wherein the obtaining the one or more new compute nodes is in response to determining that the orchestration service has successfully added at least one instance to the instance registry.
6. The method of claim 1 further comprising: predicting, based on the number of instances indicated in the meta data, that current compute nodes in the compute cluster are not sufficient to accommodate the instances, wherein the obtaining the one or more new compute nodes is in response to determining that the current compute nodes are not sufficient.
7. The method of claim 1 further comprising: generating a user interface for display to an owner of the application, wherein the user interface comprises an element allowing the owner to enable a predictive scaling feature of the node management service, wherein the predictive scaling feature includes the obtaining the one or more new compute nodes based on the number of objects.
8. A system for operating a node management service, the system comprising: one or more processors; and one or more memories operably coupled to the one or more processors and having stored thereon software instructions that, upon execution by the one or more processors, cause the one or more processors to: receive, from an orchestration service, meta data indicating a number of instances of an object to scale up with respect to an application deployed on compute nodes in a compute cluster, the compute nodes being provided by a compute provider and managed by the node management service; obtain, from the compute provider, one or more new compute nodes for the compute cluster based on the number of instances indicated in the meta data; after obtaining the new compute nodes, receive a request from the orchestration service to deploy at least an instance of the object in the compute cluster; and provide, to the orchestration service in response to the request, an identification of one of the one or more new compute nodes on which to deploy the instance of the object.
9. The system of claim 8, wherein, to scale up the application, the orchestration service determines the number of instances of the object to scale up, edits the meta data to reflect the number of the instances, and adds the instances into an instance registry, from where the instances are scheduled for deployment to the compute cluster.
10. The system of claim 9, wherein obtaining the new compute nodes occurs prior to any of the instances being scheduled for deployment and prior to the orchestration service having added all of the instances to the instance registry.
11. The system of claim 9, wherein the software instructions comprise further instructions that, upon execution by the one or more processors, cause the one or more processors to: determine that the orchestration service has successfully created at least one object corresponding to the number of objects, wherein the obtaining the one or more new compute nodes is in response to determining that the orchestration service has successfully added at least one instance to the instance registry.
12. The system of claim 8, wherein the software instructions comprise further instructions that, upon execution by the one or more processors, cause the one or more processors to: predict, based on the number of objects indicated in the meta data, that current compute nodes in the compute cluster are not sufficient to accommodate the instances, wherein the obtaining the one or more new compute nodes is in response to determining that the current compute nodes are not sufficient.
13. The system of claim 8, wherein the software instructions comprise further instructions that, upon execution by the one or more processors, cause the one or more processors to: generate a user interface for display to an owner of the application, wherein the user interface comprises an element allowing the owner to enable a predictive scaling feature of the node management service, wherein the predictive scaling feature includes the obtaining the one or more new compute nodes based on the number of objects.
14. The system of claim 8, wherein the orchestration service comprises Kubernetes, wherein the compute cluster comprises a Kubernetes cluster, and wherein the number of instances corresponds to a desired number of pod replicas in the Kubernetes cluster.
15. A computer-readable storage media having program instructions stored thereon to operate a node management service, wherein the program instructions, upon execution by one or more processors, cause the one or more processors to: generate a user interface for display to an owner of an application deployed on compute nodes in a compute cluster, the compute nodes being provided by a compute provider and managed by the node management service, wherein the user interface comprises an element allowing the owner to enable a predictive scaling feature of the node management service; in response to a scale-up event, determine whether the predictive scaling feature is enabled for the application; and in response to determining that the predictive scaling feature is enabled for the application, apply the predictive scaling feature to the application.
16. The computer-readable storage media of claim 15 wherein the program instructions further cause the one or more processors to: receive, from an orchestration service, meta data indicating a number of instances of an object to scale up with respect to the application; obtain, from the compute provider, one or more new compute nodes for the compute cluster based on the number of instances indicated in the meta data; after obtaining the new compute nodes, receive a request from the orchestration service to deploy at least an instance of the object in the compute cluster; and provide, to the orchestration service in response to the request, an identification of one of the one or more new compute nodes o
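The flow recited across claims 1, 2, and 15 (check whether predictive scaling is enabled, obtain nodes in advance, then answer a later deployment request with one of those nodes) might be sketched as follows. This is a toy in-memory model, not the claimed implementation: the class `NodeManager`, its method names, the per-application flag dictionary, and the fixed `instances_per_node` capacity are all hypothetical, and `obtain_node` merely stands in for a call to a compute provider.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class NodeManager:
    """Toy stand-in for a node management service (hypothetical API)."""
    instances_per_node: int
    predictive_enabled: Dict[str, bool] = field(default_factory=dict)
    free_slots: Dict[str, int] = field(default_factory=dict)  # node id -> open slots
    next_id: int = 1

    def obtain_node(self) -> str:
        # Stand-in for requesting a node from the compute provider.
        node_id = f"node-{self.next_id}"
        self.next_id += 1
        self.free_slots[node_id] = self.instances_per_node
        return node_id

    def on_scale_up(self, app: str, instances_to_add: int) -> int:
        """React to metadata naming the number of instances to scale up."""
        if not self.predictive_enabled.get(app, False):
            return 0  # feature disabled: obtain nothing in advance
        # Ceiling division: enough nodes to hold every new instance.
        count = -(-instances_to_add // self.instances_per_node)
        for _ in range(count):
            self.obtain_node()
        return count

    def place_instance(self) -> Optional[str]:
        """Answer a deployment request with a node that has room, if any."""
        for node_id, slots in self.free_slots.items():
            if slots > 0:
                self.free_slots[node_id] = slots - 1
                return node_id
        return None
```

A short usage example under the same assumptions:

```python
mgr = NodeManager(instances_per_node=2, predictive_enabled={"shop": True})
mgr.on_scale_up("shop", 3)   # obtains two nodes before any scheduling
mgr.place_instance()         # returns the identifier of a pre-obtained node
```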
Workload prediction · CPC title
Partitioning or combining of resources · CPC title
considering the load · CPC title
Logical partitioning of resources; Management or configuration of virtualized resources (specific details on emulation or internal functioning of virtual machines G06F9/455) · CPC title
Grid computing · CPC title