Predictive Scale-Up Of Compute Nodes In A Software Orchestration Cluster

US2025328391A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2025328391-A1
Application numberUS-202418637593-A
CountryUS
Kind codeA1
Filing dateApr 17, 2024
Priority dateApr 17, 2024
Publication dateOct 23, 2025
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The disclosure describes a node management service that proactively scales up compute nodes in a compute cluster. The node management service interfaces with an orchestration service, a compute provider and a compute cluster running instances of an object. The node management service receives meta data from an orchestration service indicating the desired number of instances of an object. Based on the desired number of instances, the node management service obtains, from the compute provider, new compute nodes for the compute cluster to accommodate the desired number of instances.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method of operating a node management service, the method comprising: receiving, from an orchestration service, meta data indicating a number of instances of an object to scale up with respect to an application deployed on compute nodes in a compute cluster, the compute nodes being provided by a compute provider and managed by the node management service; and proactively obtaining, based on the number of instances indicated in the meta data, one or more new compute nodes from the compute provider. 2 . The method of claim 1 further comprising: after obtaining the new compute nodes, receiving a request from the orchestration service to deploy at least an instance of the object in the compute cluster; and providing, to the orchestration service in response to the request, an identification of one of the one or more new compute nodes on which to deploy the instance of the object. 3 . The method of claim 2 wherein, to scale up the application, the orchestration service determines the number of instances of the object to scale up, edits the meta data to reflect the number of the instances, and adds the instances into an instance registry, from where the instances are scheduled for deployment to the compute cluster. 4 . The method of claim 3 wherein obtaining the new compute nodes occurs prior to any of the instances being scheduled for deployment and prior to the orchestration service having added all of the instances to the instance registry. 5 . The method of claim 4 , further comprising: determining that the orchestration service has successfully created at least one instance corresponding to the number of instances, wherein the obtaining the one or more new compute nodes is in response to determining that the orchestration service has successfully added at least one instance to the instance registry. 6 . The method of claim 1 further comprising: predicting, based on the number of instances indicated in the meta data, that current compute nodes in the compute cluster are not sufficient to accommodate the instances, wherein the obtaining the one or more new compute nodes is in response to determining that the current compute nodes are not sufficient. 7 . The method of claim 1 further comprising: generating a user interface for display to an owner of the application, wherein the user interface comprises an element allowing the owner to enable a predictive scaling feature of the node management service, wherein the predictive scaling feature includes the obtaining the one or more new compute nodes based on the number of objects. 8 . A system for operating a node management service, the system comprising: one or more processors; and one or more memories operably coupled to the one or more processors and having stored thereon software instructions that, upon execution by the one or more processors, cause the one or more processors to: receive, from an orchestration service, meta data indicating a number of instances of an object to scale up with respect to an application deployed on compute nodes in a compute cluster, the compute nodes being provided by a compute provider and managed by the node management service; obtain, from the compute provider, one or more new compute nodes for the compute cluster based on the number of instances indicated in the meta data; after obtaining the new compute nodes, receive a request from the orchestration service to deploy at least an instance of the object in the compute cluster; and provide, to the orchestration service in response to the request, an identification of one of the one or more new compute nodes on which to deploy the instance of the object. 9 . The system of claim 8 , wherein, to scale up the application, the orchestration service determines the number of instances of the object to scale up, edits the meta data to reflect the number of the instances, and adds the instances into an instance registry, from where the instances are scheduled for deployment to the compute cluster. 10 . The system of claim 9 , wherein obtaining the new compute nodes occurs prior to any of the instances being scheduled for deployment and prior to the orchestration service having added all of the instances to the instance registry. 11 . The system of claim 9 , wherein the software instructions comprise further instructions that, upon execution by the one or more processors, cause the one or more processors to: determine that the orchestration service has successfully created at least one object corresponding to the number of objects, wherein the obtaining the one or more new compute nodes is in response to determining that the orchestration service has successfully added at least one instance to the instance registry. 12 . The system of claim 8 , wherein the software instructions comprise further instructions that, upon execution by the one or more processors, cause the one or more processors to: predict, based on the number of objects indicated in the meta data, that current compute nodes in the compute cluster are not sufficient to accommodate the instances, wherein the obtaining the one or more new compute nodes is in response to determining that the current compute nodes are not sufficient. 13 . The system of claim 8 , wherein the software instructions comprise further instructions that, upon execution by the one or more processors, cause the one or more processors to: generate a user interface for display to an owner of the application, wherein the user interface comprises an element allowing the owner to enable a predictive scaling feature of the node management service, wherein the predictive scaling feature includes the obtaining the one or more new compute nodes based on the number of objects. 14 . The system of claim 8 , wherein the orchestration service comprises Kubernetes, wherein the compute cluster comprises a Kubernetes cluster, and wherein the number of instances corresponds to a desired number of pod replicas in the Kubernetes cluster. 15 . A computer-readable storage media having program instructions stored thereon to operate a node management service, wherein the program instructions, upon execution by one or more processors, cause the one or more processors to: generate a user interface for display to an owner of an application deployed on compute nodes in a compute cluster, the compute nodes being provided by a compute provider and managed by the node management service, wherein the user interface comprises an element allowing the owner to enable a predictive scaling feature of the node management service; in response to a scale-up event, determine whether the predictive scaling feature is enabled for the application; and in response to determining that the predictive scaling feature is enabled for the application, apply the predictive scaling feature to the application. 16 . The computer-readable storage media of claim 15 wherein the program instructions further cause the one or more processors to: receive, from an orchestration service, meta data indicating a number of instances of an object to scale up with respect to the application; obtain, from the compute provider, one or more new compute nodes for the compute cluster based on the number of instances indicated in the meta data; after obtaining the new compute nodes, receive a request from the orchestration service to deploy at least an instance of the object in the compute cluster; and provide, to the orchestration service in response to the request, an identification of one of the one or more new compute nodes o

Assignees

Inventors

Classifications

  • Workload prediction · CPC title

  • Partitioning or combining of resources · CPC title

  • considering the load · CPC title

  • Logical partitioning of resources; Management or configuration of virtualized resources (specific details on emulation or internal functioning of virtual machines G06F9/455) · CPC title

  • G06F9/5072Primary

    Grid computing · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2025328391A1 cover?
The disclosure describes a node management service that proactively scales up compute nodes in a compute cluster. The node management service interfaces with an orchestration service, a compute provider and a compute cluster running instances of an object. The node management service receives meta data from an orchestration service indicating the desired number of instances of an object. Based …
Who is the assignee on this patent?
Netapp Inc
What technology area does this patent fall under?
Primary CPC classification G06F9/5072. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Oct 23 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).