Dynamic horizontal autoscaling for multi-tenant service

US12341861B2 · US · B2

Patent metadata
Field	Value
Publication number	US-12341861-B2
Application number	US-202218066339-A
Country	US
Kind code	B2
Filing date	Dec 15, 2022
Priority date	Dec 15, 2022
Publication date	Jun 24, 2025
Grant date	Jun 24, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Embodiments relate to dynamic horizontal autoscaling for a multi-tenant service. According to an aspect, a computer-implemented method includes configuring a network computing element to route a plurality of requests from two or more clients to a service in a containerized-networked system. A horizontal service-level autoscaler monitors one or more aspects of the requests to the service from the two or more clients. A new pod including a set of running containers with shared resources in the containerized-networked system is created based at least in part on the horizontal service-level autoscaler determining that at least one of the one or more aspects of the requests exceeds a threshold. A new service mapping is created to accept subsequent requests from one of the two or more clients that triggered creation of the new pod.
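
The flow described in the abstract can be sketched in a few lines of Python. This is only an illustrative model, not the patented implementation: the names `Cluster`, `ServiceLevelAutoscaler`, `svc-shared`, and `pod-shared` are hypothetical, and a simple per-client request count stands in for the "one or more aspects of the requests" that the autoscaler monitors.

```python
from dataclasses import dataclass, field


@dataclass
class Cluster:
    """Toy model of the containerized system and its network computing element."""
    # Service name -> names of the pods backing it (the "service mapping").
    services: dict[str, list[str]] = field(default_factory=dict)
    # Client id -> service name (the routing rules); hypothetical structure.
    routes: dict[str, str] = field(default_factory=dict)
    default_service: str = "svc-shared"

    def route(self, client: str) -> str:
        """Return the service name that a client's request is routed to."""
        return self.routes.get(client, self.default_service)


class ServiceLevelAutoscaler:
    """Watches per-client request counts on the shared service (assumed metric)."""

    def __init__(self, cluster: Cluster, threshold: int) -> None:
        self.cluster = cluster
        self.threshold = threshold
        self.request_counts: dict[str, int] = {}

    def observe(self, client: str) -> str:
        """Record one request and return the service that handled it."""
        service = self.cluster.route(client)
        if service != self.cluster.default_service:
            return service  # client was already moved to its own mapping
        count = self.request_counts.get(client, 0) + 1
        self.request_counts[client] = count
        if count > self.threshold:
            self.scale_out(client)
        return service

    def scale_out(self, client: str) -> None:
        """Create a new pod and a new service mapping for the triggering client."""
        new_pod = f"pod-{client}"
        new_service = f"svc-{client}"
        shared_pods = self.cluster.services[self.cluster.default_service]
        # The new mapping covers the existing shared pod(s) plus the new pod.
        self.cluster.services[new_service] = [*shared_pods, new_pod]
        # Subsequent requests from this client use the new service name.
        self.cluster.routes[client] = new_service


cluster = Cluster(services={"svc-shared": ["pod-shared"]})
scaler = ServiceLevelAutoscaler(cluster, threshold=3)
for _ in range(4):
    scaler.observe("client-a")  # the fourth request exceeds the threshold
scaler.observe("client-b")      # stays on the shared service
```

After this run, `cluster.route("client-a")` returns `svc-client-a`, while `client-b` is still routed to `svc-shared`. Both clients keep calling the same entry point; only the routing rule behind it changes.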

First claim

Opening claim text (preview).

What is claimed is:

1. A computer-implemented method comprising: configuring a network computing element to route a plurality of requests from two or more clients to a service in a containerized-networked system, wherein the two or more clients use the same service address to request the service through the network computing element, and the network computing element uses routing rules to route each of the client requests to a particular client-specific variation of a plurality of client-specific variations of the service each having a different service name that represents the service; monitoring, by a horizontal service-level autoscaler, one or more aspects of the requests to the service from the two or more clients that are received through the network computing element and routed by the network computing element to a first variation of the service using a first service name of the service, wherein the first variation of the service is mapped to a first pod that is a shared pod of all of the two or more clients; creating a new pod comprising a set of running containers with shared resources in the containerized-networked system, the creating based at least in part on the horizontal service-level autoscaler determining that at least one of the one or more aspects of the requests exceeds a threshold; creating a new service mapping comprising a second variation of the service having a second service name that maps to the first pod and to the new pod, wherein the new pod is a dedicated pod of the client that triggered creation of the new pod, wherein the new pod may be converted from the dedicated pod to a partially shared pod that is shared by more than one of the clients and shared by fewer than all of the clients; and updating routing rules in the network computing element by mapping the client that triggered creation of the new pod to the second service name instead of the first service name, thereby causing the network computing element to route subsequent requests from the client that triggered creation of the new pod to the second variation of the service instead of the first variation of the service.

2. The computer-implemented method of claim 1, wherein the one or more aspects of the requests are checked with respect to a plurality of predefined rules based at least in part on one or more of memory usage and processing resource usage.

3. The computer-implemented method of claim 1, further comprising: using a label selector to bind a relationship between the new service mapping and the new pod.

4. The computer-implemented method of claim 1, further comprising: continuing to perform horizontal scaling at a service level for each new client that accesses the network computing element.

5. A system comprising: a memory; and a processing device coupled to the memory, the processing device configured to execute instructions to: configure a network computing element to route a plurality of requests from two or more clients to a service in a containerized-networked system, wherein the two or more clients use the same service address to request the service through the network computing element, and the network computing element uses routing rules to route each of the client requests to a particular client-specific variation of a plurality of client-specific variations of the service each having a different service name that represents the service; monitor, by a horizontal service-level autoscaler, one or more aspects of the requests to the service from the two or more clients that are received through the network computing element and routed by the network computing element to a first variation of the service using a first service name of the service, wherein the first variation of the service is mapped to a first pod that is a shared pod of all of the two or more clients; create a new pod comprising a set of running containers with shared resources in the containerized-networked system, the creating based at least in part on the horizontal service-level autoscaler determining that at least one of the one or more aspects of the requests exceeds a threshold; create a new service mapping comprising a second variation of the service having a second service name that maps to the first pod and to the new pod, wherein the new pod is a dedicated pod of the client that triggered creation of the new pod, wherein the new pod may be converted from the dedicated pod to a partially shared pod that is shared by more than one of the clients and shared by fewer than all of the clients; and update routing rules in the network computing element by mapping the client that triggered creation of the new pod to the second service name instead of the first service name, thereby causing the network computing element to route subsequent requests from the client that triggered creation of the new pod to the second variation of the service instead of the first variation of the service.

6. The system of claim 5, wherein the one or more aspects of the requests are checked with respect to a plurality of predefined rules based at least in part on one or more of memory usage and processing resource usage.

7. The system of claim 5, wherein the processing device and the memory are further configured to use a label selector to bind a relationship between the new service mapping and the new pod.

8. The system of claim 5, wherein the processing device and the memory are further configured to continue to perform horizontal scaling at a service level for each new client that accesses the network computing element.

9. A computer program product comprising a computer readable storage medium having program instructions embodied therewith, the program instructions executable by one or more processors to cause the one or more processors to perform operations comprising: configuring a network computing element to route a plurality of requests from two or more clients to a service in a containerized-networked system, wherein the two or more clients use the same service address to request the service through the network computing element, and the network computing element uses routing rules to route each of the client requests to a particular client-specific variation of a plurality of client-specific variations of the service each having a different service name that represents the service; monitoring, by a horizontal service-level autoscaler, one or more aspects of the requests to the service from the two or more clients that are received through the network computing element and routed by the network computing element to a first variation of the service using a first service name of the service, wherein the first variation of the service is mapped to a first pod that is a shared pod of all of the two or more clients; creating a new pod comprising a set of running containers with shared resources in the containerized-networked system, the creating based at least in part on the horizontal service-level autoscaler determining that at least one of the one or more aspects of the requests exceeds a threshold; creating a new service mapping comprising a second variation of the service having a second service name that maps to the first pod and to the new pod, wherein the new pod is a dedicated pod of the client that triggered creation of the new pod, wherein the new pod may be converted from the dedicated pod to a partially shared pod that is shared by more than one of the clients and shared by fewer than all of the clients; and updating routing rules in the network computing element by mapping the client that triggered creation of the new pod to the second service name instead of the first service name, thereby causing the network computing element to route subsequent requests from…
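
Claims 3 and 7 bind the new service mapping to the new pod with a label selector. A minimal Python sketch of one way such a binding could work is shown here, using equality-based matching of the kind found in container orchestrators such as Kubernetes. The label keys (`app`, `pool`, `serves-client-a`) and the pod and service names are hypothetical and do not come from the patent text.

```python
def matches(selector: dict[str, str], labels: dict[str, str]) -> bool:
    """A pod matches when every key/value pair in the selector is in its labels."""
    return all(labels.get(key) == value for key, value in selector.items())


def select_pods(selector: dict[str, str], pods: dict[str, dict[str, str]]) -> list[str]:
    """Return the sorted names of all pods whose labels satisfy the selector."""
    return sorted(name for name, labels in pods.items() if matches(selector, labels))


# Hypothetical pods: the original shared pod and a pod created for client A.
pods = {
    "pod-shared": {"app": "svc", "pool": "shared", "serves-client-a": "true"},
    "pod-client-a": {"app": "svc", "pool": "dedicated", "serves-client-a": "true"},
}

# Two service names for the same service, each with its own selector.
services = {
    "svc-shared": {"app": "svc", "pool": "shared"},
    "svc-client-a": {"app": "svc", "serves-client-a": "true"},
}

first_variation = select_pods(services["svc-shared"], pods)
second_variation = select_pods(services["svc-client-a"], pods)
```

Here `first_variation` contains only `pod-shared`, while `second_variation` contains both `pod-client-a` and `pod-shared`, mirroring the claim language in which the second service name maps to the first pod and to the new pod. Because the binding is expressed through labels, relabeling a pod is enough to change which service names reach it.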

Assignees

Inventors

Classifications

H04L67/61
taking into account QoS or priority requirements · CPC title
H04L43/16
Threshold monitoring · CPC title
H04L67/63 · Primary
Routing a service request depending on the request content or context · CPC title

Patent family

Related publications grouped by family.

View patent family 91472516

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12341861B2 cover?: Embodiments relate to dynamic horizontal autoscaling for a multi-tenant service. According to an aspect, a computer-implemented method includes configuring a network computing element to route a plurality of requests from two or more clients to a service in a containerized-networked system. A horizontal service-level autoscaler monitors one or more aspects of the requests to the service from th…
Who is the assignee on this patent?: IBM
What technology area does this patent fall under?: Primary CPC classification H04L67/63. Mapped technology areas include Electricity.
When was this patent published?: Publication date Jun 24, 2025 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Related patents

Automatic cluster scaling based on varying workloads

SaaS infrastructure for flexible multi-tenancy

Using a container orchestration service for dynamic routing

Scaling a cloud infrastructure