Techniques for artificial intelligence capabilities at a network switch

US2025097120A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2025097120-A1
Application numberUS-202418966019-A
CountryUS
Kind codeA1
Filing dateDec 2, 2024
Priority dateDec 28, 2018
Publication dateMar 20, 2025
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Examples include techniques for artificial intelligence (AI) capabilities at a network switch. These examples include receiving a request to register a neural network for loading to an inference resource located at the network switch and loading the neural network based on information included in the request to support an AI service to be provided by users requesting the AI service.

First claim

Opening claim text (preview).

What is claimed is: 1 . At least one non-transitory machine readable medium storing instructions to be executed by at least one machine to be associated with a network switch, the network switch being configurable to comprise graphics processing unit (GPU) resource circuitry, data traffic processing circuitry, and management data processing circuitry, the network switch to be used in association with multiple tenants, the instructions, when executed by the at least one machine, resulting in the network switch being configured for performance of operations comprising: receiving, at the network switch, management data and tenant service-related data, the tenant service-related data being configurable to correspond, at least in part, to artificial intelligence (AI) service requests of the multiple tenants, respective portions of the tenant service-related data being associated with respective of the multiple tenants; generating, by the management data processing circuitry, configuration data, the configuration data to be based upon the management data, the configuration data to configure the GPU resource circuitry of the network switch to implement multi-tenant services associated with the respective of the multiple tenants, the multi-tenant services to be accessed by the respective of the multiple tenants based upon the respective portions of the tenant service-related data; and routing, by the data traffic processing circuitry, the respective portions of the tenant service-related data to the GPU resource circuitry of the network switch so as to permit the respective of the multiple tenants to access, based upon the respective portions of the tenant service-related data, the multi-tenant services associated with the respective of the multiple tenants; wherein: the data traffic processing circuitry is configurable to implement load balancing in association with the routing of the respective portions of the tenant service-related data to the GPU resource circuitry of the network switch; the multiple tenants are to be associated with respective tenant identification data; and the data traffic processing circuitry is configurable to implement the routing of the respective portions of the tenant service-related data to the GPU resource circuitry of the network switch based upon priority data. 2 . The at least one non-transitory machine readable medium of claim 1 , wherein: the network switch is to be comprised in a cloud-based network. 3 . The at least one non-transitory machine readable medium of claim 2 , wherein: the data traffic processing circuitry and/or the management data processing circuitry comprise central processing unit (CPU) core circuitry. 4 . The at least one non-transitory machine readable medium of claim 3 , wherein: the GPU resource circuitry of the network switch comprises multiple GPU processing cores. 5 . The at least one non-transitory machine readable medium of claim 4 , wherein: the multi-tenant services are to be implemented via one or more neural networks, one or more inference resources, neural processing, and/or tensor processing to be implemented using the GPU resource circuitry of the network switch. 6 . The at least one non-transitory machine readable medium of claim 5 , wherein: the respective of the multiple tenants are associated with respective service agreements to be implemented, at least in part, using the network switch; the respective service agreements are associated with the multi-tenant services; and the respective tenant identification data is configurable to be associated with respective tenant billing data to be generated based upon providing of the multi-tenant services to the respective of the multiple tenants. 7 . A method to be implemented in association with a network switch, the network switch being configurable to comprise graphics processing unit (GPU) resource circuitry, data traffic processing circuitry, and management data processing circuitry, the network switch to be used in association with multiple tenants, the method comprising: receiving, at the network switch, management data and tenant service-related data, the tenant service-related data being configurable to correspond, at least in part, to artificial intelligence (AI) service requests of the multiple tenants, respective portions of the tenant service-related data being associated with respective of the multiple tenants; generating, by the management data processing circuitry, configuration data, the configuration data to be based upon the management data, the configuration data to configure the GPU resource circuitry of the network switch to implement multi-tenant services associated with the respective of the multiple tenants, the multi-tenant services to be accessed by the respective of the multiple tenants based upon the respective portions of the tenant service-related data; and routing, by the data traffic processing circuitry, the respective portions of the tenant service-related data to the GPU resource circuitry of the network switch so as to permit the respective of the multiple tenants to access, based upon the respective portions of the tenant service-related data, the multi-tenant services associated with the respective of the multiple tenants; wherein: the data traffic processing circuitry is configurable to implement load balancing in association with the routing of the respective portions of the tenant service-related data to the GPU resource circuitry of the network switch; the multiple tenants are to be associated with respective tenant identification data; and the data traffic processing circuitry is configurable to implement the routing of the respective portions of the tenant service-related data to the GPU resource circuitry of the network switch based upon priority data. 8 . The method of claim 7 , wherein: the network switch is to be comprised in a cloud-based network. 9 . The method of claim 8 , wherein: the data traffic processing circuitry and/or the management data processing circuitry comprise central processing unit (CPU) core circuitry. 10 . The method of claim 9 , wherein: the GPU resource circuitry of the network switch comprises multiple GPU processing cores. 11 . The method of claim 10 , wherein: the multi-tenant services are to be implemented via one or more neural networks, one or more inference resources, neural processing, and/or tensor processing to be implemented using the GPU resource circuitry of the network switch. 12 . The method of claim 11 , wherein: the respective of the multiple tenants are associated with respective service agreements to be implemented, at least in part, using the network switch; the respective service agreements are associated with the multi-tenant services; and the respective tenant identification data is configurable to be associated with respective tenant billing data to be generated based upon providing of the multi-tenant services to the respective of the multiple tenants. 13 . Circuitry to implement a network switch, the network switch being configurable to comprise graphics processing unit (GPU) resource circuitry, the network switch to be used in association with multiple tenants, the circuitry to implement the network switch comprising: circuitry to receive, at the network switch, management data and tenant service-related data, the tenant service-related data being configurable to correspond, at least in part, to artificial intelligence (AI) service requests of the multiple tenants, respective portions of the tenant service-related data being associated with respective of the multiple tenants; management

Assignees

Inventors

Classifications

  • Convolutional networks [CNN, ConvNet] · CPC title

  • characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title

  • Quantised networks; Sparse networks; Compressed networks · CPC title

  • the condition being an adaptation, e.g. in response to network events · CPC title

  • Service on demand, e.g. definition and deployment of services in real time · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2025097120A1 cover?
Examples include techniques for artificial intelligence (AI) capabilities at a network switch. These examples include receiving a request to register a neural network for loading to an inference resource located at the network switch and loading the neural network based on information included in the request to support an AI service to be provided by users requesting the AI service.
Who is the assignee on this patent?
Intel Corp
What technology area does this patent fall under?
Primary CPC classification H04L41/16. Mapped technology areas include Electricity.
When was this patent published?
Publication date Thu Mar 20 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).