Who is the assignee on this patent?

Beijing Baidu Netcom Sci & Tech Co Ltd

What technology area does this patent fall under?

Primary CPC classification H04L67/34. Mapped technology areas include Electricity.

When was this patent published?

Publication date Tue Sep 27 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Management of artificial intelligence development platform

US11455173B2 · US · B2

Patent metadata
Field	Value
Publication number	US-11455173-B2
Application number	US-202117207268-A
Country	US
Kind code	B2
Filing date	Mar 19, 2021
Priority date	Jun 30, 2020
Publication date	Sep 27, 2022
Grant date	Sep 27, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for management of an artificial intelligence development platform is provided. The artificial intelligence development platform is deployed with instances of a plurality of model services, and each of the model services is provided with one or more instances. The method includes: acquiring calling information of at least one model service; determining the activity of the at least one model service according to the calling information; and at least deleting all instances of the at least one model service in response to that the determined activity meets a first condition.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method for management of an artificial intelligence, AI, development platform, the AI development platform being deployed with instances of a plurality of AI model services, and each of the plurality of AI model services being provided with one or more instances, wherein the method comprises: acquiring calling information of all instances of at least one AI model service of the plurality of AI model services; determining calling activity of the at least one AI model service according to the calling information; and deleting all instances of the at least one AI model service without deleting a routing configuration of the at least one AI model service in an access layer of the AI development platform in response to the determined calling activity meeting a calling frequency within the predetermined time window. 2. The method of claim 1 , further comprising: in response to determining that the at least one AI model service of which the instances have been deleted is not called within a predetermined period, deleting the routing configuration of the at least one AI model service in the access layer of the AI development platform. 3. The method of claim 1 , wherein deleting all instances of the at least one AI model service comprises: determining whether the at least one AI model service exists in a cluster of the AI development platform; and in response to determining that the at least one AI model service exists in the cluster, switching a backend of the at least one AI model service to a traffic receiving module, wherein the traffic receiving module is used for receiving a model reasoning traffic for the AI model service of which the instances have been deleted. 4. The method of claim 3 , wherein deleting all instances of the at least one AI model service further comprises: continuing to store original data information of the at least one AI model service into the cluster, wherein the original data information at least comprises a first number of the instances deployed on the AI development platform before the instances of the at least one AI model service is deleted. 5. The method of claim 4 , further comprising: in response to the traffic receiving module receives the model reasoning traffic of the at least one AI model service of which the instances have been deleted, triggering a wakeup operation for the at least one AI model service, wherein the wakeup operation comprises: recovering the first number of instances of the at least one AI model service on the AI development platform, and polling states of the recovered instances until the starting of the first number of instances is completed; and switching the backend of the at least one AI model service back to the first number of recovered instances. 6. The method of claim 5 , wherein in the case that the routing configuration of the at least one AI model service in the access layer of the AI development platform has been deleted, the wakeup operation further comprises: reconfiguring a routing rule corresponding to the at least one AI model service. 7. The method of claim 5 , wherein the traffic receiving module has a plurality of instances, wherein when a first instance in the plurality of instances of the traffic receiving module receives the model reasoning traffic of the at least one AI model service of which the instances have been deleted first, the first instance acquires a distributed lock resource and triggers the wakeup operation, wherein the first instance is capable of preventing other instances in the plurality of instances of the traffic receiving module from triggering the wakeup operation when the first instance possesses the distributed lock resource. 8. The method of claim 1 , wherein acquiring the calling information of the at least one AI model service comprises: acquiring the creation time of the at least one AI model service; and acquiring at least one of the following information of the at least one AI model service by analyzing traffic logs related to all the instances of the at least one AI model service: the most recent calling time, the number of calling within a predetermined time window and calling time distribution within the predetermined time window. 9. The method of claim 1 , wherein the AI development platform is a single cluster. 10. A computing system for management of an artificial intelligence, AI, development platform, the AI development platform being deployed with instances of a plurality of AI model services, and each of the plurality of AI model services being provided with one or more instances, wherein the computing system comprises: one or more processors; and a non-transitory memory that stores a program, the program comprising instructions that, when executed by the one or more processors, cause the one or more processors to: acquire calling information of all instances of at least one AI model service of the plurality of AI model services; determine calling activity of the at least one AI model service according to the calling information; and delete all instances of the at least one AI model service without deleting a routing configuration of the at least one AI model service in an access layer of the AI development platform in response to the determined calling activity meeting a calling frequency within the predetermined time window. 11. The computing system of claim 10 , wherein the instructions further cause the one or more processors to: in response to determining that the at least one AI model service of which the instances have been deleted is not called within a predetermined period, delete the routing configuration of the at least one AI model service in the access layer of the AI development platform. 12. The computing system of claim 10 , wherein deleting all instances of the at least one AI model service comprises: determining whether the at least one AI model service exists in a cluster of the AI development platform system; and in response to determining that the at least one AI model service exists in the cluster, switching a backend of the at least one AI model service to a traffic receiving module, wherein the traffic receiving module is used for receiving a model reasoning traffic for the AI model service of which the instances have been deleted. 13. The computing system of claim 12 , wherein deleting all instances of the at least one AI model service further comprises: continuing to store original data information of the at least one AI model service into the cluster, wherein the original data information at least comprises a first number of the instances deployed on the AI development platform before the instances of the at least one AI model service is deleted. 14. The computing system of claim 13 , wherein the instructions further cause the one or more processors to: in response to the traffic receiving module receives the model reasoning traffic of the at least one AI model service of which the instances have been deleted, trigger a wakeup operation for at least one AI model service, wherein the wakeup operation comprises: recovering the first number of instances of the at least one AI model service on the AI development platform, and poll states of the recovered instances until the starting of the first number of instances is completed; and switching the backend of the at least one AI model service back to the first number of recovered instances. 15. The computing system of claim 14 , wherein in the case that the routing configuration of the at least one AI model service in the access layer of the AI development platform has b

Assignees

Beijing Baidu Netcom Sci & Tech Co Ltd

Inventors

Classifications

H04L67/34Primary
involving the movement of software or configuration parameters (network booting or remote initial program loading [RIPL] G06F9/4416) · CPC title
G06F9/44505Primary
Configuring for program initiating, e.g. using registry, configuration files · CPC title
G06F18/214
Generating training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title
H04L41/5054
Automatic deployment of services triggered by the service manager, e.g. service implementation by automatic configuration of network components · CPC title
H04L67/2895
Intermediate processing functionally located close to the data provider application, e.g. reverse proxies · CPC title

Patent family

Related publications grouped by family.

View patent family 72760076

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11455173B2 cover?: A method for management of an artificial intelligence development platform is provided. The artificial intelligence development platform is deployed with instances of a plurality of model services, and each of the model services is provided with one or more instances. The method includes: acquiring calling information of at least one model service; determining the activity of the at least one m…
Who is the assignee on this patent?: Beijing Baidu Netcom Sci & Tech Co Ltd
What technology area does this patent fall under?: Primary CPC classification H04L67/34. Mapped technology areas include Electricity.
When was this patent published?: Publication date Tue Sep 27 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).

How to read this patent

Abstract

First claim

Assignees

Inventors

Classifications

Patent family

External sources

Related patents

Utilizing artificial intelligence to improve productivity of software development and information technology operations (devops)

Network-accessible machine learning model training and hosting system

Leveraging microservice containers to provide tenant isolation in a multi-tenant api gateway

Frequently asked questions