Management of artificial intelligence development platform

US11455173B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11455173-B2
Application numberUS-202117207268-A
CountryUS
Kind codeB2
Filing dateMar 19, 2021
Priority dateJun 30, 2020
Publication dateSep 27, 2022
Grant dateSep 27, 2022

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for management of an artificial intelligence development platform is provided. The artificial intelligence development platform is deployed with instances of a plurality of model services, and each of the model services is provided with one or more instances. The method includes: acquiring calling information of at least one model service; determining the activity of the at least one model service according to the calling information; and at least deleting all instances of the at least one model service in response to that the determined activity meets a first condition.

First claim

Opening claim text (preview).

The invention claimed is: 1. A method for management of an artificial intelligence, AI, development platform, the AI development platform being deployed with instances of a plurality of AI model services, and each of the plurality of AI model services being provided with one or more instances, wherein the method comprises: acquiring calling information of all instances of at least one AI model service of the plurality of AI model services; determining calling activity of the at least one AI model service according to the calling information; and deleting all instances of the at least one AI model service without deleting a routing configuration of the at least one AI model service in an access layer of the AI development platform in response to the determined calling activity meeting a calling frequency within the predetermined time window. 2. The method of claim 1 , further comprising: in response to determining that the at least one AI model service of which the instances have been deleted is not called within a predetermined period, deleting the routing configuration of the at least one AI model service in the access layer of the AI development platform. 3. The method of claim 1 , wherein deleting all instances of the at least one AI model service comprises: determining whether the at least one AI model service exists in a cluster of the AI development platform; and in response to determining that the at least one AI model service exists in the cluster, switching a backend of the at least one AI model service to a traffic receiving module, wherein the traffic receiving module is used for receiving a model reasoning traffic for the AI model service of which the instances have been deleted. 4. The method of claim 3 , wherein deleting all instances of the at least one AI model service further comprises: continuing to store original data information of the at least one AI model service into the cluster, wherein the original data information at least comprises a first number of the instances deployed on the AI development platform before the instances of the at least one AI model service is deleted. 5. The method of claim 4 , further comprising: in response to the traffic receiving module receives the model reasoning traffic of the at least one AI model service of which the instances have been deleted, triggering a wakeup operation for the at least one AI model service, wherein the wakeup operation comprises: recovering the first number of instances of the at least one AI model service on the AI development platform, and polling states of the recovered instances until the starting of the first number of instances is completed; and switching the backend of the at least one AI model service back to the first number of recovered instances. 6. The method of claim 5 , wherein in the case that the routing configuration of the at least one AI model service in the access layer of the AI development platform has been deleted, the wakeup operation further comprises: reconfiguring a routing rule corresponding to the at least one AI model service. 7. The method of claim 5 , wherein the traffic receiving module has a plurality of instances, wherein when a first instance in the plurality of instances of the traffic receiving module receives the model reasoning traffic of the at least one AI model service of which the instances have been deleted first, the first instance acquires a distributed lock resource and triggers the wakeup operation, wherein the first instance is capable of preventing other instances in the plurality of instances of the traffic receiving module from triggering the wakeup operation when the first instance possesses the distributed lock resource. 8. The method of claim 1 , wherein acquiring the calling information of the at least one AI model service comprises: acquiring the creation time of the at least one AI model service; and acquiring at least one of the following information of the at least one AI model service by analyzing traffic logs related to all the instances of the at least one AI model service: the most recent calling time, the number of calling within a predetermined time window and calling time distribution within the predetermined time window. 9. The method of claim 1 , wherein the AI development platform is a single cluster. 10. A computing system for management of an artificial intelligence, AI, development platform, the AI development platform being deployed with instances of a plurality of AI model services, and each of the plurality of AI model services being provided with one or more instances, wherein the computing system comprises: one or more processors; and a non-transitory memory that stores a program, the program comprising instructions that, when executed by the one or more processors, cause the one or more processors to: acquire calling information of all instances of at least one AI model service of the plurality of AI model services; determine calling activity of the at least one AI model service according to the calling information; and delete all instances of the at least one AI model service without deleting a routing configuration of the at least one AI model service in an access layer of the AI development platform in response to the determined calling activity meeting a calling frequency within the predetermined time window. 11. The computing system of claim 10 , wherein the instructions further cause the one or more processors to: in response to determining that the at least one AI model service of which the instances have been deleted is not called within a predetermined period, delete the routing configuration of the at least one AI model service in the access layer of the AI development platform. 12. The computing system of claim 10 , wherein deleting all instances of the at least one AI model service comprises: determining whether the at least one AI model service exists in a cluster of the AI development platform system; and in response to determining that the at least one AI model service exists in the cluster, switching a backend of the at least one AI model service to a traffic receiving module, wherein the traffic receiving module is used for receiving a model reasoning traffic for the AI model service of which the instances have been deleted. 13. The computing system of claim 12 , wherein deleting all instances of the at least one AI model service further comprises: continuing to store original data information of the at least one AI model service into the cluster, wherein the original data information at least comprises a first number of the instances deployed on the AI development platform before the instances of the at least one AI model service is deleted. 14. The computing system of claim 13 , wherein the instructions further cause the one or more processors to: in response to the traffic receiving module receives the model reasoning traffic of the at least one AI model service of which the instances have been deleted, trigger a wakeup operation for at least one AI model service, wherein the wakeup operation comprises: recovering the first number of instances of the at least one AI model service on the AI development platform, and poll states of the recovered instances until the starting of the first number of instances is completed; and switching the backend of the at least one AI model service back to the first number of recovered instances. 15. The computing system of claim 14 , wherein in the case that the routing configuration of the at least one AI model service in the access layer of the AI development platform has b

Assignees

Inventors

Classifications

  • H04L67/34Primary

    involving the movement of software or configuration parameters  (network booting or remote initial program loading [RIPL] G06F9/4416) · CPC title

  • Configuring for program initiating, e.g. using registry, configuration files · CPC title

  • Generating training patterns; Bootstrap methods, e.g. bagging or boosting · CPC title

  • Automatic deployment of services triggered by the service manager, e.g. service implementation by automatic configuration of network components · CPC title

  • Intermediate processing functionally located close to the data provider application, e.g. reverse proxies · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11455173B2 cover?
A method for management of an artificial intelligence development platform is provided. The artificial intelligence development platform is deployed with instances of a plurality of model services, and each of the model services is provided with one or more instances. The method includes: acquiring calling information of at least one model service; determining the activity of the at least one m…
Who is the assignee on this patent?
Beijing Baidu Netcom Sci & Tech Co Ltd
What technology area does this patent fall under?
Primary CPC classification H04L67/34. Mapped technology areas include Electricity.
When was this patent published?
Publication date Tue Sep 27 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 3 related publications on this page (citations in our corpus or others sharing the same primary CPC).