User acceptance test system for machine learning systems

US2022300754A1 · US · A1

Patent metadata
FieldValue
Publication numberUS-2022300754-A1
Application numberUS-202117203921-A
CountryUS
Kind codeA1
Filing dateMar 17, 2021
Priority dateMar 17, 2021
Publication dateSep 22, 2022
Grant date

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods, systems, and computer-readable storage media for receiving, by a ML application executing within a cloud platform, a first inference request, the first inference request including first inference data, transmitting, by the ML application, the first inference data to the UAT system within the cloud platform, retrieving, by the UAT system, a first ML model in response to the inference request, the first ML model being in an inactive state, providing, by the UAT system, a first inference based on the first inference data using the first ML model, providing a first accuracy evaluation at least partially based on the first inference, and transitioning the first ML model from the inactive state to an active state, the first ML model being used for production in the active state.

First claim

Opening claim text (preview).

What is claimed is: 1 . A computer-implemented method for selectively deploying machine learning (ML) models to production using a user acceptance test (UAT) system, the method comprising: receiving, by a ML application executing within a cloud platform, a first inference request, the first inference request comprising first inference data; transmitting, by the ML application, the first inference data to the UAT system within the cloud platform; retrieving, by the UAT system, a first ML model in response to the inference request, the first ML model being in an inactive state; providing, by the UAT system, a first inference based on the first inference data using the first ML model; providing a first accuracy evaluation at least partially based on the first inference; and transitioning the first ML model from the inactive state to an active state, the first ML model being used for production in the active state. 2 . The method of claim 1 , further comprising: generating, by the ML application, a second inference based on the first inference data using a second ML model in parallel with generating the first inference, the second ML model being in the active state; and replacing the second ML model with the first ML model for subsequent production use in response to transitioning the first ML model to the active state. 3 . The method of claim 2 , wherein the first ML model is an updated version of the second ML model. 4 . The method of claim 1 , wherein the first accuracy evaluation comprises: determining an accuracy of the first ML model that represents correct inferences of the first ML model; and comparing the accuracy of the first ML model to a threshold accuracy. 5 . The method of claim 1 , wherein providing a first accuracy evaluation is executed in response to occurrence of a polling condition. 6 . The method of claim 1 , wherein the first inference data comprises production data. 7 . The method of claim 1 , further comprising: retrieving, by the UAT system, a second ML model in response to a second inference request, the second ML model being in an inactive state; providing, by the UAT system, a second inference based on second inference data of the second inference request using the second ML model; determining a second accuracy evaluation at least partially based on the second inference; and transmitting an alert regarding the second ML model in response to the second accuracy evaluation. 8 . A non-transitory computer-readable storage medium coupled to one or more processors and having instructions stored thereon which, when executed by the one or more processors, cause the one or more processors to perform operations for selectively deploying machine learning (ML) models to production using a user acceptance test (UAT) system, the operations comprising: receiving, by a ML application executing within a cloud platform, a first inference request, the first inference request comprising first inference data; transmitting, by the ML application, the first inference data to the UAT system within the cloud platform; retrieving, by the UAT system, a first ML model in response to the inference request, the first ML model being in an inactive state; providing, by the UAT system, a first inference based on the first inference data using the first ML model; providing a first accuracy evaluation at least partially based on the first inference; and transitioning the first ML model from the inactive state to an active state, the first ML model being used for production in the active state. 9 . The non-transitory computer-readable storage medium of claim 8 , wherein operations further comprise: generating, by the ML application, a second inference based on the first inference data using a second ML model in parallel with generating the first inference, the second ML model being in the active state; and replacing the second ML model with the first ML model for subsequent production use in response to transitioning the first ML model to the active state. 10 . The non-transitory computer-readable storage medium of claim 9 , wherein the first ML model is an updated version of the second ML model. 11 . The non-transitory computer-readable storage medium of claim 8 , wherein the first accuracy evaluation comprises: determining an accuracy of the first ML model that represents correct inferences of the first ML model; and comparing the accuracy of the first ML model to a threshold accuracy. 12 . The non-transitory computer-readable storage medium of claim 8 , wherein providing a first accuracy evaluation is executed in response to occurrence of a polling condition. 13 . The non-transitory computer-readable storage medium of claim 8 , wherein the first inference data comprises production data. 14 . The non-transitory computer-readable storage medium of claim 8 , wherein operations further comprise: retrieving, by the UAT system, a second ML model in response to a second inference request, the second ML model being in an inactive state; providing, by the UAT system, a second inference based on second inference data of the second inference request using the second ML model; determining a second accuracy evaluation at least partially based on the second inference; and transmitting an alert regarding the second ML model in response to the second accuracy evaluation. 15 . A system, comprising: a computing device; and a computer-readable storage device coupled to the computing device and having instructions stored thereon which, when executed by the computing device, cause the computing device to perform operations for selectively deploying machine learning (ML) models to production using a user acceptance test (UAT) system, the operations comprising: receiving, by a ML application executing within a cloud platform, a first inference request, the first inference request comprising first inference data; transmitting, by the ML application, the first inference data to the UAT system within the cloud platform; retrieving, by the UAT system, a first ML model in response to the inference request, the first ML model being in an inactive state; providing, by the UAT system, a first inference based on the first inference data using the first ML model; providing a first accuracy evaluation at least partially based on the first inference; and transitioning the first ML model from the inactive state to an active state, the first ML model being used for production in the active state. 16 . The system of claim 15 , wherein operations further comprise: generating, by the ML application, a second inference based on the first inference data using a second ML model in parallel with generating the first inference, the second ML model being in the active state; and replacing the second ML model with the first ML model for subsequent production use in response to transitioning the first ML model to the active state. 17 . The system of claim 16 , wherein the first ML model is an updated version of the second ML model. 18 . The system of claim 15 , wherein the first accuracy evaluation comprises: determining an accuracy of the first ML model that represents correct inferences of the first ML model; and comparing the accuracy of the first ML model to a threshold accuracy. 19 . The system of claim 15 , wherein providing a first accuracy evaluation is executed in response to occurrence of a polling condition. 20 . The system of claim 15 , wherein the first infer

Assignees

Inventors

Classifications

  • based on feedback of a supervisor · CPC title

  • G06F18/285Primary

    Selection of pattern recognition techniques, e.g. of classifiers in a multi-classifier system · CPC title

  • G06N20/00Primary

    Machine learning · CPC title

  • Backpropagation, e.g. using gradient descent · CPC title

  • modifying the architecture, e.g. adding, deleting or silencing nodes or connections · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2022300754A1 cover?
Methods, systems, and computer-readable storage media for receiving, by a ML application executing within a cloud platform, a first inference request, the first inference request including first inference data, transmitting, by the ML application, the first inference data to the UAT system within the cloud platform, retrieving, by the UAT system, a first ML model in response to the inference re…
Who is the assignee on this patent?
Sap Se
What technology area does this patent fall under?
Primary CPC classification G06F18/285. Mapped technology areas include Physics.
When was this patent published?
Publication date Thu Sep 22 2022 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).