What technology area does this patent fall under?

Primary CPC classification G06N20/00. Mapped technology areas include Physics.

When was this patent published?

Publication date Thu Mar 23 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).

Training data generation via reinforcement learning fault-injection

US2023087837A1 · US · A1

Patent metadata
Field	Value
Publication number	US-2023087837-A1
Application number	US-202117482068-A
Country	US
Kind code	A1
Filing date	Sep 22, 2021
Priority date	Sep 22, 2021
Publication date	Mar 23, 2023
Grant date	—

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems/techniques for generating training data via reinforcement learning fault-injection are provided. A system can access a computing application. In various aspects, the system can train one or more machine learning models based on responses of the computing application to iterative fault-injections determined via reinforcement learning. More specifically, the system can: inject a first fault into the computing application; record a resultant dataset outputted by the computing application in response to the first fault; train the one or more machine learning models on the resultant dataset and the first fault; compute a reinforcement learning reward based on performance metrics of the one or more machine learning models and based on a quantity of the resultant dataset; update, via execution of a reinforcement learning algorithm, the fault-injection policy based on the reinforcement learning reward; and inject a second fault into the computing application, based on the updated fault-injection policy.

First claim

Opening claim text (preview).

What is claimed is: 1 . A system, comprising: a processor that executes computer-executable components stored in a computer-readable memory, the computer-executable components comprising: a transceiver component that accesses a computing application; and a training component that trains one or more machine learning models based on responses of the computing application to iterative fault-injections that are determined via reinforcement learning. 2 . The system of claim 1 , wherein the computer-executable components further comprise: a fault-injection component that injects a first fault into the computing application, based on a fault-injection policy. 3 . The system of claim 2 , wherein the computer-executable components further comprise: a logging component that records a resultant dataset outputted by the computing application in response to the first fault. 4 . The system of claim 3 , wherein the training component trains the one or more machine learning models on the resultant dataset and the first fault. 5 . The system of claim 4 , wherein the computer-executable components further comprise: a reward component that evaluates one or more performance metrics of the one or more machine learning models after training, that evaluates a quantity of the resultant dataset, and that computes a reinforcement learning reward based on the one or more performance metrics and the quantity. 6 . The system of claim 5 , wherein the computer-executable components further comprise: an update component that updates, via execution of a reinforcement learning algorithm, the fault-injection policy based on the reinforcement learning reward. 7 . The system of claim 6 , wherein the fault-injection component injects a second fault into the computing application, based on the updated fault-injection policy. 8 . A computer-implemented method, comprising: accessing, by a device operatively coupled to a processor, a computing application; and training, by the device, one or more machine learning models based on responses of the computing application to iterative fault-injections that are determined via reinforcement learning. 9 . The computer-implemented method of claim 8 , wherein the training the one or more machine learning models based on responses of the computing application to iterative fault-injections includes: injecting, by the device, a first fault into the computing application, based on a fault-injection policy. 10 . The computer-implemented method of claim 9 , wherein the training the one or more machine learning models based on responses of the computing application to iterative fault-injections further includes: recording, by the device, a resultant dataset outputted by the computing application in response to the first fault. 11 . The computer-implemented method of claim 10 , wherein the training the one or more machine learning models based on responses of the computing application to iterative fault-injections further includes: training, by the device, the one or more machine learning models on the resultant dataset and the first fault. 12 . The computer-implemented method of claim 11 , wherein the training the one or more machine learning models based on responses of the computing application to iterative fault-injections further includes: evaluating, by the device, one or more performance metrics of the one or more machine learning models after training; evaluating, by the device, a quantity of the resultant dataset; and computing, by the device, a reinforcement learning reward based on the one or more performance metrics and the quantity. 13 . The computer-implemented method of claim 12 , wherein the training the one or more machine learning models based on responses of the computing application to iterative fault-injections further includes: updating, by the device and via execution of a reinforcement learning algorithm, the fault-injection policy based on the reinforcement learning reward. 14 . The computer-implemented method of claim 13 , wherein the training the one or more machine learning models based on responses of the computing application to iterative fault-injections further includes: injecting, by the device, a second fault into the computing application, based on the updated fault-injection policy. 15 . A computer program product for facilitating training data generation via reinforcement learning fault-injection, the computer program product comprising a computer-readable memory having program instructions embodied therewith, the program instructions executable by a processor to cause the processor to: access, by the processor, a computing application; and train, by the processor, one or more machine learning models based on responses of the computing application to iterative fault-injections that are determined via reinforcement learning. 16 . The computer program product of claim 15 , wherein the processor trains the one or more machine learning models based on responses of the computing application to iterative fault-injections by: injecting, by the processor, a first fault into the computing application, based on a fault-injection policy. 17 . The computer program product of claim 16 , wherein the processor trains the one or more machine learning models based on responses of the computing application to iterative fault-injections by: recording, by the processor, a resultant dataset outputted by the computing application in response to the first fault. 18 . The computer program product of claim 17 , wherein the processor trains the one or more machine learning models based on responses of the computing application to iterative fault-injections by: training, by the processor, the one or more machine learning models on the resultant dataset and the first fault. 19 . The computer program product of claim 18 , wherein the processor trains the one or more machine learning models based on responses of the computing application to iterative fault-injections by: evaluating, by the processor, one or more performance metrics of the one or more machine learning models after training; evaluating, by the processor, a quantity of the resultant dataset; and computing, by the processor, a reinforcement learning reward based on the one or more performance metrics and the quantity. 20 . The computer program product of claim 19 , wherein the processor trains the one or more machine learning models based on responses of the computing application to iterative fault-injections by: updating, by the processor and via execution of a reinforcement learning algorithm, the fault-injection policy based on the reinforcement learning reward.

Assignees

Inventors

Classifications

G06N20/00Primary
Machine learning · CPC title
G06F11/3476Primary
Data logging (G06F11/14, G06F11/2205 take precedence) · CPC title
G06F18/217
Validation; Performance evaluation; Active pattern learning techniques · CPC title
G06K9/6262
Physics · mapped topic
G06N3/084
Backpropagation, e.g. using gradient descent · CPC title

Patent family

Related publications grouped by family.

View patent family 85572163

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2023087837A1 cover?: Systems/techniques for generating training data via reinforcement learning fault-injection are provided. A system can access a computing application. In various aspects, the system can train one or more machine learning models based on responses of the computing application to iterative fault-injections determined via reinforcement learning. More specifically, the system can: inject a first fau…
Who is the assignee on this patent?: IBM
What technology area does this patent fall under?: Primary CPC classification G06N20/00. Mapped technology areas include Physics.
When was this patent published?: Publication date Thu Mar 23 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).