Systems and methods for active learning from sparse training data

US11710038B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11710038-B2
Application numberUS-202016847098-A
CountryUS
Kind codeB2
Filing dateApr 13, 2020
Priority dateApr 13, 2020
Publication dateJul 25, 2023
Grant dateJul 25, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A method for active learning using sparse training data can include training a machine learning model using less than ten first training data points to generate a candidate machine learning model. The method can include performing a Monte Carlo process to sample one or more first outputs of the candidate machine learning model. The method can include testing the one or more first outputs to determine if each of the one or more first outputs satisfy a respective convergence condition. The method can include, responsive to at least one first output not satisfying the respective convergence condition, training the candidate machine learning model using at least one second training data point corresponding to the at least one first output. The method can include, responsive to the one or more first outputs each satisfying the respective convergence condition, outputting the candidate machine learning model.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for machine learning using sparse training data, comprising: training, by one or more processors, a machine learning model using a first training data point of a plurality of training data points to generate a trained machine learning model, wherein training the machine learning model comprises modifying the machine learning model to satisfy a first convergence condition; selecting, by the one or more processors, a first input to provide to the trained machine learning model, the first input selected from a domain of inputs that includes inputs of the plurality of training data points by performing at least one of a random process or a Monte Carlo process, the first input associated with a first training data output; sampling, by the one or more processors, a first candidate output of the trained machine learning model by providing the first input as input to the trained machine learning model; determining, by the one or more processors, whether the first candidate output satisfies a second convergence condition based on a difference between the first training data output and the first candidate output, the second convergence condition different from the first convergence condition; responsive to the first candidate output not satisfying the second convergence condition, modifying, by the one or more processors, the trained machine learning model using a second training data point of the plurality of training data points, the second training data point comprising the first input and the first training data output; and “responsive to the first candidate output satisfying the second convergence condition, outputting, by the one or more processors, the trained machine learning model. 2. The method of claim 1 , wherein the first training data point is retrieved from a dataset comprising less than one thousand training data points. 3. The method of claim 1 , wherein the machine learning model comprises a neural network comprising an input layer, one or more hidden layers, and an output layer. 4. The method of claim 1 , wherein the first training data output is a reference output of a reference high-fidelity model. 5. The method of claim 1 , wherein training the machine learning model using the first training data point comprises retraining the machine learning model using less than ten training data points. 6. The method of claim 1 , wherein the first training data point comprises at least one of an experimental data point or a synthetic data point from a reference high-fidelity model. 7. The method of claim 1 , wherein the plurality of training data points are from a first subset of data points of a training database, and the domain of inputs comprises a second subset of data points of the training database. 8. The method of claim 1 , wherein: sampling, by the one or more processors, the first candidate output comprises sampling a plurality of first candidate outputs of the trained machine learning model; determining, by the one or more processors, whether the first candidate output satisfies the second convergence condition comprises determining, by the one or more processors, whether each first candidate output of the plurality of first candidate outputs satisfies a respective second convergence condition; and modifying, by the one or more processors, the trained machine learning model comprises retraining the trained machine learning model using one or more second training data points corresponding to one or more first candidate outputs of the plurality of first candidate outputs that did not satisfy the respective second convergence condition. 9. The method of claim 1 , wherein training the machine learning model comprises causing weights and biases of a neural network of the machine learning model to be adjusted until an optimization condition is satisfied. 10. A system, comprising: one or more hardware processors configured to: train a machine learning model using a first training data point of a plurality of training data points to generate a trained machine learning model, wherein training the machine learning model comprises modifying the machine learning model to satisfy a first convergence condition; select a first input to provide to the trained machine learning model, the first input selected from a domain of inputs that includes inputs of the plurality of training data points by performing at least one of a random process or a Monte Carlo process, the first input associated with a first training data output; sample a first candidate output of the trained machine learning model by providing the first input as input to the trained machine learning model; determine whether the first candidate output satisfies a second convergence condition based on a difference between the first training data output and the first candidate output, the second convergence condition different from the first convergence condition; modify, responsive to the first candidate output not satisfying the second convergence condition, the trained machine learning model using a second training data point of the plurality of training data points, the second training data point comprising the first input and the first training data output; and output, responsive to the first candidate output satisfying the second convergence condition, the trained machine learning model. 11. The system of claim 10 , wherein the first training data point is retrieved from a training database comprising less than one thousand training data points. 12. The system of claim 10 , wherein the machine learning model comprises a neural network comprising an input layer, one or more hidden layers, and an output layer. 13. The system of claim 10 , wherein the one or more processors are configured to train the machine learning model using the first training data point by training the machine learning model using less than ten training data points. 14. The system of claim 10 , wherein the first training data point comprises at least one of an experimental data point or a synthetic data point from a reference high-fidelity model. 15. A method, comprising: training a machine learning model using less than ten first training data points of a plurality of training data points to generate a trained machine learning model that satisfies a first convergence condition; performing a Monte Carlo process to select a plurality of first inputs from the plurality of training data points, each first input of the plurality of first inputs corresponding to a first training data output of a plurality of first training data outputs; applying the plurality of first inputs as input to the trained machine learning model to sample a plurality of first candidate outputs of the trained machine learning model; testing the plurality of first candidate outputs to determine if each of the plurality of first candidate outputs satisfies a respective second convergence condition based on differences between the plurality of first candidate outputs and the plurality of first training data outputs, the respective second convergence condition different from the first convergence condition; responsive to at least one first candidate output not satisfying the respective second convergence condition, training the trained machine learning model using at least one second training data point of the plurality of training data points corresponding to the at least one first candidate output, the at least one second training data point comprising at least one first input used as input to the trained machine learning model to sample the at least one first candidate output

Assignees

Inventors

Classifications

  • Feedforward networks · CPC title

  • Active learning · CPC title

  • Supervised learning · CPC title

  • G06N3/08Primary

    Learning methods · CPC title

  • Architecture, e.g. interconnection topology · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11710038B2 cover?
A method for active learning using sparse training data can include training a machine learning model using less than ten first training data points to generate a candidate machine learning model. The method can include performing a Monte Carlo process to sample one or more first outputs of the candidate machine learning model. The method can include testing the one or more first outputs to det…
Who is the assignee on this patent?
Uchicago Argonne Llc
What technology area does this patent fall under?
Primary CPC classification G06N3/08. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jul 25 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).