Method for reproducibility of deep learning classifiers using ensembles

US11574166B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11574166-B2
Application numberUS-202016886344-A
CountryUS
Kind codeB2
Filing dateMay 28, 2020
Priority dateMay 28, 2020
Publication dateFeb 7, 2023
Grant dateFeb 7, 2023

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Example implementations described herein involve systems and methods for generating an ensemble of deep learning or neural network models, which can involve, for a training set of data, generating a plurality of model samples for the training set of data, the plurality of model samples generated from deep learning or neural network methods; and aggregating output of the model samples to generate an output of the ensemble models.

First claim

Opening claim text (preview).

What is claimed is: 1. A method of generating an ensemble of deep learning or neural network models, the method comprising: for a training set of data: generating a plurality of model samples for the training set of data, the plurality of model samples generated from deep learning or neural network methods; and aggregating output of the model samples to generate an output of the ensemble models, wherein the generating the plurality of model samples for the training set of data comprises: executing an inference process on the plurality of model samples to generate additional model samples through passing inputs on the plurality of model samples; and employing a dropout process on the plurality of model samples and the additional model samples to obtain a subset of model samples as the plurality of model samples, the dropout process configured to reduce the plurality of model samples and the additional model samples to the subset of the model samples based on validation accuracy against the training set of data. 2. The method of claim 1 , wherein the generating the plurality of model samples for the training set of data further comprises executing a distributed training process during a training phase of the plurality of model samples, the distributed training process training each of the plurality of model samples across separate servers, each of the separate servers configured to generate a model sample. 3. The method of claim 1 , wherein the plurality of model samples are predictive maintenance models, and wherein the output is a maintenance recommendation. 4. A non-transitory computer readable medium, storing instructions of generating an ensemble of deep learning or neural network models, the instructions comprising: for a training set of data: generating a plurality of model samples for the training set of data, the plurality of model samples generated using sampling of trained learners generated from deep learning or neural network methods; and aggregating output of the model samples to generate an output of the ensemble models, wherein generating the plurality of model samples for the training set of data comprises: executing an inference process on the plurality of model samples to generate additional model samples through passing inputs on the plurality of model samples; and employing a dropout process on the plurality of model samples and the additional model samples to obtain a subset of model samples as the plurality of model samples, the dropout process configured to reduce the plurality of model samples and the additional model samples to the subset of the model samples based on validation accuracy against the training set of data. 5. The non-transitory computer readable medium of claim 4 , wherein the generating the plurality of model samples for the training set of data further comprises executing a distributed training process during a training phase of the plurality of model samples, the distributed training process training each of the plurality of model samples across separate servers, each of the separate servers configured to generate a model sample. 6. The non-transitory computer readable medium of claim 4 , wherein the plurality of model samples are predictive maintenance models, and wherein the output is a maintenance recommendation. 7. An apparatus configured to generate an ensemble of deep learning or neural network models, the apparatus comprising: a processor, configured to: for a training set of data: generate a plurality of model samples for the training set of data, the plurality of model samples generated from deep learning or neural network methods; and aggregate output of the model samples to generate an output of the ensemble models, wherein the processor is configured to generate the plurality of model samples for the training set of data by, executing an inference process on the plurality of model samples to generate additional model samples through passing inputs on the plurality of model samples; and employing a dropout process on the plurality of model samples and the additional model samples to obtain a subset of model samples as the plurality of model samples, the dropout process configured to reduce the plurality of model samples and the additional model samples to the subset of the model samples based on validation accuracy against the training set of data. 8. The apparatus of claim 7 , wherein the processor is configured to generate the plurality of model samples for the training set of data by further executing a distributed training process during a training phase of the plurality of model samples, the distributed training process training each of the plurality of model samples across separate servers, each of the separate servers configured to generate a model sample. 9. The apparatus of claim 7 , wherein the plurality of model samples are predictive maintenance models, and wherein the output is a maintenance recommendation.

Assignees

Inventors

Classifications

  • Distributed learning, e.g. federated learning · CPC title

  • Supervised learning · CPC title

  • Convolutional networks [CNN, ConvNet] · CPC title

  • characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title

  • modifying the architecture, e.g. adding, deleting or silencing nodes or connections · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11574166B2 cover?
Example implementations described herein involve systems and methods for generating an ensemble of deep learning or neural network models, which can involve, for a training set of data, generating a plurality of model samples for the training set of data, the plurality of model samples generated from deep learning or neural network methods; and aggregating output of the model samples to generat…
Who is the assignee on this patent?
Hitachi Ltd
What technology area does this patent fall under?
Primary CPC classification G06N3/045. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Feb 07 2023 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 1 related publication on this page (citations in our corpus or others sharing the same primary CPC).